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I 2) INfORl-lATION FOR SEQ ID NO:i 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1362 base pairs 
(BJ TYPE: nucleic acid 

(C) SmAtWBVUBSS : double 

(D) TOPOLOGY: circular 

Ui) HOLECULE TYPE: DHA (genomic) 

fiii) HYPOTHETICAL: HO 

tiv> AMTl-SEMSE: NO 

fvi) ORIGIHAL SOURCE: 

(A) ORGAIIISM: PORYPHYROI-IONAS GINGIVALIS 

(Ix) FEATURE: 

(A) HA^fE/KEY: misc feature 

(B) LOCATIOM 1. . .1362 

ixi) SEQUENCE DESCRIPTIOtI: SEQ ID HO:l 

TTCTGTGTCA TGGCAAAAGT TATAAAAACA AAAAAAGGCC TTGCACTTAA TCTGAAAGGA 60 

AAACCGCTGC CCGAGATGCT GGCCGAACCG GCCCAAAGTC CTACTTACGC GGTCGTGCCC 120 

GACGATTTl'G AAGGTGTTAT CCCCAAGGTG ACGCCTCGTC CGjGGGATAA GGTGCGTGCC 180 

GGCTCAGCAC TGATGCACCA CAAGGCATAT CCGGAC.ATGA AGTTTACAAG TCCGGTTAGC 24 0 

GGCGAAGTGA TCGCGCTGAA TCGCGGTGCC AAGCGCAAGG TGTTGAGCAT CGAGGTGAAA 300 

CCGGACGGAC TGAAC6AATA CGAGTCATTC CCTGTCGGGG ATCCGTCTGC CCTCTCTGCC 3 SO 

GAACAGATCA AGGAGCTTTT ACTGTCGAGC GGTATGTGGG GTm'ATTAA GCAACGTCCT 420 

TACGACATAG TGGCTACACC GGATATAGCT CCACGCGACA TTTATATTAC TGCCAACTTT 480 

ACTGCACCAT TGGCTCCGGA CTTCGA'ITTC ATCGTTCGAG GAGAAGAACG CGCCCTGCAG 54 0 

ACrCCCATCG ATGCCTTGGC CAAACTCACG ACA6GAAAGG TGTATGTGGG CCTGAAGCCG 600 

GGTTCATCTC TGGGCTTGCA CAATGCAGAA ATCGTAGAAG TACACGGACC TCATCCGGCA 660 

6GTAACGTGG GCGTGCTGAT CAATCATACG AAGCCAATCA ATCGGGGCGA AACGCl'GTGG 720 

ACGCTCAAGG CTACCGACCT GATCGTGATC GGACGTTTCC TGCTTACGGG CAAAGCCGAT 780 

TTTACCAGAA TGATTGCCAT QACCGGCTCA GACGCTGCAG CTCACGGATA CGTCCGTATT 840 

ATGCCGGGTT GCAATGTCTT TGCTTCCTTC CCCGGCCGAC TGACAATAAA GGAATCTCAC 900 

GAGCGTGTGA TCGATGGCAA TGTGCTGACC GGTAAGAAGC TCTGCGAGAA GGAGCCTTTC 960 

CTGTCAGCCC RGTGTGACCA GATCACGGTG ATCCCCGAAG GCGACGATGT GGACGAACTC 1020 

1*TCGGGTGGG CTGCACCCCG TCTCGATCAG TACAGCATGA GCAGAGCTTA TTTCTCTTGG 1080 

TTGCAGGGGA AAAACAAA6A GTACGTACTC GATCCCCGGA TCAAGGGTGG CGAACGTGCT 1140 

ATGATCATGA GCAACGAGTA TGACCGCGTT TTCCCGATGG ACATCTATCC GGAGTATTTG 1200 

CTCAAGGCTA TTATAGCATT CGACATCGAC AAGATGGAGG ACTTAGGCAT ATATGAAGTG 1260 

GCTCCGGAGG ACTTTGCCAC TTGCGAATTT GTGGATACAT CCAAGATCGA GCTGCAGCGT 1320 

ATCGTTCGCG AGGGCTTGGA TATGCTCTAT AAGGAAATGA AT 1362 



(2) IHF0R]1ATT0^4 FOR SEQ ID HO: 2 

(i) SEQUEMCE CHARACTERISTICS: 

(A} LEHGTH: 603 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPbLOG"]^: circular 

(11) MOLECULE TYPE: DMA t genomic! 

(iii( HYPOTHETICAL: HO 

(ivj AIJTI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOHAS GINGiVALIS 

(ixl FEATURE: 

(Al HAJIE/KEY: misc feature 
(B| LOCATION 1...603 

txi) SEQUENCE OESCRIPTIOff: SEQ ID NO: 2 

GAACTAAGCA AGTGTTACAT GGATAAAGTG AGCTATGCTC TGGGATTGAG CATCGGTAAT 60 

AATTTCAAGT CTTCGGGCAT CGACAGCGTC GTTATG'SATG ATTTCATGCA AGGTCTGTCT 120 

GATGTACTGG AAGAAAAAGC CCCTCAGCTC TCCTATGAQG AGGCCAAGCG CGAAATAGAG 100 

GCGTATTTCA TGGATTTGCA GCAGAAGGCT GTGWACTGA ACAAAGAGGC CGGA6AAGAA 240 
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10 (2) INFORMATION FOR StQ ID NO: 3 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 837 base pairs 
-c TYPE: nucleic acid 

tC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: MM (genomic) 

20 (ill) HYPOTHETICAL; NO 

(iv) AJJTI- SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa-IONAS GIJfGIVALIS 
( ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1 . . .837 



(2) INFORHATTON FOR SEQ ID NO: 4 



(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 471 base pairs 
{B) TYPE: nucleic acid 
EC STRAfJDEDNESS: double 

(D^ TOPOLOGY: circular 

(IJ) MOLECULE TYPE: DMA (genomic) 

tiii) HYPOTHETICAL: NO 

(ivj AMTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
65 ORGANISM: PORYPHYROMOl^AS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: mlsc feature 

(B) LOCATION 1. . . 471 

70 fxi) SKQUEWCE DESCRIPTION: SEQ ID NO: 4 

^?^2^G?AG l^C^l^ GgS?^^^^ ATCGGCTG.C GCGTGGAGAC GCAAGAGGGG 
GGGAGCTATC ^^S^^S Sn^J^S^fS? CCTCCGATGA AAGGCAAGGG CGATCTGGTC 
75 CGTTC^??^ I?GC^??? ??Sat??a2 5^^™ CGCTTTCTGA TATTCCGGCT 
v-«-^jii iiATATCTAx TCTCGTATTT CACTGGGTTC GGGCACGCAG 



PCT/AU98/01023 



^CT™ I^GgSSS ^2^^^°^ GTGACGACCT TACCGAGCGG CTTGCAATAC 300 

TATCA^G?A ^^^f^^^ GGGCCCGAAA CCCACCCTTT CGGACACGGT AACCTGTCAT 360 

S???c2 ctSSgaS^ ^"^^^A'^-^^^' CrATGGACAG GGGAG^S^ 420 

GTAGGATcS S^SJ^n^S? AGTTATAGCC GCCTGCACGG AGATTCTTCA ATTAATGCCT 4flO 

§^2^ca5^ ?c^™ ^^^^J^'^^^ agcgatctgg cgtatggaga tcgtggtgS 54S 

OG^GAACATA TCAAACCGCG TAGTACGCTC ATTTTl'ATAA TCGAATTArr GAGTATCAAC 600 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3 

ToS 

is Si^s ss^ 
iSE si 

40 GGCAAGA^ A^Ag5?gg 5^^^^^^^ GCTCTCAAGA CTAGTGTCAA GATTGAAGCC 
?ni^^^i« ACATAGTTGG TTATATTGCT AATACCGCAG GTGGACATCC TATCGGATGT 

^^?C ^l^^ l^r^l^." '^'^"^^^"^ CTATAT^S A«5^^ 
TATGTcS^TC gIS^S^^^*"^ CAAGCAGTTC CTACCTTAAA TTACAACATC 

— — — Tc5 T^^'-^ -° 



60 
] ?0 
180 
240 
300 
360 
420 
480 
540 
600 
660 



60 
120 
180 
240 
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GATGTTCTCC AGCATAGGAT GAAAGTGTAT CCCAACCCTG CCACGACGGA GCTGCATGTG 
GAAGCCCTGT CCGCATGGGT GGGCGAGCAG GCTGCGGTAT ATGATATGCG tS^^g?™ 
GTATCGGCTC GGACGGTGGA TAGCGAGAAG CTGTGCATCG ACATTGC^C Ic^GCcJg^G 
GGCGTCTATA TGCTGCGCAT CGGCAGCTAC TCGGCCAAGT TCGAG^SvG ^""^^^^^^^^ 

(2) INrORIIATIOtJ FOR SEQ ID NO: 5 

ill SEQtJEHCE CHARACTERISTICS: 

(A) LENGTH: 1686 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(li) HOLECULF. TYPE: DNA (genomic) 

(iii) H^'POTHETICAL: NO 

(iv) AtiTI-SEMSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROWONAS GINGIV7VLIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION 1.,.1686 

Uil SEQUENCE DESCRIPTION: SEQ ID NO:S 



300 
360 
420 
471 



ACAAGAAATG 
CTGCTGCTGG 
ATTGGTGGAG 
TTTTAITACA 
CGTATTTATT 
AAGTTAGGGT 
ATTGTAACGG 
TATGTAGGTG 
AATAATACAG 
ATCGCCTCCA 
GOrGTCGCCG 
GATGCTGGAG 
AATAGAGTAG 
CTAATGGGAG 
TCCAACTTTG 
GACTGTGGAT 
TTCGATGACA 
TACCCGGGCC 
ACAAAGAAAG 
GAAGGCGATC 
AAATACGATG 
GCTAAAGATA 
TGTTGGGTAT 
GTAATTATTA 
GACGTATTGC 
CTGATCAACC 
ATCGTGGAGA 
AAAGGTACGT 
GTAQAG 



TTTCAerCAT 
CGACTTTTGC 
ACGATGTATT 
AGATGGCTGT 
TTGATGATGT 
CTTTGGGGTC 
GAAAGGATGA 
GTACTATTGG 
AGTGTGTGTA 
ACTACCGTGC 
TTAGTGGCTC 
TACACTTTGA 
ACCTTTCATT 
TCGTATTCGA 
TGGACTATGA 
GGACTGATTT 
AITCGGATAA 
ATTACGTATA 
ATCTGGTCTT 
GTTATCTGAC 
ACATTAACTC 
AAAAGAGGCG 
GGCATACTCG 
AACATTTCCT 
AGAAGGAGGG 
TACCCAAAGA 
AAGTTTCATT 
ACATGCTGAA 



AAAAATGCCA 
CATAGTTTTA 
GATCGAGACG 
GGCAGACAAT 
CAGGCTGTAT 
TTTGGTGCCT 
AGATGATATC 
CAATGGCGTT 
CAAGAAGGAT 
GCCCTCTCCT 
CGGAAGCGAT 
GCAAAAGCGT 
AGGCAGTACA 
AATGAATAAG 
TCCCCCCTAT 
TAATCCTTTG 
TACCGTGGGT 
TCCGAAGCAA 
TAAACACTGT 
TACTTTTCAA 
TTTTTATGGT 
CCGTCCGCAA 
CAAGAGCCCA 
ATGGTCCGAT 
TAGCATGAAG 
AGGGGGGCAC 
TTCAGGGAAA 
AGTTGTAGCG 



AGAATTATGA 
TTTAGCCCTC 
ATGTCCACCC 
GGATGGATCT 
CGTTCCAAAG 
TATGACTTCG 
AATGTTTGGA 
TTGCTGATGC 
TTCCCTAATA 
TACGGITTGG 
CACAGCTTCT 
ATTTACACAA 
TCTCCTTCTC 
AACCl'TGATG 
GCGTGGTCTG 
GGAGCACTAA 
GGAGAACGCT 
TCTTTCAATT 
ATAGGTATTC 
GATCACAATC 
TGGAGTTCGC 
GTAGCACTCA 
TATGATGAAA 
ACGGAGTGGG 
CTCTACCCCA 
GAGGCACTCG 
GAATATAAGC 
GATACGGAGT 



AATTAAAAAT 
TGGCCAAGGC 
TATCAGGATA 
ATGTGATGl'T 
ACGGTGGTGC 
ATGTCTCGCA 
CAGTCATGAC 
ATCGCCATGA 
ATAGACTGAT 
GGGGC6ATCC 
TGGACTATAT 
GACCCCAAAA 
TTGGATTTAA 
GCTTCGACAT 
AACCGATAAT 
GTATAGAGAT 
CCCATAACTT 
ATTCTCCCGG 
CGGCTTTGGC 
TAATGAGATA 
CATATGTATA 
ATCCTACCAA 
CCAAACCACA 
TACATGCTCT 
ATCCTGCCAA 
TATACGACAT 
TGAATGTGCA 
ATTTCGTGGA 



TGCACTCAGA 
CCAGATGGAT 
TTCAGAGGAT 
GGATTTCTCT 
TACTTACCAA 
TTGCGATTTT 
AGCATTCGAA 
TGCAGATATC 
GGGTGTAGCC 
TTTTGCTCTC 
TTTTTCGTTA 
ACTGACTATC 
TACTTGGCCA 
TGGTTTCATT 
AATAGAAGAA 
CCAAATGATG 
CCTGATCACl' 
ACATACACCG 
ATACGATAAG 
CAGATGGATC 
TGCAAAAGAA 
TGGAAAGGCT 
TCCTACTCCT 
GGACGTGGGG 
AGAATATGTT 
GCAGGGCCGA 
GTATCTGTCC 
AAAAATCATT 



(2) INFORIIATTON FOR SEQ ID NO; 6 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 
to STRANDEDNESS: double 
(D| TOPOLOGY: circular 

(ill MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(Iv) Af^TI -SENSE: UNKNOWN 

(vi) ORIGINAL SOURCE: 



60 
120 
IBO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1686 
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(A) ORGAWISM: poryphyromonajs gingivalis 

(ix) FEATURE: 

(A] NA/IE/KEY: mi sc feature 

5 (B) LocATiorr i . . , 1T73 

(r.i) SEQUENCE DESCRIPl'IW: SEQ ID HO; 6 

CAAATGAAAC GATTACTCCC CTTTCTCCTT TTAGCAGGAC TCGTAGCCGT AGGAAACGTG 60 

10 TCTGCTCA'ST CACCCCGAAT CCCTCAAGTG GATGTACACA CTCGCATCGC AAGAAATGCC 120 

CGTTATCGAC TGGACAAGAT CAGTGTCCCG GATTCTCCTC AGATATTCGA TTACTTCTAT 180 

AAAGAAGAAA CGATACCCAC TAAAATACAA ACGACCACAG GAGGTGCAAT TACAAGCATC 24 0 

GATTCGCTTT TCTATGAAGA CGACACGTTG GTTCAGGTGC GCTATTTTGA CAATAACCTT 300 

GAATTAAAAC AAGCGGAGAA GTATGTATAC GACGGTTCTA AGCTGGTCCT TCGAGAAATT 360 

15 CGCAAGTCGC CGACAGACGA AACGCCAATA AAGAAAGTTA GCTATCACTA TCTCTGTGGC 4 20 

AGCGATATGC CTTTTGAGAT TACGACAGAG ATGAGCGATG GCTATTTTGA AAGCCATACG 480 

CTTAACl-ATC TGAATGGAAA GATTGCCCGA ATAGATATCA TGACTCAACA GAACCCATCG 540 

GCCGAATTGA TCGAAACGGG TAGAATGGTA TATGAGTTTG ATGCCAATAA TGATGCTGTA 600 

CTGCTTCGTG ACAGTGTATT TCTTCCTCTT CAAAACAAGT GGGTAGAAAT GTTTACTCAC 660 

20 CGTTATACAT ACGACAATAA GCATAATTGT ATTCGTTGGG AACAAGACGA ATTCGGCACC 720 

CTCACCCTTG CCAACAACTT CGAATACGAC ACCflCTATCC CTCTGTCGTC TGTATTGTTC 780 

CCCACGCATG AGGAGTTCTT CCGTCCTCTT CTTCCCAATT TTATGAAGCA TATGCGTACG 84 0 

AAGCAAACGT ATTTCAATAA CTCCGGAGAA GGCTTGTCAG AGGTATGCGA TTACAACTAC 900 

TTCTATACCG ATATGCAGGG TAATGCACTG ACCGATC?1TG CCGTGAACGA ATCGATCAAG 9 GO 

25 ATTl'ATCCTC GTCCTGCCAC GGATl'TTCTG CGTATAGAAG GTTCGCAACT GCTTCGCCTT X020 

TCGCTATTCG ACATGAACGG GAAGCTCATC AGAGCTACCG AATTGACACG CGATTrGGCC 1060 

ATTATCGGAG TTGCATCTCT TCCGAGAGGC ACTTACATCG CAGAAATAAC TGCTGCAAAC 1140 

AGCAAAACCA TACGTGCAAA AGTATCGCTC AGA 1173 



30 



40 



(2) IMFORllATIor-) FOR SEQ ID NO: 7 



ti) SEQUEMCE CHARACTERISTICS: 

(A) LBWGTH: 12 B4 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRAHDEDHBSS : double 
tD) TOPOLOGY: circular 



(iij MOLECULE TYPt: DMA (genomic) 

(iii) HYPOTHETICAL: HO 

{ivj ANTI-SENSE: NO 

45 {vij ORIGINAL SOURCE: 

(A» ORGANISM: PORYPHYROMOWAS GIHGIVALTS 

(iKi FEATURE: 

(A) HAI-ir./KEV: misc_featuce 
50 (8} LOCATIOU l...i204 

(XU SEQUENCE DESCRIPTION: SEQ ID NO:7 

AAGAGGAATC CCTTACCTTT GACGGCATCA AATCGAAAGA TTTTTATAAA AATGAGACAG 60 

55 CATTTATCTC TATTTCCTTT TATCTTCrrr CTGCTTCTTG CCTTCTCTTA TGTCGGTTGC 120 

AGAACAGTCC GACAAACACC TAAGCAGTCG GAACGGTACG TCGTAGTCCT GTCTTTGGAC 180 

GGCTTCCGAC CGGACTATAC CGATCGGGCA CGTACACCGG CGTTGGATCG GATGGCACAG 24 0 

GAGGGATTGA GCGGGTCGCT CCAACCATGC TTCCCCTCGC TTACATTTCC CAATCATTAC 300 

AGC7VTGGCTA CGGGGCTTTA CCCCGATCAT CACGGTATCG TAGCCAATGA GTTTGTGGAT 360 

60 TCGCTACTGG GCATCTTTCG TATATCCGAC CGAAAAGCCG TGGAGACCCC CGGATTTTGG 4 20 

GGCGGCGAGC CGfJT'lTGGAA TACGGCCGCA CGCCAAGGCA TCCGTACCGG TGTCTACTTT 480 

TGGGTAGGAT CCGAAACGGC TGTGAACGGA A.'^TCGGCCGT GGCGGTGGAA AAAATTCTCC 54 0 

TCCACCGTTC CGTTTCGTGA CCCTGCCGAC TCCGTCATCG CGTGGCTCGG ACTGCCCGAA 600 

AAGGAGCGAC CGCGCTTGCT CATGTGCTAC ATCGAGGAGC CGGATATGAT CGGACACAGC 660 

65 CAAACGCCCG AAAGCCCGCT GACACTGGCA ATGGTAGAGC GGTTGGACAG TGTGGTCGGC 720 

TATTICCGCA AGCGGTTGGA CTCTCTGCCC ATAGCCGrAC AGACCGACTT CATCATAGTA 7 8C 

TCCGATCACG GTATGGCCAC GTACGAAAAT GAGAAATGTG TCAATCTGTC GCATTATCTG 84 0 

CCTGCGGACA GTrTCCTCTA CATGGCCACC GGGGCCTTCA CCCACTTGTA CCLGAAGCCC 900 

TCCTATACCG AGCGAGCCTA TGAGATCCTG CGGGCCAT'J'C CACATATATC GGTTTACCGC 960 

70 AAGG-3GGAGG TGCCCAAGCG TTTGCGCTGT GGCACCAATC CTCGTrrGGG CGAACTGGTC 1020 

GTGATTCCGG ACATAGGCTC CACCGTCITT TTCGCAATAA ATGAAGACGT TCGTCCGGGA 1080 

GCGGCACATG GCTATGACAA CCAAGCACCG GAAATGCGGG CTTTACTCCG GGCTGTCGGA 114 0 

CCCGATTTCC 6TCCGGGCAG TAGGGTGGAA AACCTGCCGA ATATCACCAT CTATCCGCTC 1200 

ATATGCAGGC TGTTGGGTAT AGAGCCTGCA CCCAACGATG CGGACGAAAC GTTGCTGAAC 1260 

75 GGCCTGATCC GAGACAAACG ACCA 1284 
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(2) IHFX5RHATI0N FOR SZQ ID NO: 8 

(1) SEQUENCt: CHARACTERISTICS- 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIOEDHESS: doubl- 
(DJ TOPOLOGY: circular 

tii3 MOLECULE TYPE: DNA (genomic) 

(ill J HYPOTHETICAL: NO 

(iv) AHTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRCWONAS GINCIVALTS 
FEATURE: 

(A) NAME/ KEY: mi »c featur- 

(B) LOCATIOJ>J 1. . .sTe 



25 SEQUEI^CE DESCRIPTION: SEQ ID NO: 8 

TTGTTCATCC CCTGTTATGT CAATGCAG-^^ ^A^cJ^A^r ^^J^^'' ^^^^-AGGT 120 

CTGCTGAAGA GTTTGGACAT AGATGTcS^ Jac^^I ISrfJ^^?'''^ CACGTACAAA 180 

30 CCTATGCCCA ATGCCGGATT CGAACAGAAA gS^S^I^^ J^JSiS^^^ TTGCGGCCAG 240 

CTGTTCGAGT CGTATGATGT AGTC^aS^ ^S^of ATTCGAAGAG 300 

GAAAACTATG ATCATATCCT CaSccgXS S^Ar^^SS ^^TCCGTTGC TTTCGTGAAA 360 

CGGGATATAT GCGAOTTCTT GCACgS?^ ??ACATGTCT GCAAGTCGGC AGCCAAGCTT 4 20 

GCCCATAAGG TGAgS^^ cSc^^^^c SSt^ ^^IfS!^^ CTCCCGATTC VhO 

35 CCCAGTGAAG TGCACCGACC GTACCACAAC S^^SS? SSSJ^S!*^ TCTGTCCACC 54 0 

GGCATAGAGG TATTCGAGCC GAAGCGAMA ^^I^^ ^^^"^ GATGGTGCAG 600 

TCGGTGGAGG AGCCGGAGGT ATCCACCTG? ^Trr^l^ GCGGTTTCGG CGGTATGTAC 660 

TCCACAGGTG CGGAGTACAT CAC^^g ^r^^^J^ ACAAGGTGCT GGATCACATA 720 
GTGATAGACA GAGAGAAATT ScSX^^r ^^^^^A^GCA TATGCAGGGA 
40 AACTTA ^MMMTT GCCGATCAAG ACAATTCATG CAGTAGAAAT TTTAGCAGCA 



(2) IHPORHATION FOR SEQ ID NO: 9 

ID SEQUENCE CHARACTERISTICS- 

(A) LENGTH: -753 bare pair* 
(BJ TYPE: nucleic acid 

(C) STRANDEDNESS: doubl- 

(D) TOPOLOGY: circular 

ili) MOLECULE TYPE: DNA (genomic) 
Uii) HYPOTHETICAL: NO 
55 (Iv) Aim -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISTS: PORYPHYRa-lONAS GINGIVALIS 
(ix) FEATURE: 



60 

65 '""^^ SEQUENCE DESCRIPTION: SEQ ID NO 



(A) NATvfE/KEY: misc feature 

(B) L0CATI014 1. . .753 



60 



720 
780 
840 
846 



S^^c ^^^^ «0 

TCCGCCGGAG ACTTCGTCTA TCT^^^?^ ^^J^^IT^.? AflAATTTGAA TCTGACCCTT 120 

AAGGCTTTGT ATCCTGAGGT GCCTATCTCT ^r^^l CGGGGAAGAG CACTTTGCTG 180 

70 CTG«CAAAGT TGAAACGGAA gSgTT^-^ ^^^^1 ^^""'^ ^GATTATGAT 2^0 

CRGGATTTCC AGTTGCTGAA CGGACGTa?? ctJS^aS ^J^J'"^ CATTGTGTTT 300 

GCTACGGACT GGAAAAACCCi AGCCGATCg? ^J^f^S^ ^^"GOATIT CGTTTTGCGA 360 

gtgggaatgt ctcggaaggc ??A'r^M§A l^"'^ tttgacccgt 420 

CGTGTGGGTA TAGCCAGAGC TTTGCTTCCG Sr'^^S^S TGTCCGGAGG GGAGCAACAA 4 80 

75 ACAGGCAACC TCGA„CGG. 0ACC^^§ ---^ --.^ACCC S40 
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GATGCTGTTT CAAGAAAAAA ^^^G^?^ CTTTGGTCGA GCTGAGTGCA 720 

5 

(2) INFORIIATTOH FOR SEQ ID NO: 10 

(I) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 714 base pairt- 

(B) TYPE: nucleic acid 
tC) STRAIIDEDTJESS: double 
JD) TOPOLOGY: circular 

(11) MOLECULE TYPE: DUA (genome) 

(iii) HYPOTHETICAL: NO 

fiv) AUTI-SENSE: tiO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGAJIISH: PORYPHYROIOMAS GINGIVALIS 
(ix) FEATURE: 
oc NAIIE/ICEY: mlsc feature 

i^) LOCATIOM 1...7r4 

Ui) SEQUENCE DESCRIPTIO^J: SEQ ID HO: 10 



30 ^^^^11 a'S^S.Z J^????^ IS^^?'^ TAAAAACAAO TAAAACAATG 
OATGGCGTAT COCtS^^: ^^^^ ^^^^ ^1^^^^ ^S^SS^f 

TGCGGTAAGT CCACTCTGCT CAATATrr^r- nr-r^r-.-t^ii^ TX? ^ GCCTTCGGGA 180 

TACAAGCTCC; ATGGGgSgA ^G^^C CTcSIflS ^f^^f^'^^ TTCCGGTATC 2^0 

AAGGGCAATA TCGGCTTCGT ATTCCAGAGC AAGACAQGAC TGCCGTCCGT 300 

35 GAGAACGTGG ACT^GCCG^T CCT^^^O S^?fl^ TCGAAGAGAT GACGGTAAGC 360 

GTGGAGGAGG CACK^l ^^P^TC ^C^^G C^^SS °'^°^«« «0 

CTCTCCGGAG GACAACAGf^ r-r-r-^X-i™ AQCCACCGGG CCGGCCACTT CCCCAATCAG 480 

CTCAtS?S^ ^S^S CTC^S:^ CCGTGGTGGC CAATCCGAAG 540 

GAACTGCTCA GAGCTC?^^ 5^^^^ S^^^CG 5???^^??^^ -^^ATGTCATG 600 
40 CACGATGCAC GTAGTGCCGG CCG^^^^^ ^^ScG aSS^ r^^'^""" 

(2) IHFORMATIOI4 FOR SEQ ID HO: 11 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1812 baso pairs 
(D) TYPE: nucleic acid 
(C) STRAHDEWESS: double 
5Q (D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DNA fgenomici 

(iii) HYPOTHETICAL: NO 

(iv) A/>IT1-SENSE: NO 

{ vi ) ORIGINAL SOURCE : 

(A) ORGAJUSM: PORYPHYROMONAS GINGIVALIS 
60 (ix) FEATURE: 

(A) NAME/KEY: mlsc feature 

(B) LOCATION 1 . . . ifi;* 

55 f^i) SEQUENCE DESCRIPTION: SEQ ID N0:11 

AGCACAGAAA CAAATAGCAA ATCTGAAATG AAAGAATtt^t Tr-nA«^-^^ - 

ATCCTCGGGG TTATAACGGC AGGAATCATC ^^^li .^S^^I! ^TTCGCCTCG 60 
GGCATCGTAG CCGGTATTGC CTCCAAGGCA I^r^^^ln i^^^^TT TATCTTTTTC 120 
AACTCCATCC TACATATANA cL^^t^ J^C^^f^ S^!!!'''^^ GATCGAAGCA 180 

sssj sss^ ™i E^ii s= S3= 

GACAAACTCT ACCTCAATCC ^GGAATG ACTACCTCTC CAGTATTGCA 

75 ATGTTCACA AAGAXGC^^ C^^^^ JS^^ ^^^^^^ 



300 
360 
420 
480 
540 
600 
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10 



15 



20 



ACCTACAAGG 

CAAATCACCA 

CGCAAGACGG 

GCCGAGAAAG 

AAAGAACTCA 

TCrCAGGTTC 

TTTGCCGAAG 

TCCATCACAC 

GCCGTAGTAC 

AAGCAGGTAG 

GCCTCGGGCG 

CTGACCGGCT 

ATAGGAGTGA 

GCTCCGATGA 

CTCTTCCTCA 

GCTCAAGGCC 

GGAGGTTTGG 

AGCATAGAGT 

GCGGATATGA 

CGCGAACTCC 

TTCATGCCGT 



CAGCCGTAGA 
CATACATAAA 
CAATGGATTC 
CGGTGGAGAT 
AA/\AGATGTC 
TGGCCAATGG 
GTGAAATAAC 
AAGAACTCGC 
TTCGTGTCAA 
CCGATCTCAA 
GATACTACAT 
CCATCGGCAT 
ATATGGACG-r 
CGGTCGAAGA 
CTCGCGTATC 
GTGTATGGCT 
ACACAGCTAT 
ATGGCAAGAC 
AGTCTGCCAT 
GCTCCATGCC 
AC 



GCCATTCy\TG 
CGGGCTTTGG 
CGTGAAAATG 
GAAGCTCGTG 
CCAACGCGGA 
CCCGATGAAC 
GGAAGAAATA 
CAAAGAAATC 
TTCTCCGGGA 
GGCCAAAAAG 
AGCCTGCGCA 
ATTCGGCATG 
CGTACAGACA 
TCGTGCCCTC 
GGAAGGCCGC 
CGGCGACAAA 
CAAACGGGCC 
CAAGCGCAAC 
CCTGAGTACC 
GCCCCGTCCT 



CTCAACAGGA 
GACAAGATCA 
TTTGCCGACA 
GATGAGCTGG 
GAGAAAGATG 
AAAACGAAAG 
ATAAAGAAGC 
AAGGCAGCAG 
GGTAGTGCTT 
CCTATCGTGG 
GCCAACAGTA 
TTCCCGAACT 
TCCAAGTATG 
ATCCAACGCT 
AACCGCACCA 
GCTCTTGCAC 
GCGAAGCTGG 
TTCTTCGAAG 
ATl'd'CTCCG 
TCXGGCATAC 



TGAGCGATGC 
CATCCGATAT 
AAGGCGAAAT 
CTTACCGTAC 
AACTTCGGTT 
GCAGTCGGAT 
CGTTCGACAC 
CCGATGACGA 
TCACTTOCGA 
TCTCCATGGG 
TCGTGGCAGA 
TCGCGGGCGT 
CAGACTTGGG 
ACATAGAGCA 
AGGCACAGAT 
TCGGTTTGGl' 
CTCAGCTCGG 
AGTTGCTCTC 
ATCCGGAAAT 
AGGCACGTCT 



CAATCGCGAA 

TGCAGAGTCG 

GTTCGGTCTT 

CGATGTGGAG 

CGTATCGCTT 

CGCCGTTCTC 

TGACGGTAGC 

TGATATCAAA 

ACAGATATGG 

aSACGTAGCA 

GCATACGACT 

AGCCAAGAAG 

CAACACCTTC 

GGGCTACGAC 

CGACAGCATC 

GGAT6AGCTT 

TGGCAACTAC 

CTCATCAGCA 

AGAAGTTCTG 

CCCCTATTAC 



660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1012 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) rNFOWIATlOH FOR SEQ ID HO: 12 

(i) SEQUENCE CHARACTERISTICS: 

<A} LENGTH: 972 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

111) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
Uv) ANTI-SENSE: NO 

(vX) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIV?iLIS 
( Ix ) FEATURE : 

(A) NAME/KEY; misc feature 

( 3 ) LOCATION 1 . . . 97 2 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:12 



CTAACGTTGT 
TCCGTTTCGG 
ACATACGAGG 
AATATACCGG 
ACACTGGCCA 
AAGCGCACCT 
CGCGAATCGT 
AAACTCGACA 
ACCAATCGGG 
TTGGATCGCG 
TCCCATCAAA 
GGACTGCTCT 
GACATGAGAG 
AAGGGCGATG 
CACGTAGTGC 
AAGAACCTCT 
CTGCGCTTGC 



TTTGTTGCAA 
TTCTCTTTTT 
CTTATGTGAA 
CAAGCATCAC 
GCGTACACAA 
ATCGTACCGA 
ATGAAGATCA 
GAGAAGACTA 
GCTATGCCAA 
AGAAGTACCC 
CGACCAAGCA 
ACATCATAGC 
CCTCCAAACT 
TGATCTATCT 
GTGTGGGCGA 
ACAAGCTCAA 
GC 



CTATTTCAAA 
CTTCGGGACA 
ACAGTACGCC 
CATAGCACAG 
CAATCACTTC 
CGATGCGCCG 
TTCCCGATTT 
TCGGGGCTGG 
TCTGCTGATC 
CTCATGGTTC 
GAAGCAGAGC 
CAAGCAAGGC 
GGCCAAATAC 
GGAGAAAAAG 
TTCGATGCAC 
CGACAAGGAT 



CAGATGAGAG 
GCGATCGGAC 
GACGAAGCTA 
GCTTTGGTGG 
GGGATCAAAT 
AACGAATGCT 
CTGCTCCAAC 
GCTACGGGGT 
AAGATGGTGG 
CACAAGTCTT 
GGACTCAAGC 
GATACCTTCG 
AACGATGCTC 
CACGCATGCT 
AGTATCTCCC 
CGCGAATATA 



CAAACATTTG 
AGGCTCAGAG 
TCCGACAGAT 
AGACAGGAGC 
GCCACAAATC 
TCCGCAGCTA 
CACGCTATCG 
TGCAACGCTG 
AGCTGTATGA 
ACCCCGGGTC 
ACGAAGCTTA 
ATTCTTTGGC 
CCGTGGATTT 
CCATCTCCAA 
AACGCTATGG 
TACCCCAAGA 



GCAGATACTT 
TCGAAACCGT 
GAGCCGCTAC 
CGGAGCCAGT 
GTGGACGGGC 
TTCGGCCGCT 
TCCCCTGTTC 
TGGCTATGCC 
GCTATATGCT 
CAACAAAAAA 
CTTCAGCTAC 
CGAAGAGTTC 
CCCGATCGAA 
ACACACACAG 
CATCCGGATG 
GGGCGATATA 



(2) INFORi-IATION FOR SEQ ID NO: 13 

li) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 1599 base pairs 
(B) TYPE: nucleic acid 
(CJ STRAt'lDEDHESS: double 
(Dl TOPOLOGY: circular 

(11) MOLECULE TYPE: DHA (genomic) 



60 
120 
ISO 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
700 
840 
900 
960 
972 
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ttii) HYPOTHETICAL: MO 



tivl AI^TI-SEHSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGAt'IISM: PORY PilVROMOMAS GINGIVALIS 



10 



(Ix} FEATURE: 

(AJ TJAHE/KEY: mlsc feature 
(B) LOCATION 1. . .1599 



ixi) SEQUENCE DESCRIPTION: SF.Q ID HO: 13 



15 



20 



25 



30 



35 



40 



AGGATCCCCG 
CATCAGGCTA 
ACTATCACTT 
GCTGAAACTG 
ACAAACAAGC 
AAATATGCAG 
GTCGGTACTA 
ATCCAACACA 
GCCGGTCAGA 

AAGCTcrerG 

GAATCCAGAC 
TCGTCCATAT 
Ari'GCATCAT 
CTGAGTAAGT 
AAACATCTGC 
CGCCGTCATG 
GTATGTAAGG 
CATCTGAT'JC 
AAAGCAGAAG 
ATGGATCTGA 
GCCGTGTACG 
TGCAATTTGC 
GCTATCGTAC 
CTTCGTAATT 
CTCGAATCTT 
ATCCTAATGC 
AGGCGGCAAA 



ACGAGCAGAC 
TCGAAGCCAA 
TGCAGAACTA 
AAGCGGGAGA 
CTATCGCCCG 
CAGTTATCGA 
CTTCGGTGGA 
ATGTACTCAA 
AAGGAACTGT 
CCGAGGTTAA 
GAGTGGACAG 
TCTAT6TTTC 
TGATCGATCG 
CCGTGGAGCG 
TTGAGTACGA 
CTTTGATGGG 
CTCTQATTGA 
GTGCACTCGC 
AGCTGACCGA 
TCGCAGAAGT 
AGCGCATTCT 
GTGAAGCGGA 
TGCATACTAT 
CCGTTCAGAA 
ACGAACTGTT 
GTGCTCGGAT 
TAGAAATCCG 



CGGACGTATC 
AGAGCATGTG 
TTTCCGCATG 
GCTTTGGGAC 
TAAGGATATG 
AGAGATTGTA 
AATATCCGAA 
TGCCAAATTG 
TACCA'i-CGCA 
GAAAGCCGGG 
ACAGCTTCGT 
CCTTGAAGAT 
TTTAGGTTTC 
TGCTCAAAAG 
TGATGTAATG 
AGAGCGTATC 
CAATTATGCA 
GATAGAATCT 
TATGCTTTTC 
GGCCCACCCT 
AATCCCCATT 
TGAAACTCAA 
CGATGAGTCT 
TGCCAGCTAC 
CCGCAAGATG 
ACCGGTACCG 
ACATGCAACC 



ATGGACGGAC 
AAAGTAGAGG 
TATCATAAGC 
ATCTACAAAC 
AATGATCGTA 
CGTCTTGTCG 
TTGTl'GAGCC 
CATCAGAAGG 
ACGAACATGG 
GGTTTGGCTA 
GGTCGITCCG 
CATCTGATGC 
AAGGAAGGAG 
AAGGTGGAAG 
AATTCGCAGC 
GGTATGGATG 
GAAGCCAATG 
CCl'ATCACGC 
GATGAAGCTT 
GTGGTTCATC 
ACGGATGCTA 
GGGAAAAGCA 
TGGAAAGA.i\C 
GAAAACAAAG 
GTAGAAGCCA 
GAGGCTCCTT 
CAACAACGT 



GTCGATATTC 
CTGCGACACA 
TGGCAGGGAT 
TGGACGTTGT 
TCTATAAGAC 
AAGAGGGCAG 
GTATGTTACG 
AGGCCGAGAT 
CCGGTCGTGG 
TCATTGGTAC 
GCCGTCAGGG 
GCCTCTTTGC 
AAGTGCTCGA 
AGAACAACTT 
GTGAAGTCAT 
TACTCAATAC 
ATTTCGAAGG 
AAGAAATATT 
ACy\AGTCTTT 
AGGTATTCGA 
AACGTGTCTA 
TCATCAAAGA 
ATCTGCGTGA 
ATCCACTACT 
TGAACCGTAA 
CCCAAGAAGA 



GGATGGCCTC 
GACATTTGCA 
GACCGGTACT 
AGTTATTCCG 
GGCACGTGAA 
ACCTGTACTT 
CTTGCGTGGC 
TGTAGCTCAG 
TACCGACATC 
GGAAAGGCAC 
TGATCCCGGT 
CACAGAAAAG 
AAACAACATG 
CGGTATCCGT 
TTATACCCGT 
CATATACGAC 
CTTCAAGGAA 
CAGAGI5TAAG 
CCAACGTAAG 
GACCCAAGCC 
TAACATAGGA 
ATTTGAGAAA 
GATGGACGAG 
TATCTATAAA 
GACCGTAGCG 
GCTGGAACAC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 

Q^o 

900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1599 



45 



50 



55 



(21 IWFORHATTOH FOR SEQ LD WO: 14 

(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 2160 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtJDEDNF.SS : double 
(U) TOPOLOGY: circular 

Hi) MOLECULE TYPE: DNA (genomic) 

(tii) HYPOTHETICAL: NO 

(iv) AHTI-SEWSE: NO 



60 



(vif ORIGINAL SOURCE: 

(A) ORGAh/ISM: PORYPHYROMONAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAi'IE/KEY: mlsc_Ceature 

(B) LOCATION 1. . .2160 



65 



70 



75 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 14 



AAATCTTGCC 
CCGTGCCATC 
GCTTTC6CGC 
GCTCTACAAG 
AACCTGACTG 
AGGCGCGACC 
CTGATTGCCG 
ATAGGCTATC 
GAGCAGGCTT 
ATGGCCGGAC 



GTGTTATTGG 
AAAGCCTATC 
GGAAAGGCCA 
GCGTACAACA 
TCGATTTCGG 
GTATCGCTCT 
GCATGGAAGA 
TGCCGCAGGT 
TCAACGATAT 
GTACGGACTA 



CCAGACGAGG 
CGGAGTTTGC 
AAGAAGGCAT 
TCATCAAACG 
CACCCGTCTG 
TGTAGGGAAG 
ACCGACATCC 
GATGCGTTTG 
TCGCCAAATA 
CGAATCGGAT 



CGATACGGAT 
CGATGCCTAT 
CGCCGATCTG 
AATGAGCACG 
CTCTTCGATC 
AACGGTGCCG 
GGACACATAG 
CAGGACGGAC 
GAGGAAGAGA 
GACTATATCC 



GCTGCCCTGT 
'ITCAATCGCG 
AGTCGGGCAG 
AAGTCATGAT 
AGGTATCATT 
GCAA6AGTAC 
CACGCCCCAA 
ACACGGTTTA 
TACGGCGTCT 
GACTGATAGA 



CGGACTACGA 
GCCTGCTGTT 
GCGAATACGG 
CTCCGTCAAT 
CGTCATCAAC 
GCTGCTCAAG 
GGGGATCCGC 
CGAAGAGGTC 
GTCCGATGAA 
GCATTATACG 



60 
120 
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240 
300 
360 
420 
480 
540 
600 
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AATATGAGCG 
ACATTGATCG 
GGAGGATGGC 
CTGCTCGACG 
ATCGCCACCA 
ACCACGACAC 
AGCCACTTATG 
CAGCAGAAGA 
AAGTCCGTAC 
GACGAGCGGG 
TATCCGCTAA 
GCTACATACA 
AAAAGTACCA 
CTGGGGCACA 
GATCTCACGG 
AACGATTTGC 
CTGAGTGGAG 
TTCCrrATTC 
GAGfiCGATCA 
GATGGGCTTG 
GGTATATACG 
ACTACGATCG 
TACCGTCGGC 
TGCGAGGAGC 
GATCCGAAAC 
CTCGAAAAGG 



AGACCCTCTC 
GTCTGGGCTT 
GTATGCGGAT 
AGCCGACCAA 
ATGCAGGAGC 
GCACTATCGA 
TGGAGCTGCG 
TGATCCGCGA 
AGGTACAGAG 
ATCGTTCGGC 
TAGTGGATGA 
CCATCGAAAG 
TGGTCAAGTG 
ACGTGCAGCT 
TATTCGACAC 
TCGGGGCTTT 
GAGAACGAOC 
TCGATGAGCC 
AGAACTTCGA 
TCAGCAAGGT 
ACTATCTCCG 
AAACAAAAAC 
AAAAGGAGGT 
GGATCGGAAA 
ATGCGACTGA 
CCATGGAGGA 



TCTCATGCAG 
CGGCCGAGAG 
AGAGCTGGCC 
TCACCTCGAC 
CGTTATCCTG 
AATAGAACTG 
CGAAGAGCGG 
TACGGAGGAC 
CCGGATCAAA 
ATTTCACTTC 
TTTGGCCAAG 
AGGCQAAAAG 
TATCATGGGA 
GGGCTACTTT 
GATAGACCGT 
TCTCTTCGGG 
ACGATTGGCr 
GACCAATCAC 
TGGGACTGTC 
GTATGAATTT 
GACCCGCCGT 
CACACGGGAG 
AGCCAAACAG 
ATTGGAGTCG 
CGCGAATCTG 
CTGGGAACAG 



CAGGGCAACT 
GACTTCCACC 
AAACTTCTGC 
ATCGAATCCA 
GTGTCGCACG 
GGACATATA]* 
CTGCGACAGC 
TTCATCGAAC 
CAGTTGGAGA 
CGCTTTATCC 
GCTTATGGCG 
GTGGCTTTCG 
GAGCTGACAG 
GCCCAAAACG 
GAGGCCGTGG 
GGCGAAGCAT 
ATTATCAGGC 
CTCGATATGC 
ATCGTAGTAT 
GCAGATGGAC 
ATGCAGACGC 
GCTATACCTG 
CTGCGCACGT 
GAATTACAGG 
TTCGAGCGAT 
GCTTCCGAAG 



ATCATGCTGC 
GCCCCACAGC 
TCCAACGCCC 
TCGGCTGGCT 
ACAGGGCATT 
ACGACTACAA 
AGATGCGTGC 
GATTCAGATA 
AAGTAGAGCG 
CGGCACAGCC 
ATCACCAGGT 
TAGGCAAAAA 
ACTACACCGG 
AAGCCCAAGA 
GCGACATCCG 
CGGAAAAGAA 
TTTTGCTACA 
GCTCGAAGGA 
CTCACGACCG 
AGGTGAACGA 
TGACAGAGCT 
AAACGGAAGC 
TGGAGCGAAC 
CAATAGAGAT 
ACGCCGGCAT 
CTTTATCCGA 



GATCX3AACAG 
CGATTTCAGC 
CGAAGTTTTG 
GGAGCAGTTC 
CATCGACAAT 
GACCAACTAC 
CTACGAGAAT 
CAAGGCCACG 
CGTGGAGATA 
TTCCGGCAGT 
GTTTTCCGGA 
CGGTGCCGGC 
CAAGCTCGAA 
GCTAAGAGGG 
TCTGC3CCTG 
AGTAAGTGTC 
GCCGGCTAAC 
TGTACTGAAA 
TGAGTTCCTC 
ACACCTCGGA 
GGAGCGAACC 
CAAAGCGGAC 
CGTAGCAACC 
GCTACTGCAA 
GAAACAAGAA 
AGCCrCAAGGA 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(2) INFORMATION EX>R SEQ ID NO: 15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPF: nucleic acid 
tC) STRANDEWESS : double 
(DJ TOPOLOGY: circular 

(ix) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PH YROMONAS GINGIVALIS 

( ix ) FEATURE : 

(A) NAUE/KEY: mlsc_£eatur« 

(B) LOCATION 1...11SB 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15 



TATTTGAAGC 

AAAATATTCT 

CGCGCCGTGA 

AGCGTTTGTA 

CATCTCGGAA 

TATCGAAGGA 

ATTTTTATCA 

TGCGAACATA 

TATTCAGGGC 

ATCTTACGTG 

CAAAGAGATG 

GGTTTTGAAT 

CTTATCATAG 

AATATGGAGT 

TCATCTGCTA 

GCAGAAGCAT 

ATCGAAAACG 

AAGTTACGCT 

TTGATGGTCA 

GAAATCGGCT 



TAITAATACT 
CTATGATTTA 
TCAACTTGGC 
CAAAAGAAGG 
TTAGGCTTTA 
TAAAAGCCCT 
ATACAATTTT 
TCTCTTATGA 
•ITGATGCCGT 
GACGCTCl'AA 
CTACTACTCA 
TCATGATAGA 
TCGGAGATGG 
CGCAAATAGA 
TTTATCTAAT 
ATGCACTACC 
GTCGCAATGG 
TATTGATGGA 
AATCCTACTC 
ACATGAAT 



ACAAATAGCC 

TCTGCTGTTA 

TAACAACCTG 

AGAGCCTTCC 

TGGCAATGCA 

ATACAAGAAG 

GTCTCAGATC 

TATTGCCCGC 

TGTAGCACTG 

AGCATATGTC 

CAAACAAATG 

ACATGCATCA 

CGAAAATGAA 

AATACATCCA 

GACGTCCCGT 

TATAATCTCA 

TTTCCTTGTG 

TGATGAAACT 

TCCGGCAAAT 



TTGATGAATT 
GATACAATAA 
CATGCCAATG 
TTCCAAGTAG 
TTAGCCCGCA 
CCTGAACCGG 
AGTAACAGAG 
CCTATTACAA 
ACAAAAAGAG 
ATACCCAATC 
TTGGCGATTG 
CGAGTGCTGC 
rCGATGCTAC 
TCTACACCGG 
TTCGAAGGAC 
TACGATTCTC 
CCAATGGAAG 
CT-TCGTAAGA 
ATCTATGAAT 



TCTTAAAAAA 
CAAACCGTGC 
GTCATCGCGT 
AAAAAGGAAT 
AAACAGTATA 
TTTTATTGAT 
GCAGAATATT 
AACGCAT7»AG 
ATCAGCAATC 
AAGTTTCATT 
GCAGGCTTAC 
GAGAAAGGCC 
GTAAAGAAAT 
AAATTCGCAA 
TACCAATGGT 
CGACCGGCCC 
CACATGAAGA 
AAATGGGACA 
GTTGGAAGAA 



AGAACCGTTT 
CGGTACAGAA 
ATCATTAGTC 
AGAAGTACAC 
TTTCAAGGCT 
AGGGACTAAT 
TACGATCGGA 
GGGGTTTCTG 
GTTCGAGGCA 
TACTACAGTC 
CfACCAGAAG 
TGATTGGAAG 
TGCATCTCGC 
ATACTACGAA 
ACTTCTCGAA 
GAGGGAACTG 
Cl-TCGCGGAT 
AGAATCAGAG 
ACTATTCGTC 



60, 
120 " 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 

1020 

1080 

1140 

1158 



(2) INFORMATION FOR SEQ ID NO: 16 

/O 
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(1) SEQUENCE CHARACTERrSTICS : 

(A) LENGTH; 1965 base pairs 
iB) TYPE: nucleic acid 
to STRAW DEDNESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) H\'POTHBTICAL: NO 



tiv) AI-!T1 -SENSE: NO 



{vil ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 



(ixl FEATURE: 

(A) NAME/ KEY: mlsc_featuce 
(B! LOCATIOtJ 1 . . .1965 



ixi) SEQUEUCE DESCRIPTION: SEQ ID MO: 16 



ACAACGAAAG AAAACGAGAC AAC7VACAAAA AACGAATA'J'A GAATTATGGG AAAAATCATT 60 

GGAATTGACT TAGGCACAAC GAACTCTTGT GTCTCTGTAT TGGAAGGTAA CGAACCTATC 120 

GTTATTACAA ACAGTGAGGG CAAGCGCACA ACGCCCTCGG TAGTGGCTTT TGTGGATGGT 180 

GGCGAGCGTA AGGTGGCCGA TCCGGCCAAG CGTCAGGCCA TC^CCAATCC GACCAAGACG 240 

ATATACTCTA TCAAACGCTT CATGGGCGAA ACTTACGATC AGGTTTCCAG AGAAGTGGAG 3O0 

AGAGTGCCAT TCAAGGTAGT ACGTGGGGAC AATAATACTC CGCGCGTAGA TATAGACGGT 360 

CGTC:rCTAT7\ CGCCGCAGGA AATTTCGGCC ATGATCCTTC AGAAGATGAA GAAGACGGCC 420 

GAAGACTACC TCGGTCAGGA AGTAACGGAG GCCGTGATCA CTGTGCCCGC ATACTTCAAC 4 BO 

GACGCTCAAC GTCAGGCAAC GAAAGAAGCA GGAGAGATCG CCGGCCTGAA AGTTCGCCGT S4 0 

ATTGTGAACG AGCCTACGGC AGCTTCTCTG GCCTACGGTC TGGACAAGTC CAATAAGGAT 600 

ATGAAGATCG CTGTCTTCGA CTTGGGTGGC GGTACCTTCG ATATCTCTAT CTTGGAATTG 660 

GGCGACGGCG TTTTCGAAGT GAAATCGACC AACGGTGATA CGCACCTCGG AGGAGACGAC 720 

TTCGACCACG TGATCATTGA CTGGCTGGCA GAAGAGTTCA AGTCTCAGGA AGGTGTGGAT 780 

CTTCGCCAGG ATCCTATGGC TATGCAGCGT CTGAAAGAAG CTGCCGAAAA AGCCAAGATA 64 0 

GAGCTCTCCA GCACTTCATC TACGGAGATC AACCTCCCCT ATATCATGCC GGTGAACGGC 900 

ATCCCCAAGC ACTTGGTGAT GACGCTTACA AGGGCTAAGT TCGAGCAGTT GGCCGATCGT 960 

CTGATTCAGG CATGTGTGGC ACCCTGCGAA ACGGCCTTGA AAGATGCCGG TATGTCACGT 1020 

GGCGATATCG ATGAAGTGAT TCTCGTAGGT GGTTCCACAC GTATTCCTGC TATTCAGGAG 1000 

ATTGTGGAGA AGATCTTCGG TAAGGCTCCG TCCAAGGGTG TGAATCCCGA CGAAGTGGTA 114 0 

GCTGTGGGTG CCGCTATTCA AGGCGGTGTT CTGACCGGTG AGGTAAAGGA TGTCTTGCTG 1200 

TTGGACGTTA CCCCCTTGTC GCTCGGTATC GAGACTATGG GAGGCGTGAT GACTCGCTTG 1260 

ATCGATGCCA ATACCACTAT CCCGACGAAG AAGAGCGAAA TCTTTACCAC AGCAGTGGAC 1320 

AATCAACCTT CGGTAGAGAT TCATGTACTT CAGGGTGAGC GTTCTTTGGC TAAGGACAAT 1300 

AAGAGCATCG GCCGTTTCAA CTTGGACGGT ATTGCTCCGG CGCCCCGTCA GACACCGCAG 144 0 

ATCGAAGTAA CGTTTGACAT CGATGCCAAC GGTATCCTGA ATGTAACGGC TCATGACAAA 1500 

GCTACCGGCA AGAAGCAGAA TATCCGCATC GAAGCCTCCA GCGGTTTGTC CGATGATGAG 1550 

ATCAAGCGCA TGAAGGAAGA GGCGCAGGCC AATGCCGAAG CAGATAAGAA AGAGAAAGAA 162 0 

CGTATCGACA AGATCAATCA GGCCGACAGC ATGATCTTCC AGACGGAAAA GCAGTTGAAG 1680 

GAGTTGGGAG ACAAATTCCC GGCCGACAAG AAGGCTCCGA TCGATACCGC TCTCGACAAA 17 4 0 

CTGAAAGAAG CACACAAAGC ACAGGATGTA GCTGCTATCG ATACAGCCAT GGCCGAACTG 1800 

CAAACCGCTC TaTCCGCAGC GGGCGAAGAG CTTTACAAGA ATGCCGGAGC AGCCCAAGGT i860 

GGCGCACAAC CCGGTCCGGA CTTCGGCGGT GCTCAACGTC CCTCTGCCGG TGATCAGCCC 1920 

TCTGACGACA AG7V\CGTCAC AGACGTAGAC TTCGAGGAAG TGAAG 1965 



(2) INFORMATION FOR SEQ ID NO: 17 

(t} SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1401 base pairs 
(B) TYPE: nucleic acid 
to STRAIIDEDNESS: double 
(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DMA (genomic] 

(iiil HYPOTHETICAL: NO 



(Iv) AiJTI-SEUSE: NO 



tvi) ORIGINAL SOURCE: 

I A) ORGANISM: PORV PHYROHONAS GINGIVALIS 



( i X ) FEATURE : 

(A) MAllE/KEV: mlEc featvir- 



SUBSTmrrE sheet (Rule 26) (RO/AU) 



v:^i if^ i*"^ 

^ :z::m 



ip; o o 



jj. arm, TT^sii tt=ii 



wo 99/29S70 



PCT/AU9aA)10a3 



11 / 490 



(Bl LOCATIOtJ 1. . . 14 01 



(Xi) SEQLiEHCE DESCRIPTION: SEQ ID NO: IT 



10 



15 



20 



25 



30 



35 



PJKG-rGGGCAC 
TACGACTTAG 
AAAGGTGGCC 
GAAGGATGTA 
ACGGCATCTA 
GCCAGAAAAG 
GCCGGAGTAG 
ATCGGCATTA 
TCGGAGACGT 
GAAGCTCTAC 
GGAATGGAGT 
CTGCCGGAAA 
GAAAAAGAAG 
GTTACGGTAG 
GTGGGACGTC 
GGTGTAAAGA 
ATTACAGGCT 
CAGATTTTGG 
ACCAATCCCS 
GCCTACACTG 
CAAGGCAATCi 
CTCATTGCCA 
ATGACGGA7C 
AAAGAAACTC 



GTACAACTAC 
CTATCZATCGG 
TGAAAACCCT 
TACCGACCAA 
AATATGCAGT 
GTAAAATCAT 
AGATOGTGAC 
CTGCGGGCGA 
TTATTCCACC 
AGAACAAAfiA 
TCGCTTCTTT 
TACTCAACGG 
GAATCARATT 
AATACGAAGG 
GCCCCGTGCT 
CTAATGAGAG 
TCTCGCTTTT 
GCAAAACAGA 
AGGTCGCCGG 
TTCGTCGCCT 
GAGAGTGCAA 
ATCCGxSCCGG 
GACAAATCGA 
TCGCCGGAGG 



TTTAAGGATA 
TGGAGGGCCG 
CCTAATTGAG 
GACGCTACTC 
AAGT'.TGAACG 
TCGCAAGCTG 
GGCAGAAGCT 
AGCACAGTAC 
CATCCCCGGA 
GATTCCGACC 
CTTCAACGGT 
TATCGATCCC 
CTACCTCGGG 
AGAAAGCAAA 
GCAAGGATTC 
GATGCAAACT 
GGCACATACG 
CGAAACGATG 
TGTGGGAGAG 
TCCTATGGCC 
ACTACTACTT 
CGAACTCATC 
ACGAATCATA 
T 



TCGAACGCAA 
GCCCGTTATA 
AAGAATGCTC 
TACTCGGCCA 
GCOSATGOAC 
ACTGCAGGCA 
ACCGTAACGG 
AAAGCTGCCA 
GTGGAGCAGA 
TCTCTCCTCA 
ATCGGTACGC 
GAACATGCAG 
CAGAAAGTAA 
GAGATCGAAG 
GAGTCGCTCG 
TCCCTGCCCA 
GCTGTACGGG 
AGCTACCGTG 
ACGGAAGAAT 
TTCTCCGGTC 
GATGAAGAGA 
GTAACCGCTG 
TTCCCTCATC 



CAATCTATAA 
CGGCTGCCGA 
TCGGTGGTGT 
AAGTGCTACA 
TTGACCrCGG 
TCCGTTCACG 
GATGCGATGC 
ACCTGCTACT 
CAGAGTATTG 
TCATCGGTGG 
AAGTGCACGT 
CTATGCTACG 
CATCGGTTCG 
GAGAACGTAT 
GATTGGTGCT 
ATGTCTATGC 
AAGCAGAGGT 
CCGTACCAGG 
CGCTTCGCAA 
GATTTGTAGC 
ACCGCTTGAT 
CCATGGCCAT 
CGACTGTAGG 



AACTATGCGC 
ACGTGCTGCC 
ATGCCTCAAC 
TCAAATTGCT 
CAAGGTGATT 
CXTTGACAGAG 
AGACGGCATC 
ATGTACCGGT 
GACAAACCGT 
TGGAGTGATC 
GGTGGAGATG 
CGCTCACTAT 
CAACGGAGCT 
CCTGATGAGT 
TGCCGGCAAA 
TGCAGGTGAT 
AGCAGTAGAT 
TGTGGTGTAC 
AGCAGGACGT 
AGAAAACGAA 
CGGAGCACAC 
CGAGACCGGC 
CGAAAYCCTA 



(21 INFORMATION FOR SEQ ID NO: 18 

(t> SEQUENCE CHARACTERISTICS: 

tA) LEHGTH: 2835 base pairs 
tB) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 



60 
120 
180 
240 
300 
360 
420 
480 
5^0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1401 



40 



45 



(li) MOLECULE TYPE: DMA (genomic) 

UiiJ HYPOTHETICAL: NO 

(iv> AKTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: PORYPHYROMONAS GINGIVALIS 



50 



(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 

(B) LOCATION 1...2e3S 



(xl) SEQUENCE DESCRIPTION: SEQ ID N0;I8 



55 



60 



65 



70 



75 



CCl'AAGATAC 
ATTCCGATGA 
GTCGGTAACT 
CGTGCGGATT 
GGTTTGGCTC 
AACTTGATCA 
ACCGGATTCG 
ATCGACTCCT 
GAGATCGACG 
CTTCCSTATGT 
ATGCCCATCQ 
TATAAGAAAT 
GACTATGTGG 
GCAGAGCGTA 
GATGCTGAGG 
GAAGTGCGAG 
GCCGTGAATG 
GGAGCTTTCT 
GCCACGGl'TC 
AGCCTCCGTC 
AAGCGATACG 
GAATACTCCA 



TTATGGAATT 
AGCTTTCTGC 
TGGACAACGG 
TCTTTATCGC 
ACTTCTTGGA 
ACTATCTCGA 
ACAAGACGGA 
GCTTGCTTAT 
AGGAGCGCGG 
TCGAGGCTAT 
GTCTGATGGA 
GGTATCGTCC 
AGAACAAGAT 
TCTATACGCC 
CTACTACCAC 
GATCGATATT 
AGCGTCTGTC 
TCTCTAACTT 
GTGAGGGTGA 
AGTTCGGTAT 
AGAATCAATA 
CCTACTTCAC 



GAAAAGATTT 
CCAACAC^GCT 
ATTGACTTAT 
ACAAAAGGTA 
ACACATGGCT 
AACGATCGGT 
ATATACGATA 
CCTGCATGAT 
TGTGATCCAG 
ACTTGCCAAG 
CGTCGTGCTC 
CGACCTGCAA 
CAAAGAACTC 
GGTAGAGGAC 
GCAGCTCTCC 
CGGACTTGTG 
CGAGATTACT 
CATGTACATC 
AGCGGAGAAA 
CACCAAAGGC 
CAACGAAAGA 
CGATGGCGGC 



ITATCACTTG 
CAGCCACTCC 
TTCATCCGTC 
GGTTCTATTC 
TTCAACGGTA 
GTACGTTfCG 
ATGQATGTGC 
TGGAGTAACA 
GAAGAGTGGC 
GCJATGCCGG 
AACTTCAAGC 
GGTCTGGTGA 
TTCAAGGACG 
AACGATGAGC 
ATCAGCTTCA 
GAAGACTATA 
CACAACCCTA 
ACCCAGACTA 
GCGATGAACG 
GAATACQATC 
GACAAGCGTA 
TATATCCCGG 



GTCTTCTGCT 
CTACAGATCC 
ACAACGAGAA 
TTGAAGAAGA 
CGAAGAACTT 
GTCAGAACCT 
CGACTACACG 
ATAITACCCT 
GTCCTCGTCG 
GTAATAAATA 
ATGATGAGCT 
TCGTGGGAGA 
TTCCTGCTCC 
CTATCGTAGC 
AGAGCGACCC 
TGAAACAGGT 
ACGCTCCTTT 
AGGACGCATT 
CATTGGTGGC 
GTGCACGCAC 
AGAACAATGC 
GTATTGAGGT 



TGTGGGATTC 
GGCTGTTOTT 
CCCGAAAGAT 
TAGCCAGTCC 
CCCCGGTAAG 
GAACGCTTCT 
TCAGGGAATC 
CGACGGGCAT 
CGATGCCAAC 
TGCAGAACGC 
GCGCAACTAT 
TATCGATGTG 
CGTGAATCCA 
CATTGCTACC 
CACTCCTCAA 
GATCACTACA 
CCTCAGTGCA 
CAATTTTGTT 
AGAGATAGAA 
GAATGTGCTC 
TTATGCCAAT 
GGAATATCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
04O 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 



SUBSTITUTE SHEET ORule 26) (RO/AU) 



tr=i! iP^i -li"'" a^ii ,ia ..if 

%3 r3 a «aL iT™ o a3 



»P«ii ti=^, tt^Hi 

u :oi icn: a tuu u 



wo 99/29870 
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10 



15 



20 



25 



ACGGTGAATC 
ATGATCGATC 
AAGATTCC(5A 
GAAGCCAAGA 
AAGATCGTTT 
GGCATCAAAG 
CTCAGCCCGG 
TTCATGAACG 
GGTCGCTCTG 
ACXiACTGTAG 
CGCAAGGATC 
CAGGAAGCCA 
AATCCGATGA 
ATGGCTTTCT 
AATCTGGATG 
CTCAAGCGTG 
GATTGCAAGT 
GGAAATGTGG 
CATCAGGTGT 
TTCGGCGGTC 
GATCCTGCTC 
AAGGAGGGCC 
AAAGAAAGTC 
GGAAATGACT 
CAAAAGTTTG 
GTTGCAAAGG 



CTTTTGCTCC 
CGGTGAAGAA 
GCGAAGCAGA 
AAGACGAAGT 
CCGAGAAGAA 
TATACCTCAA 
GTGGTATCCT 
TGGGTGGCTT 
CTTCCGTATC 
AAGATATGGA 
CCGAAGCGTT 
ACCCGATGGC 
TGAAACCCAT 
ACAATGAGCG 
AAGCCAAGAT 
GCGATAAGAT 
TCGAGAAGGA 
AATATACGCT 

acacggctac 
tcx;agcaata 
gtgccgagoa 
ccaatgtgga 
tgcgtgagaa 
tcatcacaga 
cggcagacctt 

CTCAA 



TCAGGTTCCT 
TGCTGTCGTT 
CTTCCTCGCT 
CTCCGACCAA 
AGATCAGAAG 
GAAGACCGAT 
CTCCGGAAAG 
GGGCAACTTC 
TCCCTCTITG 
AACTTTCTTC 
CAAGGCCACA 
TGCGCTTATG 
GAAAGCTGCl* 
ATTCGCTGAT 
GAAGCCATTG 
GAATAAGGCT 
AATGGATACT 
CAAGAACAGT 
CGTTCGCGAG 
TCCTCAGCCC 
AATGAATGCT 
ATACTTTAAG 
TCGTTTCTGG 
CTACGAATCC 
CTTGAAGCAG 



CTGGAAGCAT 
ACCCTCACCG 
GCTTTCAAAG 
AAATTGATGG 
TTCGGTACCA 
TTCAAATCAA 
CATGCTCXCA 
GATGCTATCC 
TCTCTGCTCA 
C7VGTTGATCT 
CACGAAAAGT 
GACTCTATCC 
GACGTGGAGA 
GCCGGCGACT 
ATCGAAACTT 
CAGGTACCGG 
CCTTCGACTA 
CTCCTGCTGG 
AAGGAAGGCG 
AAGGCTCTGA 
ATCGTTTTTG 
AAGACTATCG 
CTCGAAGCCA 
GTACTGAACG 
CAGAATCGGG 



TCAATCAGGC 
GTCCTTCAAA 
CTGCTCGTCA 
AGAAAGCTCC 
CGGAACTTAC 
ACGAAATCCT 
ACCAATCTGT 
ACCTGGATAA 
GTGAAGGTCT 
ATCTCCAAAT 
TGTACAATAA 
GTCATACCAT 
AAGTAAATTA 
TTATGTTCTT 
ATCTTGCTTC 

CTATATTCGA 
AAGTCTTCTC 
GTGCATACAG 
TGCACATCTA 
CTGAGTTGGA 
AAAACCTGAA 
TGAAGGCGTC 
GTCTTACTCC 
TTGTTGTCAT 



TATTGCXrCAA 
GGCTGAAGCC 
GCAGAAAGTA 
TAAGGCCGGA 
CCTTAGCAAT 
GATGAGTGCT 
GATGAATTCG 
GGTGCTGACA 
TTCGGGCAAA 
GACTGCTAAC 
CTTGAAAAAT 
GTACGGCGAT 
CGATCAGGTA 
CTTTATCGGT 
ATTGCCCAAC 
GGGAAAGATC 
TGTCGTGTCC 
AGCCGTAATG 
TGTGGCTGCA 
TTTCCCCACG 
GAAGCTTGCC 
TAAGCAGCAC 
TTTCTTCGAA 
TGCTGAATTG 
GATGGCTCCT 



1380 
1440 
1500 
1560 
1620 
1690 
1740 
1800 
1B60 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2020 
2035 



30 



35 



(2) THFORI1ATION EX)K SEQ ID hfO:19 

(i) SEOL'ENCE CHARACTERISTICS: 

(A) LEWGTH: 2058 base pairs 

(B) TYPE: nucleic acid 
{C) STRAHDEDTIESS : double 
(D) TOPOLOGY: circular 



40 



(ii) HOLECUI^ TYPE: DMA (genomic) 
<iil| HYPOTHETICAJ.: NO 
(iv) AUTI-SENSE: NO 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A> ORGAJ>IISM: PORYPHYROHOtJAS GINGIVALIS 

fix) FEATURE: 

(A) NAI^E/KEY: misc_featurs 

(B) LOCATION 1...2058 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 19 



TACACTATGA 
AAAAAATTCC 
GATGCTACCfC 
GACCTCCGCG 
GGCGTAGGGA 
CCGGAAGAGT 
CTCGGATTTT 
CGAGAAGATG 
CCTGCGGACA 
GAGTTCCTCA 
GTGCCGATCA 
GAGGACAATC 
GACGAAGACT 
TGGATCCACC 
ATCAAAAACA 
GTCACCGATG 
GATTCGCCGG 
AAGAAGATCT 
AACGACCGCC 
ATGCTGACGG 
GACGGACACA 
AAGGACGGAC 
AAACGTGCAT 
ATCGTGAGCC 



GTAAGAAAGG 
TGTACAGCGA 
AGAAGCTGAA 
TAACGGTCAG 
TGACCGAAGA 
TTCTTGAAAA 
ACTCGGCTTT 
CTACGGCGGT 
AGGCTGACCG 
AAAAAGAAAA 
TTTTCGGCAA 
AGATCAACGA 
ATAAGGAATT 
TCAATGTGGA 
ACTTGGATCT 
AAGTACAGGG 
ATATTCCCCl- 
CGTCTCATAT 
CCACATTCGA 
ATGAGAAGTT 
AGTACACGTT 
AGGTAGTGTA 
CCGACAAAGG 
TGCTGGAGCA 



AACAATCGGG 
CCATGAGATA 
AACGCTTACA 
CGTGGATGAA 
GGAGGTGGAG 
GTACAAAGAC 
CATGGTGTCC 
GAAATGGAGC 
TGGCACCGAC 
GATAGAGGGG 
GAAGCAGGAA 
CACACATCCT 
TTACCGTTCG 
CTATCCGTTC 
GCAGCGCAAC 
TATCGTGCCG 
CAACGTATCG 
CACCAAGAAG 
GGAGAAATGG 
CTATGAGC6T 
CGACGAATAC 
TCTCTATGCT 
CTACAGCGTG 
AAAGTTGGAG 



GTAACGAGCG 
TTCCTGCGTG 
TCCGTCGGCG 
GTGGCACGCA 
AAGTACATCA 
GACAAGGCCG 
GAGCGAGTGG 
TGCGACGGAT 
ATCGTGATGC 
CTCCTCGGCA 
TGGAAAGACG 
GCCTGGACCA 
CTCTATCCCA 
AATCTGACAG 
AAGATTCAGC 
GACTTCCTCA 
CGCTCCTATC 
GTGGCAGACC 
GATAGTCTGA 
GCAGCCAAAT 
CGAACGCTCG 
ACGGACAAGC 
ATGCTGTTGG 
AAGACACACT 



ACAATATATT 
AGATCGTCTC 
AATTCAAAGG 
CGATCACGGT 
ATCAGATTGC 
CCATTATCGG 
ACGTGATCAC 
CGCCCGAATA 
ACATCGATGA 
AATACTGTAA 
GCAAGATGCA 
AAAAGCCTGC 
TGTCCGAAGA 
GTATCCTCTA 
TCTACTGCAA 
CCCTCCTGCA 
TGCAGAGCGA 
GTCTGGAAGA 
AGCTCTTCGT 
TCTTCCTTrr 
TCGAAGGTGT 
ATGGACAGTA 
ATGGTCAGTT 
TTGTCCGTGT 



CCCCGTCATC 
CAATGCCGTG 
CGAGACGGGT 
CAGCGACCGC 
TTTCTCCAGT 
CCACTTCGGA 
GCGCTCTTTC 
CACGCTCGAA 
GGAGAATAGC 
GTTCCTTACC 
AGATACGGAC 
CGACCTCAAG 
GCCTCTCTTC 
TTTCCCGAAG 
TCAGGTTTAC 
CGGGGTCATC 
TGCCAATGTG 
AATTTTCAAA 
CGAATACGGT 
CACCGATATG 
ACAGACGGAT 
CAGCCACGTG 
GGATCCGCAT 
CGATAGCGAT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



if«q. -ii^^" tP'H, ^ 

u '-"Sif r^ii o 



^1!^! 'i^'i' tP^h iP'^h 

Hi 'O! iC:; u^i ii^ 
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10 



ACGATCAACA 
CGCGCCACTC 
AATGTAGCTT 
TTCATGCGCC 
CTCCCCGATT 
TCCGGTGAGA 
CTGAAAGCGG 
CCTGTTGCCA 
AGTATCAACG 
GACTTGGCTC 
CGCAGCCAGC 



ATCTGATCCG 
TCGTGAAGCT 
TCGAATCGCT 
GTATGCGCGA 
CGTACAATCT 
AAGAATCGGT 
AAGAGGCCAA 
CGAAGGAAGC 
ATCAACTGAC 
TGCTCGGAAG 
GTCTTCTC 



CAAGGAGGAA 
GTTCGAAGCA 
CGGAGCCGAA 
TATGGCACAG 
GGTACTTAAT 
AGAGCCTTCG 
GCTGCTCGAT 
CAAGGAGAAC 
CAAATATGCT 
CGGATTGCTG 



AGAGCC6AAG 
CGCCTGCCAC 
GGTGAAGCCA 
CTGCAGCCGG 
ACCGATCATC 
CTCACAGAGC 
GAGGAAAAAG 
AACGCCGTCG 
CAGGACAACG 
ACGGGAGAGG 



TGAAACTGTC 
GGGACGAGAA 
TCCTTATCAC 
GAATGAGCTT 
CGCTCATCGA 
TTAGAGCGAA 
GGAAGAAACC 
AAGAGGCCAA 
AGCTGATAGG 
CTTTGGCCGA 



CGATACGGAG 
GAAGCACTTC 
ACAAGCCGAA 
CTACGGCGAA 
CAGGGTACTC 
AATCGCCGAG 
GGAGGAAATC 
AACCGAAGGC 
TCAGCTCATC 
ATTCA'rrCGT 



1500 
1560 
1620 

leeo 

1740 

leoo 

1960 
1920 
1980 
2040 
20S8 



15 



20 



25 



(2) IHFOWIATTON FOR SEQ ID NO; 20 

(i) StlQUENCE CHARACTERISTICS: 

{A) LEHGTH: 144 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRA}^DEDNESS: double 

(D) TOPOLOGY: circular 

(i1) MOLECULE TYPE: DNA (genomic) 
tiii) HYPOTHETICAL: NO 
(iv) AIITI -SENSE: NO 



30 



35 



40 



45 



50 



55 



60 



{vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE:. 

(At NAME/KEY: mlsc_f eature 

(B) LOCATION 1. . .14 4 6 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20 



GACAITCGAC 
AAACTGATCG 
GTGCTCAAGC 
ATAATGCGGA 
TCCCTTTCGA 
TCCTGCCCTG 
AGGCTTGGAG 
ATAGATACGG 
GACAGAGATA 
ATCATAGGCC 
ACCCATGCCT 
GCTCTGCACC 
ATTCCCGAAA 
GCTTTTTCCG 
GAAATAGGCG 
ACATTCGAGC 
ACGAATGCTT 
CGAATCAACC 
CTGCTGGTCG 
TGGAGTGCGG 
CTGAAAAACG 
GATGTGGCTG 
ACCGACATGG 
CGTGCTGCAC 
GGACTG 



AGAAAAGACC 
ATATTTTGGT 
GAGCGGGCTA 
ATCCTGATGG 
CCTCCGGCAG 
TCATACTGAT 
CTTTCGACTT 
CCTTGCATCT 
CAGCCCGTCA 
GGAGCGATGC 
CTGTCCTGAT 
GTGGGAGCAA 
GTTTGTTCGA 
ACAGGAAAGG 
7\ACTACCGGT 
CGTTGGGCGA 
CCTTGGAGCG 
TGATACATCT 
AAGCCTTCAG 
AAGCTATGCG 
TAGTGGAGCG 
ACTTCGGTTC 
AGGAAGCTGC 
GAGCCTTGGG 



GTGCTTCAAC 
CGTAGACGAT 
TAATCCCGTT 
CGGOrGTAAG 
GGAAGGATTG 
GACGGCTTGG 
CATAGGCAAG 
GGCTGCTCCC 
GCCGAAAGCT 
CATCTGTAAG 
CACGGGCGAG 
ACGAGCCTCA 
AAGTGAGCTG 
ACGGTTCGAG 
CGGCAACCAA 
GAGCGl-CTCC 
AATGGTAGCC 
GCATCTGCCT 
TGAAGCCTTT 
ACGTATCTGT 
TACGCfATTG 
GCAGGTGACG 
TATCCGAGAG 
ATTGAGCCGG 



GCCAATTTGT 
GATGTGGCAG 
ATAGCCAACA 
CCGGCTGTGA 
GAACTACTGG 
GCTTCGA1*TC 
CCATGGGACA 
TCAGCTGTGG 
ACAGTCCAAG 
ATCAAGGAAC 
AGCGGTACGG 
GCCCCATTCG 
TTCGGACATA 
CTGGCTGATG 
GTAAAACTCC 
CACCGAGTGG 
GAAGGACGTT 
CCGCTGCGTG 
GCCCAATCGA 
GCCATGCCCC 
CTCTCGGGAT 
GCAGCAGACC 
ACGCTGACTA 
GCAGCTCl'TT 



ACTTTTACCG 
TCTGTGCCGC 
GTCCCGACGA 
TTCTGATGGA 
AGAAGATGCA 
CACTGGCAGT 
ACGATCGGCT 
CGAATCCATC 
AGAATGACCC 
GGATACGCCG 
CCAAAGAGTT 
TCAAGGTCAA 
AGAAAGGAGC 
GCGGCACGAT 
TGCGAGTGCT 
ACATCCGTGT 
TCAGAGAGGA 
AGCGTCAGGA 
ACGGATTGCC 
TACCGGGCAA 
CGAGAGAAAT 
ACTCCGACGA 
AATACAACGG 
ACCGGCGAAT 



TGCAATGGAA 
ACTGCGTCTG 
AGCTTTGTCC 
TATGAATTTC 
GATATTCACT 
GGAGGGAATG 
CCTTCGTACC 
GGAACACTCT 
CTGTGCCCAT 
CATAGCTCCC 
GATAGCCGAA 
TTTGGGTGGG 
TTTTACCAAT 
CTTTCTGGAC 
ACAGGAACAG 
GGTATCGGCT 
CCTCTACTAT 
GGATATACAG 
CCATGCCGTT 
TGTACGCGAA 
CAGTGCCCGG 
ACGGGCTTl'G 
CAACGTTAGT 
GGAGAAATAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
120O 
1260 
1320 
13G0 
1440 
1446 



65 



70 



75 



(2» TNFORlwriON FOR SEQ ID NO: 21 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 789 bas- pairs 

(B) TYPE: nucleic acid 

(C) STRAJJDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAL: HO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



»n=3t iP=ii H"-' -i!™!! trti "'"ii rf^ii -il^ii ..i*' -^-itii tr«*i ^tF=*il 

ujj o ^jLjw o o 10 iLJi -Oi in:: o iUJ u 
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iiv) AIITI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM : PORYPHYROMOHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY: mi sc feature 

(B) LOCATION 1. . .7B9 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:21 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
709 



AGATCTCTGC AATCATTTCA AAATAAAAAA 
CACGCCACAG TACAGGGCAA AGAGATATTG 
GAGATTCATG CTATCATGGG GCCGAACGGA 
GTGGGACATC CCTCCTTTGA AGTCACGGAA 
CTCGAACTCG AACCGGAAGA ACGTGCACAC 
GTCGAGATCC CGGGCGTCAG CATGGTGAAT 
AAAGCGATCG GAGCAGAACC CGTATCGGCA 
CGTGCCATTG TGGAGCTGGA CAACAAATTG 
GGTGGAGAAA AAAAGAGGAA CGAAATCTTC 
ATTTTGGACG AAACCGATAG CGGGCTCGAT 
GTAAACCGAC TCCCCTCTCC GGAGAATGCT 
Cl'CGAGTACA TCAAGCCGGA CTTCGTACAC 
GGAGGAGCCG AGCTGGCTCT CACr^TTCGAA 
ATAGGAGAA 



CACTCAAGTA TGCTTAAGAT AAAGAACCTC 
AAAGGAATCA ATCTGGAGAT CAATGCCGGA 
TCGGGGAAAA GTACGCTCTC TTCCGTTTTG 
GGAGAGGTGA CATTCAATGG AATCGACCTG 
CTCGGACTCT ITCTCAGTTT CCAATATCCG 
TTCATGAGGG CAGCTGTCAA TGAACATAGG 
AGCGACTTCC TCAAGATGAT GCGAGAGAAG 
GCCAGCCGTT CTGTGAACGA AGGCTTCTCC 
CAAATGGCTA TGCTCGAACC CAAGCTGGCT 
ATCGACGCrC TCCGCATCGT AGCAGGCGGG 
GCTATTGTCA TCACACACTA TCAGCGTTTG 
GTCCTTTACA AGGGGCGCAT CGTCAAGTCG 
GAAAAAGGCT ACGACTGGAT CAAGGAAGAG 



(2) IKFORHATIOIJ FOR SEQ ID NO: 22 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1306 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

tiij MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

Uv) AI^TI -SENSE; NO 

(vi) ORIGINAL SOURCE: 

(Al ORGA/JISM: PORY PHYROMONAS GTNGIVALIS 

tlx J FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . .1386 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22 

S^^I^S?''^ AGGAGAAAAC GATCTACGTC TGCCGTTCGT GCGGAACCAA ATACGCCAAA 
TGGCAAGGCA ACTGCAATGC CTGTGGAGAG TGGAACTGCA TTGATGAGGA gAaGG^S^ 
CGGGCAAGCA TGCACCCAAG AGTTTTATGC ci^G^A ^C^CcS 
^A^AGGATGT GGAGTCCGGC GATGAAGAGC GTATTCGCCT CGGCGA^G^ 
GAGTTCGACC GCGTACTGGG TGGAGGAATT GTCAAAGGAG CATTTGTCCT GCTTGGC^ 
^r^^n^^ TCGGTAAGTC CACGCTTTATC CTCCAGACGG TGCTGC^^S ^G^^^G 
CGCACGCTCT ATGTGTCGGG CGAAGAAAGT GCCCGACAAC TGAAGATGCG CGCCGAAOTA 
^C^C^C^ ^^l^r TACTGC^ C^^A^^^ ^G^^^C 

Ico^nrr^ ACCCGATCTC CTCGTGATAG ACTCTATACA GACGGTCTAT 

CT^?^ A??2^Sc ?f rS^^S^ GTGCKJGCAGA TCCGCGAATG TGCcSJcT^A 
r^irSiSr^I ACTGCAAGAC TACGGGTATC CCCGTCATCG TCATCGGACA CATCACCAAA 
TAGCCGGACC GAAGGTGCTG GAGCATATAG TGGATACGGT GCTTC^^ 

A^cc^GC tgS^tI?! S^^^f S^'^^^ ctccgaggac agaagaaccg 

tggggatata cgagatgcgg caggacggtc tgcgtggcgt ggagaatccg 

AGCGAACATC TCATCACACG CAATAGGGAA GACCTCAGTG GCATAGCCAT MCCGTAGCr 

TTCGCCCGAT ACTCATCGAA GCGCAGGCTT ^gS^CAG^C ^C^^ o.n 

^^^l'''''^'' AGCGTTCGGC CACGGGCTTC GATATTCGGC GGATGAaSt GCTcrilScc 1070 

nlnnl^f''''' AACGTGCCGG CTTCAAGCTC ATACAGAAGG ^GTGTTT^ ct^^^^^CC ^080 

ggaggtatca aaatagccga tccggctacg gatctc^ccg ttatctcggc ag?g^c? ^?!n 

r^rtrZ':'' ^^^CGTTAT CCCGCCGGCC GTATGCATGA CgS^^^T CG^^CTCC ^2^0 

^^^^^^"^ GTCCCGTGAG CCGCATCGAG CAGCGCATAA CGGAAGCGCG TC^S^ illn 

Ti^^2^^ TATTGGTACC GGCCGATAAT TTCCGGCAGG AGGATGCraG CC^C^ ^320 

AGA^"^"^^ ^^CCGGTCAG AAAGGTGGAG GAAGCCTTCC GCCAT^^^; S^^A ^380 



60 
120 
180 
240 
300 
360 
420 
480 
540 
60O 
660 
720 
780 
840 
900 
960 



1386 
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(2 J IMFORIIATIOH rOR StlQ ID NO; 23 

(iJ SEQU&JCE CHARACTERISTICS: 
5 (A) LENGTH; 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNE5S: double 
ID) TOPOLOGY: circular 

10 (li) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

tiv> WITI -SENSE: HO 



15 



45 



55 



(vij ORIGINAL SOURCE: 

(A) ORGWIISH: PORYPHYROMOdAS GINGIVALIS 



(ixj FEATURE: 
20 (A) NAME/KEY: mi5c_f eature 

(B» LOCATION 1. . .1119 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23 

25 GGGTC1"rGTG GAAGTAGCCC AGCAATTGGC CGATCAGGGT GTTCGCGTTG TGATCGCCGG 60 

ATTGGACATG GACTTTCGAC GTCACCCTTT CGGACCTATG CCGGGCTTGT GTGCCATAGC 120 

CGACTCCGTG ACCAAAGTl'C ATGCCGTGTG TGTGGAATGC GGCC6ATTGG CCAGCTATTC ISO 

TTTCCGTCGT GTCCAAGGCG ATCAGCAAGT GATGCTGCGC GAACTGAACG AATACAGTCC 24 0 

CCi'CTGCAGA ACCTGCTACA GGAAATGCAG TTCTCCCCCA CAAACAGAAG AAATCCATl'C 300 

30 GACAATATGA ATAGCAGACA TCTGACAATC ACAATCATTG CCGGCCTCTC CCTCTTTGTA 360 

CTGACATTGG GCGGCrGCTC CGTAGCCCAA CAAGATACGC AGTGGACTCT CGGCGGAAAG 420 

CTCTTTACTT CGGCGTGGAT ACAACGTTCG GCCGAATATC AAGCGCTTTQ CATTCAGGCA 480 

TACAACATCG CTACGGAAAG AGTGGACGCT CTACCGGCAG AACGTAAACA AGGAGATAGG 54 0 

CCTTATGCCA TCGTAACGGA CATAGACGAA ACCATTTTGG ACAATACGCC TAACTCCGTG 600 

35 TATCAGGCTC TCAGGGGCAA GGATTATGAT GAAGAGACTT GGGGGAAATG GTGTGCACAG 660 

GCCGATGCCG ACACACTGGC AGGAGCTTTG TCTTTCTTCC TCCATGCAGC GAACAAGGGG 720 

ATCGAGGTCr TTTACGTCAC CAACCGCAGA GACAATCTGC GCGAAGCAAC TCTTCAGAAC 780 

CTTCAGCGTT ACGGATTCCC CTTTGCCGAT GAAGAACATT TGCTTACGAC CCATGGGCCA B4 0 

TCCGACAAAG AACCCCGTCG GCTCAAAATA CAAGAACAGT ATGAAATAGT ATTGCTCATA 900 

40 GOAGACAACT TGGGCGACTT CCACCACTTC TTCAATACGA AAGAAGAGTC CGGACGCAAA 960 

CAGGCTCTGG GCCTGACAGC CGGGGAGTTT GGCCGGCACT TCATCATGCT GCCCAATCCC 1020 

AACTACGGAT Cl'TGGGAACC GGCATGGTAC GGCGGGAAGT ATCCGCCACT GCCCGAAAGA lOflO 

GACAAAGCAC TTAAACAACT GCACTCACAG AACAGCAGA 1119 



(2) INFORhlATIOt^ FOR SEQ ID NO: 24 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 8 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



lii) MOLECULE TYPE: DNA t genomic J 
(lii> HYPOVKETICAL: HO 
(ivl ANTI-SENSE: NO 



60 (Vl) ORIGINAL SOURCE: 

tA) ORGAfJISH: PORYPHYROHONAS GTNGIVALIS 

(ix) iXATURE: 

(A) NAMLVKEY: misc_feature 
65 tB) LOCATION 1...1278 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 

CGTTGGGGTT TTTCTAATTT TGTGOCCATG AGCACCAATA TAGATGTACA ACAGATCAAA 60 

70 CAGCGTTTCG GCATCATCGG TAGCAGTCCG CTGATGGAAC ATGCCATACG AGTGGCAGCA 120 

CAGGTGGCTC CTACCGACAT GTCCGTCCTC GTGACGGG-SG AGAGCGGTTC GGGGAAAGAG 180 

TTCTTCCCAC AGATAATCCA CTACTACAGC GCCCGGAAAC ATCATAGCTA CATTGCAGTC 240 

AATTCCGGAG CCATCCCCGA AGGAACCATC GATTCCGAGC TGTTCGGACA CCGCAAAGGT 300 

TCCTTTACCG GAGCCGTATC GGATCGCAAG GGGTACTTCG AAGAAGCATC CGGCGGCACG 360 

75 ATCTTTCTGG ACGAAGTGGG CGAACTGCCT TTGCCCACGC AGGCGAGGCT GCTGAGGGTG 4 20 
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10 



15 



CTGCAGACGG 
ATCGTAGCRG 
GACCTCTTCT 
GACGACGTGC 
CCTCCGCTGC 
AATGTGCGAG 
GTATCGGCAC 
GTCGTGATCC 
GAAATCATCl* 
ATGAACCGCC 
AACGACGACA 
GCGAACGCGG 
GTTTCGCI'GG 
AGCCGCAAGC 
AAGGAGTATG 



GCGAGTTCAT 
CGACGAAT'iT 
TCCGGCTCAA 
CCTTGCTTTT 
GCCTATCGGA 
AGCTGCGCAA 
AGACCATCAC 
GACGGAACGA 
ACCAGGTGCT 
TGGCGCACCA 
AGCGCACCGC 
CAGAACCCGT 
AGGAGGTAGA 
AGACAGCCGA 
GACTGGAA 



CCCCGTAGGA 
GAACCTCAAG 
TACGGTACCG 
TCGCCGATTC 
CGAAGCCCGT 
TATAACCGAC 
TCGCTACCTG 
AACGACCGAA 
ATACGATATG 
CGAACAGCCC 
AGATCCGAAG 
GGAGCCGATA 
G7^GAAAATG 
GGAACTGAAG 



GCCAGCCAGT 
GAGGCGGTAG 
ATCGAGGTGC 
GCCGCCGACA 
ACCATATTAA 
AGGCTGAGCA 
GACGCTGAGG 
GCGGACAAAC 
AAGAAAGAGA 
TCATGGCCTG 
TGGGGCGTCA 
CAGGAAGCCA 
ATTTCCCTTG 
ATTTCGGAGC 



CGCAGAAGAC 
CGAACGGGAA 
CTGCGCTGCG 
GCGCCGAGAA 
TGCGTTACCG 
TCCTGGAGGA 
GGATGCAAGA 
AAATCCCCCA 
TAGCCGATTT 
TAGGGTCGGA 
GCACGCACAA 
GCGAATACAC 
CATTGGAACG 
GGACACTATA 



GGATGTCCGT 
GTTCCGGGAA 
TATGCGACCG 
GTATCGGATG 
CTGGCCCGGC 
GGAGCGGACG 
CCTCCACCCC 
TTACGAGCGC 
GAAGGGGATC 
CGTCTGGGGC 
GGCCCCCATC 
CGAGGATCCG 
CCACCGCGGA 
CCGTAAAATC: 



460 
540 
600 
660 
120 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1278 



20 



25 



30 



(2) INFORMATION FOR SEO ID NO: 25 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 1969 base pairs 
(Dy TYPE: nucleic acid 
<CJ STRAUDCDNESS : double 
(D) T0P0I^3Y: circulax 

(11) MOLECULE TYPE; DfJA (genomic) 

(ill) HYPOTHETICAL: NO 

tlv) ANTI -SENSE: MO 



35 



Ivil ORIGINAL SOURCE: 

(AJ ORGAMIS1-J: TORY PHYROHONAS GINGIVALIS 

UX) FEATURE: 

(AJ HAHE/KEY: misc_f eature 
(B) LOCATION 1. . .1959 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUENCE DESCRI PTIOt^f : SEQ ID NO: 25 



AAAAATCTTC 
CCTGACAACG 
GTATGCCGAC 
GAAGGCCGTG 
AAAGGCAATC 
GTTOCCGACA 
CATCCTTTCG 
CCCGTAGTAA 
AACTACGATA 
ACAGGAGTGA 
CGCATATTGA 
CGCATCGTTT 
GCCATTATCA 
CGACGGATGG 
CCCGTAGGCG 
TCACTCCGAA 
TACCGATTGA 
ATAGTCAGTC 
CTGAAAGATG 
AGGCTCAATC 
ACGCTCCCCG 
ATGATGACAG 
GTACCCGAAG 
GGCATCTCTA 
GGTGCCATCC 
AATl'CGGGAG 
GCCTTTGTGC 
TCTGTCCGTT 
TATCTGGATA 
CGTCAGGCAG 
GAGCTATGGA 
CGATCTTGGG 
GATGAGTCCG 



CGAGGTACGC 
ATGATAGTCG 
GAATTCAAAC 
CCGCAGCTCG 
TGCAAGAGAT 
TGGTTTCGTT 
CCGAAGAATT 
GATACGAGAG 
CGGCTGCCGA 
ATACGATCCC 
AGCGAGACGA 
TCTTCGAACC 
CAAAAGAAAG 
GCATCCGTAT 
GGCCTGTCGG 
GCGGATTCAC 
TGGGGCTTGG 
TGCCCATAGC 
CAGGTGATGA 
CCGAACATGA 
CCCTCGGTCT 
CAGAGGTACG 
GCCGAGAGGC 
rt'ATCGGAAC 
GTAAGCAAGT 
CCAAGAGTGA 
AGTATGGCAA 
CGGTAACGGT 
CGCACAGTAA 
GTTGTTCGGA 
CTATGCCGAG 
AAACTTGCCG 
GAGCGATTCG 



TCCGGCATTT 
TGGGAGAAGC 
ACCTCTTCCG 
CGTGCTGGAT 
CCAGAGTAGC 
TTGTCGGAAA 
GCACGCTTCA 
ACAATACCCT 
GCGGATGCTT 
CAAGCTGGCT 
ATCGGTTGCT 
GCATGCGGAC 
CGGAGAGAGC 
ATATGCCGTC 
TTTGAGACGG 
TACCGGCACC 
CTCTCTCGCC 
GGAAATTCGA 
TCCGGATGTG 
GGAAGTCCGC 
GGAGGTCGGA 
CCGACTCTAT 
TGCTACCCAG 
ATCGGGAGTC 
GGGTATTGCC 
GCGTTATGl'A 
TTTCGTCGGC 
AGGAATCATG 
AAAGGTAGTG 
AGACATCCAT 
TGCGGAGGAC 
CCCATCGGTA 
TTTTCGTATC 



GGCAGAAATC 
CATAGACAAT 
CACATGATCC 
GAAGCGGG7VA 
CACGGCCATC 
GAAGAGATCC 
GTGGCAGAAG 
CCACGCGAAG 
GGCGATGGCQ 
GCTTTCTGGA 
TTGGCAGAGA 
GAGGAGCTGA 
6GTTACTTCC 
GTACGTCCCC 
GCGGTAGAAC 
ACAGCTACCG 
GAAGCTCCCG 
GAGGAAGAAG 
ACCAATGGCA 
TTCCTGCAGG 
GGTCCGGCTA 
GCGCAGGGAG 
ACATTCAATC 
GTGAAACCTT 
ACCGCCTTGG 
AAAGGAGCCT 
GAATCACTCA 
CTCGGCAAAG 
ATGAATCGGG 
GCCATAATAG 
AGCGATCGAC 
CCATCGGCCG 
GGTGGAGAA 



GTAAAAGGGA 
CGGGAAGGGC 
TACTCITCGG 
GTCCGTTTTT 
GTCTGACAGG 
GACTGATCGT 
CC^'CTGAACA 
AAGGTATCGT 
TGCAGCGTCT 
AAGAGCGCAC 
AGAACGGCTT 
TGCAAGCCGT 
GAGAAAAGAT 
CTTTGCCTCC 
GCCTCGTGCC 
CTGCAGTAGT 
TAGAATTGCC 
ATGCTGTCGT 
TGGCGGTATG 
GTGAAGGGGT 
TCAACCTCGT 
GTGTGGATAT 
CCCGACTCGG 
TTTCGGCCGA 
GAGCCAATCA 
ATCCGGCACT 
GTTGTGTAGC 
CAGTGAAACT 
ATTTCCTGCA 
ACAGCCTGAA 
TGCTACGAAA 
AATTAGAACT 



ACAACCTGAC 
TGTCACGGTT 
CGGTACTACG 
CTACTCCACC 
AGCCATGACG 
GGACGCCGCT 
AACAGGTATC 
CTGQTGTGCA 
GCTGATGCTC 
CACCTTTTGC 
TCCTGCGGAG 
TCGCCCCGAT 
AGAAGCTGCC 
TTCATTCATT 
GGGATTCTTT 
AGCAGCCATG 
TTCGGGCGAA 
ATCCGCAGTC 
CGCTACGATC 
GGGGGTAGTG 
ACCTCGACGA 
TACGATTAGC 
CATACGGGAC 
AGCGTTCGTT 
TATCGTCCTC 
CATTCCACAG 
TTCCTTCCCT 
CGCCGAAGGC 
CGAACTGGCT 
TTTGGCTCGT 
GATl-GCCGAA 
CCTGCTGATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
126Q 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1959 
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20 
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30 
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12} IllFOWlATIOM FOR SEQ ID IJO:26 

(11 SEQUEHCt: CHARACTERISTICS: 

tA> LEHGTH: 13S3 base paixs 
(B) TYPE: nucleic acid 
(CI STRAf-ZDEDHESS : double 
IDJ TOPOLOGY: clccuiar 

(ii) MOLECLiLE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: HO 

(iv) Al^TI-SEHSE: NO 

ORIGINAL SOURCE: 
(AJ ORGAIIISM: PORYPHYRa-tOHAS GIUGIVALTS 

(ix) FEATURE: 

(A) NAI IE/KEY: misc fcature 

(B) LOCATION 1...1353 

(xil SEQUEHCE DESCRIPTION: SEQ ID NO:2S 

CAACAAAGAC CACATAGAAT TATTATGTTG AGGACTTTCC GAATCGGTGG TATTCACCCC 60 

CCCGAAAACA AGTTGTCGGC AGGGAAGCCC GTAGAGGTGT TGCCTATCCC CTCACAGGTA 120 

GTCATCCCTC TTGGTCAGCA CATCGGTGCA CCGGCAACTG CCACGGTCAA GAAAGGGCAT 180 

GAAGTTAAGG TCGGGACTAT CATTGCTCAG GCCGGAGGAT TCG'l'ATCAGC TAATATCCAC 24 0 

TCATCTGTGT CGGGTAAGGT GCTGAAGATC GATAACGTAT ACGACTCAAG CGGCTATCCC 300 

AAGCCCGCAG TCTTCATTAG CGTAGAAGGT GACGAATGGG AAGAGGGCAT CGATCGCTCA 360 

CCAGCCATCG TCAAAGAATG CAATCTGGAT GCAAAAGAAA TCGTAGCCAA AATTTCTGCA 420 

CCCGGTATTG TGGGTCTTGG CGGTGCTACC TTCCCTACCC ATGTGAAGCT GTCCCCTCCT 4 80 

CCGGGCAACA AAGCTGAGAT CCTGATCATC AACGCCGTAG AGTGCGAGCC TTATCfGACG 540 

AGCGACCATG TCCTTATGCT GGAGCACGGC GAAGAGATCA TGATCGGCGT GAGTATCCTG 600 

ATGAAAGCCA TTCAGGTAAA CAAGGCCGTC ATCGGAGTTG AGAATAATAA GAAAGATGCT 660 

ATTGCTCACC TCACCAAACT GGCCACTGCA TATCCGGGCA TAGAGGTAAT GCCGTTGAAG 720 

GTGCAATATC CTCAAGGCGG TGAGAAGCAG CTGATCGATG CAGTGATCCG CAAGCAGGTA 780 

AAAAGCGGTG CCTTGCCTAT CAGCACAGGT GCCGTAGTAC AAAACCTGGG TACGGTATTC 840 

GCCGTGTACG AAGCAGTACA GAAGAACAAG CCTCTGGTCG AGCGCATCGT GACGGTTACA 900 

GGAAAAAAAC TGTCTCGTCC GTCTAACCTC CTCGTTCGTA TAGGTACTCC TATTGCGGCT 960 

TTGATCGAAG CAGCAGGTGG CTTGCCGGAG AATACGGGCA AGATCATCGG CGGAG6TCCG 1020 

ATGATGGGAC GCGCTCTGCT GTCACCGGAT GTGCCTGTGA CCAAAGGCAG CTCCGGAGTA 108 C 

TTGATTCTCG ATAGAGAAGA GGCAGTTCGC AAGCCTATGC GCGACTGTAT CCGATGCGCC 1140 

AAGTGCGTCG GAGTGTGTCC GATGGGACTC AATCCGGCTT TCCTTATGCG CGACACCTTA 1200 

TATAAGAGCT GGGAAACAGC GGAAAAAGGC AACGTGGTTG ACTGTATCGA ATGCGGTTCG 1260 

TGCAGCTTCA CCTGTCCGGC CAACCGTCCT CTGCTGGATT ATATCCGCCA AQCCAAGAAG 1320 

ACTGTGATGG GTATCCAAAG AGCACGTAAG CAA 1353 



(2) IHFOWIATIOH FOR SEQ ID NO: 27 

(ij SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 1467 base pairs 
(D) TYPE: nucleic acid 
<C) STRAWDEI3NESS : doable 
(D) TOPOI.OGY: circular 

(11) MOLECULE TYPE: Dt^ (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANT1-SEI4SE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGAl^ISM: PORYPHYROt^aiAS GIHGIVALIS 

fix) FEATURE: 

(A) HAI'IF./KEY: mlsc feature 

(B) LOCATXOtJ 1 . . .lT67 

(xi) SEQUEl-tCE DESCRIPTION: SEQ ID NO: 2? 

ATGAACTCTC AAAAGAAAGA GGCTTTTAAT ATGAAAAGAA TACAACTAAC TCTTATCGCT 60 

CTCTTCGCCG CTGTTGCCGG TTTGGTCGCT CAAAATGCTT ACGAGGGAGT AATTTCATAT 120 

AAAATTTCGT TGGACAAAAC CGGAAACAAG CTTGTACTGA ATGGTGCGGC AGATATGAGT 180 

AATTTAAAGC TCAAGAGCAC TCAGATGATC ATTGTTACGC CTATTCTTCG TTCAGAAGAT 24 0 
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25 



35 



70 



(2) INFORIIATIOTJ FOR SCQ ID NO: 28 



( 1 ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1152 base pairs 
tB) rfPE: nucleic acid 
tC) STRAl^lDEDt^tESS: double 
(D) TOPOLOGV: circular 

{ii» IIGLECULE TVPE: DMA (genomic) 

(111) HYPOTHETICAL: HO 

(iv) AIITI -SENSE: NO 



(21 IWPORl-IATIOIl FOR SEQ ID NO: 29 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 baee pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: double 
(D) TOPOLOGY: circular 



300 
360 
•120 
400 



^Jn^S^S^ GGGTGGAATT TCCTTCGGTA GTCATTACAG GCCGCAATAG AACAAAACCT 
SSSc^Tc TTGCCCCAAG CAAAACATGC AGCTCAATAC 

ATTCGCCGTC ATAATGGGAA GACCGAGCAG TTTGCTTTrA CAGGAGAACA TGCTTATGCA 

c l^Tlt^l^^ TGGATGCCAA GTTTGTGGTT CGTGAGGAGG TACGAGGTTG TGCTAAATGC 4 00 

^^iiSSIS I^^AGTAA TATTGTTCCT TTTGATCCAC TCTTCAATCC GGCAGAGGCT 540 

^^T^r^ l^^''^'"''^ TACTCCGGCA GAAGAAGTGG AAAAACAGCG AGAGTCCAGC 600 

TTCGATGCTT ATATCAACTT CAAAGTCAAT AAGGCAGATG TCCTTCCTGA GTATCGCAAC 660 

5?^??^^ AGTTAGAGAA AATCAAAGAA TTTtTTAAGCA cS??Xi^C ^I^TCC^C 120 

TATTCGGTCA ATAAAATGAT CATCGAAGGG TTTGCTTCTC CCGAGGCTTC AATACCCCAC 7QO 

^ TGTCGGAGCG CCGTGCTAAA AGACTCGCGG AAGAATTGg5 Ho 

^r^S^J Iff'''''^^'^'* AACCACTGAA TTCG'SCGGTG AAGATFGGAA GGGGCTGAAA 900 

CTGGCTATCG AAAAGAGTGA TATAGCCGAT CGTGACCGCG TATTGGAGAT AATCAACTCC Igo 

S^^^SJ^ CCGATGATGA TGCACGTGAA CAGGCTCTGA AGCAAC^^^C ^^^^^ 1020 

15 G^C^I?? A^^^^^ TCCGAATTTG CGTCGCAATA CGATAACCAT GGGGTATATC ^080 

tJZ^^^^ll ^J^^CCCTCGA AGAAGCTCG-T GAAATCATTA AGACTGCTCC GAAAGAACTT IHO 

AAATGTACCG TGTGGCAATG TCTTATCCTG AGGGGCACCA AGAGCGTTTG 1200 

TZ^^^^^ ATACGACCCT TAAGTATTTC CCTGAAAGTG TAACGGGCCG AATCAATTTG 1260 

^^B^^^?^^ CrrrTAATGG TGGAGACGTT CAACAGGCAA TTGCTCTGTT GAGTCCGATT 1320 

20 CAATATCCTT GGAGCTGCTT ATGCTCGTAC GGGAGATTTT 1380 

'° ^c?c-^?A r^r^ o^^roc^^o gagatgcaaa tgcgcagcgc ir.o 



tvl) original SOURCE; 
^ (A) ORGAI-JISH: PORYPHYROI-IONAS GINGIVALIS 

{ixj FEATURE: 

(A) NAME/KEY: inisc_f ea ture 
(D) LOCATION 1...1152 

^5 (xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 28 

GACATGGCAG AAAAAAGAGA CTATTACGAA GTCCTCGGTG TATCGAAGAA TGCCACCGAC 60 

gSg^WG ^^'^JL''^ TCGCAAGAAG GCTATCCAAT ACcSJ^S^ ?^^Cc2c 120 

qn nfi^^ AGGCCGAAGA GCACTTCAAA GAGGTAGCTG AAGCCTACGA CGTATTGAGC 180 

50 GATCCGCAGA AGCGCAGTCA ATATGACCAG TTCGGCCATG CCGGATTGgS CGGAGCTGCC 240 

^I?™- rS?S^^ CGGTATGTCC ATGGAGGATA TTT^^G^CG ???^GAT 300 

Stg^^cSa ^fn^^f TTTCGGCGGA TTCTCCGATA TGGGCGGTGG CAGTCGCAGA 360 

WGctS^G A^ff^fl^^ CCTGCGAGTA CGAGTGAAGC TTTCTTTGGC CGATATAAGT 4 20 

AiaAAGAAAGT GAAGGTAAAA AAGCAGGTAG TGTGCAGCAA ATGTCGTGGC 48 0 

^n^^^S? ^^^^^^ TGGCAAGACT ACCTGCCAGA CCTGCCATGG AACCGGCGTG 540 

?^^cSI« I^rS^rnS GCCATGCAGA CCCAGAGCAC TTG?^Sc^ ioo 

TGCCACGGAG AGGGTGAGAT CATCACGAAG CCATGCI'CCA AGTGTAAGGG CGAAGGTGTG €60 

^g™gA ^^^Ir^l ^^^^^^^ ATCCCTGCCG GTGTaS^C^ AG^^ 7^0 

BO ^^J^^tl^^ ACGGCAAGGG AAATGCCGCG CCCCGAGGAG GCGTGAATGG CGACTTGATA 7 80 

S^CTGCTTA TA?c™o ^Si^^^^^^ CTGATCCGCA ATGGCAACGA TCTGaJa^IS 840 

S^C^aSI? IrllrlZ^^ 2TI^^'^^ AAAGGAGGTA GTGTGGAAGT GCCGACGATA 900 

CCAAGATCCG CATCGAGGCG GGGACACAAC CCGGCAAGAT GCTGCGTTTG 960 

S;^^^^ CGTAAACGGC TATCCCATGG GAGACCAACT GGTG^GT- 10^0 

B5 ^ItT^^I^ tccccgaatc gatcgatgcc aaagatgagc aggctatcgc agcgatSS^ loSo 

^S5gct^ gcttcaaacc taccgatgct gctcgtaagg atatagaov\ gaaata?^ i?4c 
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(ii) MOLECULE TYPE; DHA (genomic) 

(ill) HYPOTHETICAL; NO 

UvJ AIITI-SEHSE: HO 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: fORYPHVROMONAS GIWGIVALIS 

( ix ) FEATURE : 

(A) MAME/KEY: nxisc_ireature 
(B» LOCATION 1 . . . 92'? 

15 Ixi) SEOUEHCE DESCRIPTIOII: SEQ ID NO:29 

CGATTCGATT CAACAACTAA TGTCTCACAA ATTAATTTAA GAACAGAGAT GAAAAAACTG 60 

ATTTTAGCGA CTTTGGGACT TATGGCCATT GCCATGCTCT CATGTTCAAG CAACAACAAG 120 

GATTTGGAGA ACAAAGGGGA GGCTACTCTT TTGGTAACGT TTGGTAGCTC CTATAAAGCT 18 0 

20 CCACGCGAAA CCTATGCGAA GATTGAGAAG ACTTTTGCCG CAGCTTATCC CGATCAAAGG 24 0 

ATAAGCTGGA CATACACGTC ITCTATTATC CGAAAGAAAC TGGCTCAGCA GGGTATTTAT 300 

ATCGATGCTC CGGATGAGGC TTTGGAGAAA TTGGCTCGTC TGGGTTATAA GAAGATCAAT 360 

GTACAGAGTC TTCATGTGAT TCCCGGCCGA GAATATGATG AGATGATCGA CTTTGTC7VAT 420 

AAGTTTAAGG CAGCACATAG TGATATTACT GTGAAGGTAG GGGCTCCGCT TTTCGATACC 4BQ 

25 GATGAAGATA TGCGCGAGGT GGCAQAGATC TTGCACAAGC GTTTTCAGCA AACGATAGAG 54 0 

AAAGGTGAAG CTATTGTATT CATGGGACAC GGCACCGAGC ATGCTGCCAA TGACAGGTAT 600 

GCCCGTATCA ATAAGATCAT GAAGAACTAT AGCAAGTTCA TGATCGTCGG AACCGTCGAG 660 

TCCGATCCCT CTATCAATGA TGTTATTGCC GAACTGAAAG AAACCGGTGC CACGGCCGTA 7 20 

ACAATGATGC CGCTGATGAG TGTGGCAGGC GACCATGCTA CCAAT6ATAT GGCCGGAGAT 780 

30 GAGGACGATA GCTGGAAGAC GTTGCTGACC AATGCCGGCT ACACAGTTTC TATAGACAAG 84 0 

CTGGACAATG GCAATTTCTC AGCTCTTGGA (SATATAGAAG AGATCCGGAA TATCTGGCTC 900 

AAGCATATGA AAGCCACCTC TGCTCGC 927 

35 (2) INFORMATION FOR SEQ ID NO: 30 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1473 base pairs 

(B) TYPE: nucleic acid 
40 (C) STRANDEDNESS : double 

(DJ TOPOLOGY: circular 

(ii) MOLF.CULE TYPE: WPi (genomic) 

45 (iti) HYPOTHETICAL: HO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 
50 (A) ORGANISM: PORYPHYROMOMAS GINGIVALIS 

(ix) FEATURE: 

(Al NAME/KEY: talsc feature 
(B) LOCATION i.-.l^TS 



55 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 



CGGAGAGATG CGCGGAGGCA ACTGGTACGC CGTAGGAGGA AAGAGCTATC TGGCACAGCA 60 

AATACGCGAT GCCRGACGAG AGTA'ITTCCT GAAAGACGAT CAGCGATCCG GTGGTGrTTC 120 

60 CCTCGACTTC GAGACGGTCT ACAGCCGATC CGATTCGGCT CGCTACTGGC GAATCCTCAA 180 

CAGCTATCCC GGGGAGTTCG GTTACGAAGC TTTGAAAGCC GAAGACAGCC GATATGCCGA 240 

TTTCAGGGCA TTCAAGGAAA AAGGCGTGAT ATACTGCAAC CTGAGGGAAA CAGCCTTCTA 300 

CGAACTCATG CCCATGCATC CCGATTGGGT GCTGGCCGAT CTTATCGCTA TeiTGCACCC 360 

CGGACTACTT CCCGACCACC AACC«3CAU"rT CTATTATTTG CTCCAATGAC ATCCGTCAGC 420 

65 CACTTACGTA CAATTTCTGT CGCAGGTATC CTGGCTGCGC TGGGAGGGGC TCTACTCATT 400 

CTCTTCGGGG TTAATCTCTT CCTCGGCTCG GTGGCTATTC CGATGAGCGA GATCTTCCGA 54 0 

CATCTTTTTT CAGATCGTCC CGAAGGAGGA GAAGCACTCG TGCACTACAA TATCCTATGG 600 

AAATCCCGCC TGCCCGAAGC CCTCACGGCT GCTTTTGCCG GCGCAGGTTT ATCCGTTAGT 660 

GGerCGCAGA TGCAGACCGT CTTTCGCAAT CCTTTGGCCG GTQCGTCCGT TCTCGGCATC 720 

70 AGCTCCGGTG CCAGTTTGGG TGTTGCTTTG GTCQTTCTGC TGAGCGGCTC GCTGGGAGGA 700 

GTGGCATTGA GTAGCCTGGG TTATATGGGC GAGGTGCCCA TGAATATAGC CGCTGCCGTA 84 0 

GGCTCGCTGG CAGTAATGGG GCTGATCGTT TTTGTCAGCA CCAAGGTGCG CAGCCACGTT 900 

ACGCTGCTCA TTATCGGCGT TATGATCGGA TATGTAGCCA CTGCCGTCAT CGGGGTATTC 96 C 

AAGTTTTTCA GTATCGAAGA AGATATTCGG GCATACGTAA TTTGGGGGTT GGGCAGCTTT 1020 

75 TCCCGTGCCA CCGATTCGCA ACTGAGTTTC TTTGCCATTC TGATGTTGAT CTTTATTCCG 1080 
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GCCGGTATGC TCCTTGTCAA GCAGTTGAAT CTCTTATTGC TGGGAGAAAG CTACGCACGT 1140 

AATCTGGGAC TGAATACTCC TCGGGCACGG CTGCTCGTGA TCTCTTCCGC CGGTTTGCrC 1200 

ATCCCTACCG TCACGGCCTA TTGCGGTCCC ATCGGCTTTT TGGGGATGGC TGTGCCACAC 1260 

TTGCCACG-3G TTATCTTTCA CACATCGGAT CATCGGATCC TGATGCCTGC TACCTGTTTG 1320 

ATTGGAAGTG CTCl'GGCTCT TTTCTGCAAT ATCATTGCTC GTATGCCGGG GTTTGAGGGG 1380 

GCTTTGCCCG TCAATTCC6T AACGGCTTTG GTGGGAGCAC CTATTATCGT CACCGTTTTG 14 40 

TTCCGGCGCA GACGCTTCAA GGAAGAAACC GAC 14T3 



(2) IKFOFy-fATION FOR SEQ ID NO: 31 

(il SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 2289 base pairs 

(B) TYPE: nucleic acid 

(C) STRAWDEDNESS: double 

(D) TOPOLOGY: circular 

(11) T-tOLECUliE TYPE: Di^TA (genomic) 

HYPOTHETICAL: NO 

(iv) AUTI-SEt4St: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: P0RYPHYRO11ONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/ KEY I misf^_f eature 

(B) LOCATION 1. . .2289 

SEQUENCE DESCRIPTION: SEQ ID HO: 31 

CATTTTTTAG TAACGATTAT GCGGACAAAA ACTATCTTTT TTGCGATTAT CTCTTTTATT 60 

GCTCTATTCT CGTCTTCTCT GTCGGCTCAG AGCTVAAGCCG TTTTAACCGG TAGTGTGTCG 120 

GATGCCGAAA CCGGAGAGCC TCTTGCCGGT GCTCGAATCG AAGTCAAACA CACCAACATA 180 

GTAGCCGGTG CCGATGCCGG CGGACATTTC GAGATCAAGA ACCTGCCGGC AGGGCAGCAT 24 0 

ACTATTATAT GTTCGTTGGG GGGGTATGGA CAGAAAGAGG AGGTGGTTGC CATCGAAGCC 300 

GGACAGACCA AAACGATCTC TTTTGCATTG CGACTGCGAA CGAACAACTT GGAGGAAGTC 360 

GTCGTTACCG GTACCGGTAC ACGTTACCGC TTGGTCGATG CTCCTGTGGC AACGGAAGTC 420 

CTTACCGCTA AGOACATAGC CTCrTTCTCG GCTCCTACTT CCGAGGCCTT ATTGCAGGGG 480 

CTGAGTCCGT CTTTTGACTT CGGCCCCAAT CTGATGGGCT CTTTCATGCA GCTGAACGGC 54 0 

CTTAGCAGTA AGTATATCCT CATCCTTATC GATGGTAAGC GTGTGTACGG CGATGTAGGC 600 

GGTCAGGCCG ATTTGAGTCG TATTTCTCCT GATCAGATCG AACGGATCGA ACTGGTGAAA 660 

GGTGCTTCGA GTTCGCTCTA CGGATCCGAT GCCATCGCCG GGGTAATCAA TGTGATCACA 720 

AAAAAGAATA CGAATCGACT GAGTGCATAT ACGTCACATC GCATATCGAA GTACAACGAT 780 

CGGCAAACCA ATACTTCGCT CGATATAAAC ATCGGTAAGT TCAGTAGCAA TACCAACTAT 84 0 

TTCTTCTACO ATACGGATGG CTGGCAGAAT AGTCCGTTCG AAATAAAAAA GAAAAAAGGA 900 

TCCGGCGAAC CGGTCTTGGA GGAAACGTAT AAGAAAACTT TTCGTGCACA GGAAAATCAG 960 

GGTGTAAGCC AATCGCTTTC CTATTATGCA ACTAACAATC TTAGCTTCAG CGGAAATGTG 1020 

CAGTACAATA AACGTCAGAT CTTCACTCCG ACTTTTTCCG AAAAGAAGGC CTATGACATG 1080 

GATTATCCTG CTITGACGGC TTCACTCGGT ACGAACTATC TTTTCCCCAA TGGTCTGCAT 114 0 

ACGCTTTCTT TCGATGCCGT CTACGATCGC TTCCGTTTCG GATATTTGTA TCATGACAAG 1200 

GACAGCAGTG AGAGCCTGAT CAACAACCAA GGTCAGACCG AGCAACCCAC ATTCTTTCCG 1260 

GGTCAGCTAC GCAATAAAAA CGATCAGATC CGATACACGG CAGAGGCTCG CGGTGTATTT 1320 

ACACTGCCTT ATGCUCAGAA ACTGACCGGC GGTTTGGAGT A'lTTCCGTGA GGAATTGATC 1380 

TCTCCCTATA ATTTGATTAC CGACAAGGCA GATGCTTCCA CGCTCTCTGC TTATGTACAA 1440 

GATGAATGGA AACCGCTCGA TTGQTTCAAT ATGACAGCCG GTTTCCGTCT GGTACACCAT 1500 

CAGGAGTTCG GTACACGAAT GACGCCTAAG GTATCCATAC TCGCCAAGTA TGGGCCGCTG 1560 

AACTTCCGCG CTACGTATGC TAACGGCTAT AAGACTCCCA CGCTGAAAGA GCTTTTTGCA 1620 

CGGAACGAAC TCJXCCACTAT GGGTTCGCAC AATCTCTATC TCGGCAATGC GGATCTTAAG 1680 

CCACAGATGT CGGATTATTA TGCTTTGGGC TTGGAGTACA ATCAAGGCCC TATCTCGTTC 17 40 

AGTGCAACGG TTTATGACAA TGAACTTCGC AATCTGATCT CCTTTATGGA TATACCGACC 1800 

TCACCCGAGC ACGAAGCTCA GGGAATCAAG AAAACCAAGC AGTATGCCAA CATAGGAAAA 1860 

GCTCGCAGCC GCGCX:CTTGA TGTCCTATGT GATGCCTCTA TCGCl'TGGGG TATCAAGTTA 1920 

GGAGCCGGAT ACAGCCTCGT GGAAGCl'AAG AATCTCCAGA CGGATGAGTG GCrCGAAGGA 1980 

GCTGCACGTC ATCGTGCCAA TGTGCACGCC GATTGGGTTC ACTACTGGGG TCAGTATAGA 204 0 

CTTGGCGTGA GCCITTTCGG CCGTATTCAG AGCGAGCGTT ACTACAAAGA CGGCAATGCT 2100 

CCGGACTATA CCTTGTGGCG ACTCGCCACA TCGCATCGTT TCGCTCATTT CCGCCACATC 2160 

ATCCTGGATG GAACGCTCGG TATAGACAAC CTGTTTGACT ACGTGGATGA TCGTCCTATG 2220 

GGTGTCAATT ATGCTACCGT AACGCCGGGA CGTACTTTCT TTGCTCAAAT AGCGATTCGA 2280 

TTCAACAAC 228 9 
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(i) SEQUEMCE CHARACTERISTICS- 

(A) LENGTH; 1095 base pairs 
(BJ TYPE: nucleic acid 
(C) STRA^^DEDMESS: double 
iD) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DT-IA {genomic) 

(lii) HYPOTHETICAL: NO 

{iv) Al IT I -SENSE: NO 

(vi» ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYRO(iONAS GIWGIVALIS 
( Ix ) FEATURt : 

(A) NA^^E/KEY: mi^c feature 

(B) LOCATION 1...1095 

(Xt) SEOUENCB DESCRIPTION: SEQ ID NO:32 



TGCTGCGCAA 

TTTCCTTATT 

CGTAATATCG 

GTTGGTTTTT 

6AATACCGAG 

CAGCAGGTGC 

ATGGAGCAGG 

GGAGCACGCA 

GTAGCCATAG 

CCGGCTCAGA 

GCGGCCGAAG 

GCTTCTGCCC 

GAAACCTACC 

GAACTCGTAG 

AGCTTTGCCG 

GTGGTCCCGG 

GGTACCTATG 

GAGGTAAAGG 

GTGATCA'l'AC 



GGCTTCGGGA 
TCGTTATTCA 
TATTCCCGGC 
TCATGCTCAG 
TGTCCAGCAA 
AGGCCGGCGA 
CAAAGGCT6C 
GCGAACAGAT 
CGACCAAGAC 
AGTTGGACGA 
CCCAGTACAA 
TCGTCGATAG 
TCATCGCCCC 
GTACCGGCGC 
TTCGTGAGGA 
CTCl'GAATGA 
CTCCCTGGAA 
CCACCCTTGC 
GCAAG 



CACCTCTCCC 

TCTGATCAAA 

GT'rrCTCCTC 

ACCGGCCGAG 

AGTGCCCGGG 

TACCCTCGCT 

CGAAGCAGCT 

ACAGGCAGCC 

ACACCAGCGC 

AGCCACTGCC 

TATGGCTCGC 

AGCGAGAGGA 

ACGGGCAGGC 

ACCTATCATG 

TTTCCTCACC 

AGAAAAAGTA 

AGCGACCAAG 

GGATAAAGAC 



CGGAAGAAGT 
CATATTATCA 
TTGCTGGGAG 
GAGATTATCC 
CGCAl'CAAGG 
GTCATCGAAG 
GCACAGGCTC 
TATGAGATGT 
GTGCAGAACC 
CAGCGCGATG 
AACGGTCCCG 
GCCGTCGCCG 
GAAGTGTCGG 
AATATCGCCG 
AGCATGACCA 
CGCTTCAAGA 
ACAACAGGGC 
AAGGCACAAA 



CGTTTCGGAA 

TTATGACGGA 

TCATCGCAGT 

AAGGACAGAT 

AACTTAGGGT 

CCCCCGACGT 

AGAACGCCAA 

GGCAGAAAGC 

TCTATGACCA 

CGGCCATCGC 

AACGCGAAGA 

AGGTGGAGTC 

AGATATTCCC 

AGATGGGCGA 

TGGGAGCCGT 

TCACATTCAT 

AGTACGACCT 

AGCTACGCCC 



TGATACCGTT 

CAACAAACAA 

GGTGACGATC 

AGAAGTGACC 

ATCCGAGGGA 

AGCGGCTAAG 

GGCTCTCAAA 

TCAGGCCGGC 

GGGAGTGGTA 

TACGCAAAAA 

CAAGCTGGCA 

GTACATCAAC 

CAAAGCCGGC 

TATGTGGGCC 

TCTGGAGACT 

CAAGAACATG 

GAAGACCTTC 

GGGTATGTCC 



(2) INFORTIATION FOR SEQ ID HO: 33 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 base pairs 

(B) TYPE: nucleic acid 

(C) STRAilDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) HOLECUE.F. TYPE: OtiA {genomic* 
(111) HYPOTHETICAL: HO 
(iv) ANTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMa^AS GINCIVALIS 

(ix) FEATURE: 

(A( NAME/KEY: misc_feature 

(B) LOCATION 1,..960 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33 

CCGCAATCCT Cl'CCTGATCG AAGAAGCTTC CAAAACGT'-A TGAATAAATA 
^Srl^nl^^ AGGTCGGCAA AATTGGGATT GTGaVtA^^? ^^^^J? 
GTACATCAAC AACCCCCTTT TTTAAGCCAT A;^ATCAATTA TGcStattct 

J^^r'*^"^ ctttttcggt tttgcttttt gcatcwgcc ct5S^g 
fHIiS?!?? aagatatcca aacttttaat cgggagatta tcgS^c 
aaaattgaga aggacgatgt gctgaacatc cttgtcacca gtXgagaccc 

^n^^nS^n"^ ACCAAGTGIT GACCACTCGT GCACTGGC^ G-^C^^A 
It^TG?^G CGATTCGAAA GGGTACATCA AT?^?S 

'^GGCCCTTAC TCGTACCGAA CTGGAGAACG AGATACAGAA 
TCCAGTGGAT TTATCAAGGA TCCTACGGTA ACGGTGC/vGC TTCAAAATTT 



TCATTCTCAA 
AGTACGGAAT 
CAGTAA'ITTT 
AGAAAAGGTC 
ATATGACGTA 
GGAGCTTTCA 
TGGAACGAAC 
TTTAGGCCAG 
GAGGATTATT 
CAAGGTGTCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1095 



60 
120 
180 
240 
3O0 
360 
420 
480 
540 
600 
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GTTTTGGGAG AGGTGAATCA TCCGGGTTCG ATGTCGGTAA AAGGAGAGCR AATAACTrTT 
TTCiGAAGCGA TCGGAATGGC CGGAGACCTG ACAATCTATG GT^C^CGA ^^^^^^^^ 
"c??cS^^ ^CC^^'Jtt GTTTTCCAGA CGGAT^t'Sg I^IcC^J 

gccccgigi-a ctatctgcat cagaacgacg tcatctatgt ggagccgaac 

OT^^ S™?? clr^l'=T'' CAGAATAATA ACGTAA^CG? A^^^^^^^ 
GTTACCTCCA CTTTV:K.TATC CATTTCCACG CTGACGATTA CGATAATAGA TAAGACCAAA 

(2) IMF0RILATTC5N FOR SEQ ID NO: 34 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 174 6 base pairs 

(B) TYPE: nucleic acid 

(C) STRAfJDEDNESS: double 

(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DNA (genomic > 

(iii) HYPOTHETICAL: NO 

(iv) AMTT- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAIJISH: PORYPHYROMOMAS GIH3IVALIS 

(ix) FEATURE: 

(A) NAME/KEY; mlsc featur- 
fB} LOCATION 1, . .174 6 

(xi) SEQUENCE DESCRIPTIOI4: SEQ ID HO: 34 

IS??^^^rS l^^nn^^ GTTTTGTACC TTAGCGCGCA. TGAAAAAGAC CAATCTGTTT 
H^I^f JS: T=^T*^TCTT TATCACCGGT AGTTTTATGA CTGCCTGTGC ACAGAAGTCC 
^G^T?^A TA^??^^^ AGCCGCAATG AGTA^^^ ^CG^^AT 

SaT^TGACTC ^l^ll^J^ T^"^"^"^ CTGTATTTCG TCGATACCAT AAGTATCAAA 
^Z~t^ GGCGTGGTAT AGATGCGATG TTGGGCGGGC ttgaccccta taccgaatac 
Prl^^I^^S? aggaaatgga tgaactgaaa ttgatgacta SggagaSa ?2c^SJg?c 
a?gS^ag acgS^S^ a??°^?^^^^ gctgtgatta tccagagac^ ?a5^gg? 

«ltit,CCeCAG ACGAAGCAGG ATTGATAGCA GGCGACCGCA TCCTGACTAT rrtuTr-ra-anii 
SACTTCCGCA AATCCACCAC ACCGAAAGTA AGCCaSwC T^^^ AG^r^J^^ 
GTTGCAAAGG TGACAGTAAT GCGCTATGGC GAAAcSSc CTcS^^??? T^S^^ 
TGATTATGAA TTCCGTCACT TACAgSS^ ^^CTCGaJ^ SSSa^ 

ga?ct5^^ Ic^S?^ tacggacaaa agtgcagaag aggtgSJ^ §^§^gwg 

^A^^?^ J^^fS =T«TTTT«3 ATTTAAGAGG CAATGGTGGC 

OGACTOATlaC AGGCTGCTAT CGAGATAGTC AATCTGITCG TCCCTAAGGG CAAaRnrrTf 

^SSflS?''' cattacttta ttatatggcc atcaataatg acgt5ttc^ ^g^^ 

nf^^l^V'f' "^^GCATAA AACGATTGCC AAGCCGC^GG ATrTTTCC^ Jllc^^i^ 
GACTATGCAG CTTTCTGCAA GATGATGGAA GAAAAGAAAT TTGaSaTGA 1AAC^ 
GGCAAGATGC TTGACAAACT GGAGGAACTG GCTAAGATAG AAGGCTACCT Ccr^J^r^ Ar.?. 

^^r^c^ l^lT^l ^^Sr^" ctaaaacSa ScgS ?ga???^c'5^ f eo 

w^ATTCAAAA AGGAGATAAC AAACTATCTC AACAATGAGA TTGTCACTCG CTATTATTat 

Sg^^GG aJca?ccgS tc^JTr^r^rl ^^^^-^^^ ^A^SSoA Z^C^l^J^l Hit 
AAAGGG ^'^''^^^^^^ ACAAATTCGA CAGATCCTTG CAGCTCCGAA AGCAGAGAAT 1740 

1746 

(2) IMFORIIATION FOR SEQ ID NO: 35 

(i) SEQLrENCE CHARACTERISTICS: 

*A) LENGTH: 2 9b5 base pairp 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

111) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



720 
780 
840 
900 
960 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



1260 
1320 
1380 
1440 
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(iv) AIJTI-SEMSE: NO 



tvi) OKXGINAL SOURCE: 

(Pi) ORGAhriSH: PORYPH-l-ROt^OHAS GINGIVAL IS 

FEATURE: 

(A) NAtlE/ICEY: misc_feature 

(B) LOCATION 1 . . .2955 

(xii SEQUEIICE DF.SCRIPTIOH: SEQ ID MO: 35 



GTAACAGACA 
ATCTGTGCGT 
GCGATGGGCG 
TTCGGCTACA 
GGGGGTATGA 
AAAAGTTTGG 
TCCGACTTCA 
GCCGTTATCT 
GACGTAGTGC 
CTCCGTGCTC 
CAAGGGCGTA 
TTGCAACGCA 
GACTTGATCG 
CCGGAAACAG 
CAAGCTGTAG 
GACGCACTCT 
CGTCGTGCTA 
ACACGTGAAG 
AAGGAGACCG 
TTGGGAGGTG 
GAACCGATCG 
AAGGATAACG 
AACGTGAATG 
GAGGCCGGTG 
GAACAGGAAA 
TCGTTCCTGC 
CCtiGGTCTTA 
GCTTCTTl'CC 
ATGGCTGTGG 
AAGACTCCGA 
TCGAACGTTA 
CGCGG'ITTTG 
TTGACTCGTA 
ACTACGAGCA 
AAGACCGGCT 
ATCGGTCTCA 
CAGCCTGTAT 
TTGGTTACCT 
CAGGAGGAAA 
GGTTTCTACA 
AAAGTAACrC 
TCGATGATCT 
GGGGTATTCG 
TGGAAGATTC 
ATCATCGGTT 
AAATTGTACC 
GGTCGAACAT 
GGAGGTGCTA 
TACTCTACGC 
AAAGCAGCTA 



AAATGCAAAA 
TTTACCTGTC 
ATGTTGCCGG 
CGCTGAAAGA 
ACGTTATCTT 
ATCCCAACTT 
TCGATATTTT 
TCGGTTCGGG 
GTCTGCTCAA 
GTATCGATGC 
TCCTTGTCGA 
GTGCCAACCT 
CTGCCAATGA 
AGCCTGCAAC 
CTGATTCTGC 
TCTCTCTGCT 
ATATGGCICA 
ATGTGCTTTT 
ACCTCTACGA 
ATGTAGTGAC 
TTTCGATGAC 
TGGGACGGGC 
ATGAGATCAC 
ACCTTGCCAA 
ACGTGATTGG 
TCGCTTTGGT 
TCGCAAACGG 
ATGCCGTGCT 
ATGCCAACGT 
TTCGTGCCGT 
ORACTATTAT 
CCACTACGTT 
TCGTCTTCGA 
TTAd'CGCAA 
TTATCATTCC 
ATAGGGGTAT 
CTTCCGAAGC 
CCATCGGTAC 
GCGAAGAAAC 
CCCAGCAGCC 
CCAGTATGTC 
TCATGGCCAT 
TATCTGTGGC 
TGCCCTTCAC 
ACTCGCTCAA 
CCAACAGAGA 
TAAATACGTC 
CGATGCGTAG 
TCTTTGTTGC 
AGAAA 



CAAAGGATTT 
ATTCTCTTTC 
AATGGCCTAT 
AGCTCAAGCC 
GAAACTTAAC 
CAACAAAGCT 
CGTGAAGGAA 
TGACCTTCGC 
AGAAAAATAT 
TTTCGGTGTG 
ACTCCCCGGA 
ACAGTTCTGG 
TCGTCTGAGC 
TACTGACTCT 
TACIGTAGCA 
TACTCCCGTG 
GATATCTGAA 
CCTCTGGGGT 
ACTCTATGCT 
TTCCGCCAAG 
GATGAATGAA 
AATCGCTATC 
GGGCGGTCGC 
CGTACTCTKAC 
TCCTACGCTG 
TATCCTGATG 
CGCATTGATT 
GACCCTCTCG 
ACTTATCTTC 
TACGGATGGT 
TACCGGTATC 
GATTATCGGT 
GAAACTGGCG 
TCTCCTTGTC 
GGTGATTATC 
TGAA'iTCTCC 
CGTTCGTTCG 
TGAAGGGACA 
TGAAGCAGAG 
TACTGCTGAT 
GAGTGACATC 
TTATATTCTG 
CGCTACTACA 
CATGGAGATC 
TGACACCGTG 
TCGCTATCAG 
'ITTGACTACG 
TTTCACGTTC 
TACACCCCTT 



GTGATTGTTA 
GTTACGAACC 
CTTGATTCCA 
CAGCAAATTG 
GCAAGCGATC 
CTGGAGAATG 
TATCGCAAGC 
GACCAGATTA 
AATAGTGCTG 
GTTGCACCTA 
GTGAAAGACC 
CGTACATACA 
GAATTGGCTA 
GTAGCTGCAA 
CAAAAAGAGG 
AA'rCGTGGCG 
ATGCTCCAGC 
GCTAAAGCAA 
ATTCGTACCA 
AGTGATATCC 
GAAGGTGCTC 
GTTTTGGATG 
TCTCAGATCT 
TCCGGTAAAA 
GGTGCCGAGT 
TGTTACATGT 
GTAAACAGCT 
GGTATCGCAG 
GAGCGTATCA 
TATGGCAACG 
ATCCTATTCC 
CTTATCGCTT 
AAAAAAGGTC 
AATCCCTCAT 
ATCGTTTTGG 
GGAGGACGTA 
GCCTTGTCTT 
GAGGTGCGTA 
ATTACTGACA 
CAGTTCTTGG 
ACGAGAGGTG 
ATTCGCTTCX: 
TTCTGCATTA 
GATCAGAACT 
GTTGTATTTG 
GTGATCAACG 
TTTATCGTTA 
TCGATCCTGC 
GCCTACGAGA 



TCACATCGGC 
GTTACGAAAA 
TGTCGAATGA 
GTCTTGGCCT 
TGCTTCGTAA 
CTGCCAAGAG 
TCGATCCCAA 
CCGCAAAGTC 
TAGAAGCl'TC 
ATTTGCAGCG 
CTGAGCGTGT 
AATTCGAAGA 
TGAACAACAC 
CAGCCGATTC 
CCTWGGATGC 
GTGCAGTAGT 
AAGCTCACGA 
TCGAAGACCC 
ATCGTACX3GG 
AAAATGACTT 
GTAAATGGGC 
GTGTGGTTTA 
CCGGGCACTT 
TGGATGCTAC 
CCATTAAAGC 
GTCTGGCTTA 
TCTTCACATT 
GTTTGGTGCT 
AAGAAGAGCT 
CTTTCTCTGC 
TCTACGGGAC 
CTTTCATTAC 
GTTTGGATAA 
ACAACATCTT 
GACTTATAGC 
ACTACGTAGT 
CTCCCCTGCA 
TATCTACGAA 
AATTGTATCA 
ACAATATCAT 
CTATTTGGGC 
GTGACATTTC 
TTGCTCTGTA 
TCATCGCTGC 
ACCGTATCCX3 
ATGCCCTTAA 
TGTTGGTAAT 
TCGGTATCGT 
TCCAAAAGCG 



TCTGGCCATC 
GAAGGCl'AAG 
GAAGCTCTGG 
TGACTTAAAG 
CCTCTCTAAC 
CACGGAGCAA 
CGGTCGCTTG 
TACRGATGCA 
GTTCAATGTG 
ATTGGAAGGG 
TCGTACXrCTT 
GGTCAGCGGA 
GGATGCTACC 
TGCTGCTGTA 
TACl'CGTAAA 
GGGTGTGGCr 
TCTGAAGGTT 
CGAAACCAAA 
AGATCCl'GAT 
CGGTCGTTCC 
GCGTATCACA 
TTCTGCTCCG 
CACCGTGGAG 
GGTAAGCATC 
CGGATTCTTG 
CGGTTTCTTG 
GGGCGTATTG 
GACGCTGGGT 
TCGTGCCGGT 
CATCTTCGAC 
GGGGCCGATT 

GGcrGTcrrc 

GATTACATTC 
GGGTAAGCGC 
TTCATTTACA 
TAAATTCGAC 
GGAAAAGGTA 
CTATAAGATC 
GAGCCTGAAA 
TAGCTCTCAG 
TGTGCTGTTA 
TTTCTCTGCC 
TGCGTTGCTG 
TATTCTGGCT 
AGAGACGATG 
TTCAACATTG 
CTTCATCTTT 
TATCGGTACA 
CAAGCTCAAC 



60 
120 
180 
240 
300 
360 
420. 
480 
54 0 
600 
660 
720 
730 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2955 



(2) IIIFOWIATIOM rOR SEQ ID NO: 36 

{i) SEQUEMCE CHARACTERISTICS: 

(A) LE>4GTH; 3138 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIIDEDI-^ESS: double 

(D) TOPOLOGY: circulax 

(il) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: HO 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(iv) AJ-JTI -SENSE: NO 

(v1 ) ORIGIHAL SOURCE: 

(A) ORGAJIISM: PORYPHYROMOJ^AS GINGTVALIS 

I XX} FEATURE: 

(A) HAME/KEi: misc feature 

(B) LOCATION 1...3T38 

Ir.L) SEQUENCE DESCRIPTIOM: SEg ID MO: 36 

GACTTCCCTT G'3GTTAGGGT AAAACCGGAG AAGAAAAGAA AACJVACATAA CAGTAATAAT 60 
TTl'AAGTTTA ACGCAAAAGA AAAGTCTATG AAAAGAATGA CGCTATTCTT CCTTTGCTTG 120 
CTGACGAGCA TTGGGTGGGC TATGGCCCAG AATAGAACCG TGAAGGGTAC AGTTATCTCC 180 
TCCGAGGATA ATGAGCCCCT GATCGGCGCG AATGTCGTGG TTGTCGGAAA CACCACTATC 240 
GGTGCTGCAA CCGACTTGGA TGGCAACTTC ACGCTTAGCG TGCCTGCCAA TGCCAAAATG 300 
TTGAGAGTGT CCTATTCCGG TATGACTACC AAAGAGGTCG CCATCGCTAA TGTGATGAAG 360 
ATCGTACTGG ATCCGGACTC TAAGGTTCTG GAGCAGGTAG TTGTATTGGG TTACGGTAOG 4 20 

GGACAGAAAC TCAGCACTGT ITCCGGTTCT 6TGGCCAAAG TGTCCAGCGA AAAGCTCGCG 4 00 

GAAAAGCCCG TTGCCAATAT CATGGATGCC CTCCAAGGTC AGGTAGCCGG TATGCAGGTT b4C 
ATGACTACAT CCGGTGACCC TACTGCCGTC GCTTCTGTGG AGATCCATGG TACAGGGTCG 600 
TTGGGCGCAA GC-fCTGCACC ATTGTATATC GTGGATGGTA TGCAAACTTC TTTGGATGTT 660 
GTGGCTACGA TGAATCCGAA TGATTTTGAA TCTATGTCCG T'lTTGAAAGA TGCTTCTGCA 720 
ACATCTATTT ATGGAGCTCG TGCTGCAAAC GGAGTCGTTT TCA'ITCAAAC GAAGAAAGGT "7 80 

AAAATGAGCG AGAGAGGTCG TATTACCTTT AATGCCAGTT ACGGGATTTC TCAAATCCTG 840 
AATACTAAGC CCCTTGATAA TATGATGACT GGAGATGAAT TGCTGGATTT TCAGGTGAAG 900 
GCAGGTTTTT GGGGCiAACAA TCAAACCGTT CAGAAGGTTA AAGATATGAT CCTTGCCGGA 960 

GCTGAAGATT TGTATGGCAA TTATGATTCT TTGAAAGATG AGTATGGTAA GACATTGTTC X020 

CCAGTGGATT TTAATCATGA TGCAGACTGG CTCAAGGCTT TGTTTAAAAC AGCACCCACC 1080 

AGTCAAGGTG ATATTTCTTT CTCCGGAGGG TC1'CAG<3GAA CTTCATATTA TGCCi'CTATA 1140 

GGCTACTTCG ATCAGGAAGG TATGGCTCGT GAACCGGCAA ATTTTAAGCG CTATAGTGGC 1200 

CGGCTCAACT TCGAAAGTCG TATCAATGAA TGGCTGAAAG TTGGTGCAAA TTTGTCTGGT 12 SO 

GCGATAGCGA ATAGACGATC TGCCGACTAT TTT6GA7\AGT ATTATATGGG GTCAGGTACT 1320 

TTCGGTGTGT TAACGATGCC TCGTTATTAT AACCCTTTTG ATGTGAATGG GGATTTAGCA 1390 

GATGTCTATT ACATGTATGG AGCTACCAGA CCTTCrATGA CAGAACCGTA CTTCGCAAAA 14 40 

ATGAGACCGT TCAGTTCCGA ATCACATCAG GCCAATGTAA ATGGTTTCGC CCAGATTACT 1500 

CCGATCAAAG GCCTTACTTT AAAGGCACAG GCTGGTGTTG AT ATT ACT AA TACTCGCACT 1560 

TCTTCTAAGA GAATGCCCAA TAATCCGTAT GATTCTACTC CTCTTGGGGA AAGAAGAGAA 1620 

AGAGCTTATC GAGATGTTAG CAAGTCTTTT ACAAATACGG CTGAATATAA GTTTTCAATT 1680 

GATGAAAAAC ATGATCTTAC AGCATTGATG GGGCATGAAT ATATTGAATA TGAAGGGGAT 1"?40 

GTTATTGGGG CATCTTCTAA AGGATTTGAA AGTGATAAGT TGATGTTACT GAGCCAGGGTv 1800 

AAAACCGGAA ATAGTTTGTC TTTGCCTGAA CACAGAGTCG CTGAATATGC ca'ATTTGTCT 1860 

TTCTTTAGTC GTTTTAATTA CGGTTTTGAC AAATGGATGT ATATAGATTT CTCTGTTCGT 1920 

AATGACCAAT CCTCTCGATT CGGATCCAAT AATAGAAGCG CGTGGTTCTA TTCT6TCGGT 1980 

GGAATGTTTG ACATATATAA TAAATTCATT CAAGA/y^GTA ATTGGCi'CAG TGATCTTCGA 2040 

CTGAAAATGA GTTATGGTAC AACGGGTAAC TCGGAGATTG GTAATTACAA CCACCAAGCA 2100 

CTCGTTACTG TGAACAATTA TACTGAAGAT GCTATGGGGC TTAGCATTTC TACAGCAGGC 2160 

AATCCCCACC TCTCGTGGGA AAAGCAGTCT CAGTTCAACT TCGGTTTGGC TGCAGGGQCT 2220 

TTCAATAATC GCTTATCl'GC AGAGGTAGAT TTCTATGTCC GCACTACGAA TGATATGTTG 22 80 

ATTGATGTCC CGATGCCTTA TATCAGTGGT TTCTTCTCAC AGTATCAGAA TGTAGGCTCT 2340 

ATGAAAAATA CGGGTGTAGA CCTTTCTCTT AAGGGGACGA TCTACCAAAA TAAGGACTGG 24 00 

AATGTATATG CTTCTGCGAA TTTCAACTAC AATAGACAGG AAATAACAAA GCTTTTCTTC 24 60 

GGTCTCAATA AGTACATGTT GCCTAATACC GGTACTATAT GGGAAATTGG GTACCCCAAT 2520 

TCGl'TCTATA TGGCTGAATA TGCTGGAATC GACAAAAAAA CCGGTAAGCA GTTGTGGTAT 2580 

GTTCCT6GTC AAGTCGATGC GGATGGTAAT AAAGTTACAA CAAGCCAGTA CTCAGCTGAC 2640 

TTGGAGACAC GAATTGATAA GTCTGTTACT CCTCCTA'ITA CAGGTGGTTT CTCCTTAGGT 27 00 

GCTTCTTGGA AAGGACTTTC TTTAGATGCT GATTTTGCCT ACATCGTTGG TAAATGGATG 27 60 

ATCAATAATG ACCGTTACTT TACAGAGAAT GCAGGTGGAT TGATGCAATr AAATAAAGAT 2820 

AAAATGCTAT TGAATGCCTG GACAGAGGAT AATAAAaAAA CAGATGTTCC AAAAITGGGA 2880 

CAGTCTCCTC AGTTTGATAC GCATTTGTTG GAGAATGCTT CTTTCCTGCG TTTGAAGAAT 294 0 

CTCAAACTCA CCTATGTACT CCCCAATAGT CTTTTTGCTG GGCAGAATGT GATTGGTGGA 3000 

GCTCGTGTCT ATTTCATGGC GCGCAATCTG TTAACTGTTA CGAAGTATAA AGGCTTTCiAC 3060 

CCTGAAGCAG GGGGGAATGT GGGAAAAAAT CAA7ATCCTA ATTCTAAGCA GTACGTTGCG 3120 

GCTATTCAGT TGTCTTTC 3138 



12) lUFORMATTOll FOR SEQ ID MO: 37 

ti) SEOUEMCE CHARACTERISTICS: 

(A) LENGTH: 2607 bsse pairs 
IB) TYPE: nucleic acid 
{CI STRANDFDNESS : double 
(D) TOPOLOGY: circular 

{jl> MOLECULE TYPE: DTJA (genomic? 
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(till HY POTHETICAL : MO 
(iv) AUTX -SENSE: NO 
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(vl) ORIGINAL SOURCE: 

(AJ ORGAJ^IISM: PORYPHYROMOHAS GINGIVAJ-tS 

(ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) LOCATION 1...2607 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 37 



TGGCATAiGGA 
CCATACCGTT 
ATAACAAACA 
GTTGCTTTGC 
TTTGACGAAC 
GAGCI'GGCTC 
TATGCCAGCG 
ACTCCTCCAT 
GAATGGAATG 
AAGTATCCAT 
ATGATTIGCA 
TCTACGATGC 
CTGCTGTCGA 
TGGATCTGCC 
GGAGCTACrC 
GCCGGTGATC 
GAAGGAAACA 
TATCCATGGG 
CGCAACTACG 
TCGG6AATCA 
AATTATTTCC 
GCCTCTCAAC 
GAATCCTTAT 
TGCCAGATCT 
ACAGGCTACT 
GGTTATCCTG 
ATTGCATTGG 
ATCTCAGTAG 
GAGTGGACGT 
TATAGTCATC 
ACGATTACGA 
GGTACCCCGA 
GCCACCAACT 
CCGACTGTGC 
TCCGGTGAAA 
GCTCCGCTAG 
TTCAAAGACT 
CCTGCCACCA 
GtCCAGCTGA 
GAGGTTATCC 
CCGCAAAACG 
CTCTATGACA 
GTAGATCTTT 
GCTCGCACGG 



ATATTTTTAT 
ATGCAAAAAC 
TAATTATCCT 
TGACTTTCAT 
GGGGCGAGGC 
TGATCATGTC 
AAGAGGAGTT 
CCATGCAGCG 
CTTACCCCAC 
CACTTTGTAC 
AGCTGACGTC 
ACGGAGACGA 
ACTACGAATC 
CTTTGACCAA 
GCTACAATGC 
ACCCCGATGG 
CCTCTTTCGT 
ATAATAAAAA 
CAGCCGCTTG 
TCAACGGTTC 
ATCGTCTGCG 
TTCCAAAGTA 
ACGGCATCCA 
TGATAGAAAA 
ACGTACGTCC 
AGGCAACTCG 
GCAACTCGCrr 
GCGAAAGCGT 
TCGAAGGCGG 
CCGGTCAGTA 
AAGAAAAATT 
CGGAAATAGA 
ACGTATGGAT 
TTTACAGCAA 
CGGTGAAGAC 
CCGACTTCGA 
TGTCTACGAA 
GCACGGAGCA 
CTGCCACCAA 
TCGATGACAG 
GAACGAAGCA 
TCAATGGACG 
CCATCCTGCC 
AAAAGATCCA 



CTTTGCGAGT 
CGAGCACCl'T 
ATTTATCAAG 
CGGCAGCATG 
CTACTTCrCA 
CATCGACGAG 
CGAGGCATTC 
CGCTGTCGAG 
CTATGAAGCC 
TACTTCCGTC 
CTCTGCCAAT 
AACGACCGGA 
CGATCCGAGG 
TCCGGACGGA 
CAACAATGTC 
AAAACCTTGG 
GCTCGGTGCC 
AGAAAGACAT 
TCAGAGTATT 
AGACTGGTAT 
AGAAATTACC 
TTGGAATCTG 
TGGTACAGTG 
CCATGACAAG 
TATCAAAGCC 
TACCATTACG 
TCCTCTGCCT 
CCAATTCCAA 
ACAGCCTGCC 
CGACGTTACG 
CATCACTGTC 
AGAGGGCCAG 
ATTCGATGGC 
AGCCGGCCAA 
GAAAGAAAAA 
AGGAACACCT 
CAATCCGACT 
AAACCCGGTG 
CGAGGGCGGA 
TGTCGAGGAC 
GATCCTCATA 
GGTCGTACTC 
CGAAGGAATC 
TATCGGG 



ACATTTAGCC 
TTTCTCGCAA 
AAAAAGAATT 
CAGGCACAAC 
TTCAAAGTGC 
TTTGACCCCG 
CTGCGCTATG 
ATGTTCGACT 
TATATCAGrA 
ATTGGCAAGT 
ACAGGOAAAA 
TATGTCGTAC 
ATTAAGAACA 
GCATACAGAG 
GATTTGAACC 
CAGCCGGAGG 
AATATACATG 
GCAGACGATG 
TCCGCCAGCT 
GTAATTCGCG 
CTTGAAATCA 
AACAAAGAAT 
ACTTCCGCTG 
CGCAACTCCG 
GGCACTTATA 
ATC/WVGGACA 
GTACCCGATT 
GATCAAACGA 
ATGAGTACAG 
CTCAAAGTGT 
AATGCCGTTA 
ACGGTATCTT 
GGCACTCCCG 
TACGATGTCA 
TACATCACTG 
CGAAAAGTAA 
TCATGGCTTT 
GTCACCTACA 
AGCAATGTGA 
ATAGTGGCAC 
GAAGCCAACG 
AAAAGTACTC 
TACACCATCA 



CGAAAAATAT 
AAGGATACTG 
TTTTGdTCT 
AGGCCAAAGA 
CTGATAGGGC 
TAACCAATGA 
GGCTCAAGCC 
ACCGCl'CAGG 
TGATGGAAGA 
CCGTAAAGGA 
AGCCTCGCGT 
TGCTCCGACT 
TTCTGGATAA 
CCGGAAACCA 
GTAACTTCAA 
CAACTGCATT 
GAGGAACAGA 
AGTGGTACAA 
ACATGACCTC 
GAAGTCGTCA 
GCAACACGAA 
CTCTGCTTGC 
CGAACGGACA 
ATGTTTACTC 
CGGTGAAATA 
AAGAAACCGT 
TCACAGCTTC 
CAAATAACCC 
AGCAGAATCC 
GGAATGCAAG 
T6CCTGTAGC 
TCCAAAACCA 
CTACCAGTGA 
CGCTCAACGC 
TCAAGAAAGC 
AGAAAGGCGA 
GGGTGTTCGA 
ATGAAACAGG 
AGAAAGCAGA 
AGACGGGTAT 
CTGCTATCAA 
CGAATCAGCT 
ATATCAAAAC 



GCTCCCACTG 
CAAGAATCCA 
TGGCATTTTC 
TTATTTCAAC 
CGTTCTACAA 
AGCCATTGCC 
TACATTCTTG 
AGAAAAATAC 
GTTCCAAACA 
TCGTAAACTG 
GCTCTATACT 
CAIAGACCAT 
AACGGAAGTA 
CACCCTACAA 
GGATGATGTA 
CATGGATTTG 
GGTGGTGAAC 
ACTGATCAGT 
CGAAACCAAT 
GGACAATGCA 
GTTGGTGCCG 
TCTGATCGAA 
GCCTCTCAAA 
CGATGCTACC 
CAAAGCCGAG 
CATCATGGAC 
TCCTATGACC 
GACGAATTGG 
GCTCGTATCC 
TGGTTCCAAC 
TGAATTCGTC 
ATCCACCAAT 
AGACGAAAAC 
GATCAtSTGCT 
TCCGGTCCCT 
GACAGTTACT 
AGGCGGCTCT 
CAAGTACGAT 
AGAC7ACATT 
CGTCATTCGT 
AGCGATCGTT 
CCGCTCGACC 
GGAAAAATCC 



(2) INFORl-IATION FOR SEQ ID MO: 38 

(i) SEQUENCE CHARACTER T5T ICS: 

(A) LENGTH: 318 base pairs 

IB) TYPE: nucleic acid 

IC) STRAi IDE Dl JESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (yenomlc) 

(lil( HYPOTHETICAL; NO 

(iv) ANTI -SENSE: NO 

(vl ) ORIGINAL SOURCE: 



60 
120 
IftO 
2-10 
300 
360 
420 
480 
SAO 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 

leoo 

1G60 
1920 
1980 
204 0 
2100 
2160 
2220 
22B0 
2340 
2400 
2460 
2520 
2580 
2607 
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fA) ORGANISM: PORYPHYROHONAS GIMGIVALIS 

I Ix ) fKlATURE : 

(A) NAJ-IE/KEY: mi£c_f ea ture 

(B) LOCATION 1. . , 318 

(xi) SEgUtNCE DESCRIPTION: SEQ TD NO: 38 



CTTTCGTACA GTGGAGAGAG CGATGCAAAA GAGTCTGATC 
TTCATCGGCT TTGAAAAACG AGTAAATACG ATGCGACTGA 
CTCTTACTGC TCCCCATTTT CTTCTACAAG CGGTTTATAT 
TGCCGGTTTA CCCCCTCATG TTCGTCCTAT GCCATCGAAG 
GGCAAAGGAC TATTGCTGAG CATCAAGCGT ATTCTCCGCT 
CJGCTATGACC CCGTTCCG 



AGAATTGCCG GAAATCTACC 
TCAAGGCTTT TCTCGTGCAA 
CGCCGCTTAC ACCGCCTTCA 
CCTTACGTAA ATATGOCCCG 
GTCACCCGTG GGGTGGAAGT 



60 
120 
180 
210 
300 
318 
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(2) INFORMATION FOR SEQ ID NO: 3 9 

Ul SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2583 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: double 
fO) TOPOLOGY: circular 

(ii> HOLECULPl TYPE: DHA Igenoinic) 

tiii( HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE! 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

(ixJ FEATURE: 

(A) HAME/KEY: inlsc_f eature 

(B) LOCATIOr^ I. . .2583 

(Jvi) SEQUENCE DESCRIFVIOM: SEQ ID NO: 39 



TTAGAGATGG 
GAAGCAACCC 
CGCGACGGAG 
CTCAAACGGT 
TCGCCCATCT 
GACATGGAGG 
CAGGAGAGCT 
TCGGATTACT 
CTCGACGGGT 
GGGAATAGCG 
GATACTACCA 
GACATCACTG 
ATCGAAAGGG 
GAACCCGGTG 
AAGGTGAGCC 
GCCGGCACCA 
AAGAAGAATC 
TCTGCAGCCG 
GTACAGTGCA 
GCACTCGAAC 
ACCATCCTGC 
GAAGCGATCA 
GATAAGGCGA 
GCTCCGAAAG 
TCGGCCGTAA 
ACTCAGCAGC 
GAGACGGTGG 
GAGCGGCTGA 
AAAGTAGTAG 
CTGGGACTTC 
GTAGGCAAGA 
ATGATCAGOG 
GCCCCTCCGG 
AAACCCTATT 
CTGCTCITAC 
TTCAAGAACA 



CATACGACTT 
GTCTCGGCGT 
ACAATGGCGC 
TGATCGAGTT 
TCACCCCTTC 
ACGAGGCCGT 
TAGTACAAAA 
TCGGACAGCG 
ACCAAGACAA 
GGACAGGCGG 
CCACGACACG 
CCATGGCGGC 
TGATACAGAT 
TAGGCAAGAG 
GTATTCTTTT 
AATATCGCGG 
CGCAGATCAT 
GATCGATGGA 
TCGGAGCCAC 
GCCGCTTCCA 
AAAACATCAA 
AAGCGGCAGT 
TAGATGCCAT 
AAATCGAGAT 
AGGCTCAGAA 
AGATAGCGGA 
ACGAGAATGT 
GCACGGGCGA 
GTCAGGACAC 
GCAATGAAAA 
CCTAl-rTGGC 
TGGATATGAG 
GATATGTGGG 
CCGTGGTTCT 
AGGTGATGGA 
CCGTGATCAT 



TACACAAACA 
CGTAGCCGTG 
GATCGACATC 
GGAAGCCATC 
GGCTCGGGAG 
CAGCCCGGTC 
GATA'ITTATG 
CAACCCCTCC 
CGACTTCGAC 
AGGCTCCGGC 
GAGTGGAGGC 
AGCAGGCAAG 
ACTCAGCCGG 
TGCCATCGTG 
CGACAAGCGG 
ACAGTTCGAA 
CCTCTTCATC 
TACGGCCAAT 
TACGCTGGAT 
GAAGGTGCCG 
AGAGAAATAC 
GGAACTGACC 
GGACGAGGCC 
ACTGGAGGCC 
CTACGAACTG 
AGAGAAGAAA 
AGTGGCGCAT 
AGGCGAACGT 
AGCCATCGAA 
GAAACCGATC 
CAAGAAGCTC 
CGAGTATATG 
CTATGAAGAA 
CTTGGATGAG 
CGAAGGTCAG 
CATCACCTCC 



TTCCGCAACA 
ACGCAAGATA 
ATGCGGCACT 
GCCGAGAGTT 
GCTATCGATG 
CATCTGTTGC 
AAACAAGGTA 
GAAGGGAAGT 
GACGAAGAGG 
GACGCCCCCG 
GACACGCCTG 
CTCGACCCGG 
CGCAAAAAGA 
GAAGGACTGG 
ATCATCAGCC 
GAGCGGTTGA 
GACGAGATAC 
ATGCTCAAAC 
GAGTATCGTA 
ATAGCCCCCT 
GAGGACTATC 
GATCGCTATG 
GGCGCGAGCG 
GAATTGGCAT 
GCTGCCTCCT 
AAATGGGAAG 
GTAGTGGCGT 
CTGCGCACGA 
AAGATGGTGC 
GGITCTTTCC 
GCCGAATACC 
GAGAAGTTCT 
GGCGGCCAAC 
ATCGAAAAGG 
CTGACCGACA 
AACGTGGGTA 



GCCTGGAGTA 
TGCTCGTACT 
ATGG6ATCAA 
TGCCTGCTTC 
ATGCCACAGA 
TGAGTATCCT 
TAAAATACGA 
CTCCCTCCGA 
ACGAATCCTC 
AACAGAATAC 
CACTGGACAC 
TAGTGGGTCG 
ACAATCCGGT 
CGGAACGCAT 
TCGATTTGGC 
AAGCCGTGCT 
ATACCATCGT 
CCGCTCTTGC 
AGAACATAGA 
CGACTGCAGA 
ACGGTGTACG 
TATCCGATCG 
TCCATATCAC 
CGGTGCGA6A 
TCCGCGATCA 
AGCAGATGTC 
TGATGACAGG 
TGGCAGATGA 
ATGCCATCCA 
TTTTCCTCGG 
TG'iTCGAGGA 
CCGTTTCGCG 
TGACGGAGCG 
CGCATGCCGA 
GTCTGGGACG 
CACGCCAGCT 



CAGCTATCAG 
CGGTATCATT 
CTTGTACGAA 
GCCTGAGGGA 
CATCTGTGCC 
CAACl'CGACA 
CACCATCCTG 
AATGGAGATC 
TCCGCCTTCC 
CGGCGGAGGC 
CTTCGGCACC 
GGAGCAGGAG 
GCTCATCGGC 
CGTGAACAGG 
TCAGATGGTA 
CGATUAGCTG 
GGGAGCAGGC 
CCGTGGACAG 
AAAGGACGGA 
AGAAACGCTG 
CTATACGGAC 
TTTCTTCCCA 
CAATGTGGTG 
GAACAAGCTC 
GGAGCGGCGC 
CAAGCACCGC 
CGTTCCGGCT 
TCTCAAGACC 
GCGCAATCGT 
CCCCACGGGG 
TGAGAATGCC 
TCTCGTGGGT 
CGTAAGACGC 
TGTCTTCAAT 
GCGCGTGAAT 
CAAAGACTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
€00 
€60 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 

leoo 

1860 
1920 
1980 
2040 
2100 
2160 
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55 



GGGCAGGGTA 
TCCGTGATCC 
GATATCATCC 
CTTAAAGCCG 
CCCAAGGATG 
CGCACACTCC 
GAGAAAGGGC 
GCA 



TCGGGTTCCG 
AAAAAGCTCr 
TCTTCGACCA 
TCTTGGCGCG 
TGATAGCGAC 
AGAACGAAGT 
AGACGCTTAC 



TTCGGAAAAA 
GAACAAGACG 
ACTGGGCAAG 
CATCCATCGT 
GAAGGGATAC 
GGAGGATCGC 
GCTCTCTGCT 



GACGAGGAAG 
TTCAGCCCCG 
ACGGAGATTC 
GCCGGATACG 
GACCTCCaAT 
CTCACGGATC 
CGCGATGGCG 



CGAACAAGGA 
AATTTCTCAA 
GCCGGATGGT 
ACCTCGTCCT 
ACGGAGCACG 
TTATCCTCTC 
AGATCTVTCGT 



GCATAGCCGT 
CCGTTTGGAC 
GGACATAGAG 
TACCGATGAA 
ACCGCTCAAG 
CGGACAGATC 
ACAAGAACAA 



(2) inrORIIATION FOR SF.Q ID NO: 4 0 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEUIGTH: 1368 base pairs 

(B) TYPE: nucleic acid 

(C) STRAjrOEDMESS: double 

(D) TOPOLOGY: circular 

(ii) HOLECULE TYPE: mA (genomic) 

(iiil HYPOTHETICAL: MO 

(iv) AIJTI -SENSE: NO 

{vij ORIGINAL SOURCE: 

{A) ORGAiaSM: PORYPHYROHOTAS GItlGIVALIS 

{!>:) FEATURE; 

(A) NAMF./KEY: misc feature 

(B) LOCATION 1. . .1368 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4 0 



AGAGGAGGAC 
TGCCGCTTCT 
AATTACCTGT 
GCl'TrCCTTT 
GGCAGAGCGT 
GGTAATAATA 
TTGGCGCAAT 
ATCATCATAC 
ATGATGATGA 
TCTAAACTCT 
CCTACAACAG 
GGAGAAAACG 
TTTTCCGGTA 
TTGCAAACGG 
ATTATCTACA 
CGTGGGCAAG 
GCCAATAAAC 
GAACTTGGAG 
GACATTGAGG 
TATCTGGTCA 
CCTGAGTCTG 
CCACAGGCCA 
TCCACCAAGA 



AAATCCGCCG 
CCTTTCATGC 
ACATACTGAT 
CTTCGGACAA 
TAAACTTGCT 
TCGTTITGGT 
GGATTGATAA 
TGTTTACCGG 
GGGTATTCGC 
TCACCGGTIT 
TAGGGTTGGG 
AACAGAACGA 
TTCAGGTGCG 
ATATTGAAGT 
GACAGAACAT 
ACTGGCAAAA 
TGATGCGACT 
GTACGGCCGG 
ACGAACACGA 
GTGGTCGTAT 
ACGACTACCT 
ATGAGGTC6T 
TCGAACTGGT 



ACACCATACC 
CGAGACCATC 
TACACTTTTA 
ACTGCGTCTT 
GTATCGACAT 
AGTCTATGGT 
CGATGCTATG 
GGAATTTCTA 
CCTCCCTATC 
ATCTCGCTCT 
GCGCGTAGAT 
CTTGACTACC 
AGACTGCATG 
ACTCAAAACG 
AGATGACGTA 
ACGTATCAAT 
ACTCATGCAG 
AATGGTCACA 
CACTCGCAAG 
GGAAATAGAT 
TACCGTGGCC 
GGAGATTGCT 
GAAAATGTCC 



GATTCGTCAA 
GGTTTCTCCA 
CTCTCCGGCT 
GAGTTGGACA 
CCGGACCAAC 
CTGCTGATGG 
ATCGTCGTTC 
CCCAAAGCCA 
GTAGCGATCT 
TTTATTCGTC 
CTCGATCATT 
GAAGTGAAAA 
ATCCCACGCA 
ACTTTTATCG 
GTAGGATATA 
ACTACTGTAT 
CGCAAGAAAA 
TTAGAGGAIT 
ATCATAGCCA 
GATGTGAACG 
GGATTTATCC 
CCTTATACT-r 
ATCGACGACC 



GGGGATCGGA 
ACCACCAAAG 
TTTTTTCCGG 
GGAATAGAGG 
TGGTGACTAC 
CGGGATl'GCT 
TCCAATCTGT 
TTTTCAAGAC 
ATTATCTGCT 
TGGTGGACAA 
ATTTGGCAGA 
TCATCCAGAA 
ATGAGATGAT 
ATACCGGTTT 
TCCATTCGAG 
TCGTACCCGA 
GCATTGCGAT 
TGGTAGAAGA 
AACAGCTCGG 
AACGTTTTGG 
TGAATAGCCA 
TTACCATTCT 
AATCGAAC 



CAGCAAAGCC 

AGCAAAAATG 

TGCTGAGATT 

CGATCTCACA 

TCTCCTTGTG 

GGCCGCACCT 

CTTATCCACT 

CAATGCCAAT 

TTATCCTCTG 

GAATTATGTG 

AAATATGTCC 

TGCGCTGGAT 

AGCATGTGAG 

GTCCAAGATC 

CGAAATGTTT 

AAGCATGTAT 

CGTCATCGAT 

GATTTTCGGT 

CCCTCATACC 

GTTGTCCTTG 

TCAAAATATC 

CAGATCTTCT 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2503 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
7D0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1368 



60 



65 



70 



75 



{2) INFORMATION FOR SF.Q ID NO: 41 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 897 base pairs 

(B) TYPE: nucleic acid 
iC) STRAIJDEONESS: double 
(D) TOPOLOGY: circular 

{iij MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

fiv) ANTI-SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORVPHYRa-IONAS GINGIVALIS 
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tlx) ttATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIOfJ 1...897 

(xi) SEQUENCE DESCRIPTION: SEQ 10 HO: 41 



TTAAAAAAGG 
CTGGTTITAC 
GAAGAGCTGT 
GCAGAGGTAG 
CGTTCTTATA 
CCCACTTTTC 
TATCACGGAG 
GTTACCAATC 
GAAGAGAAAG 
GGAACCTTGA 
ACATACATCT 
ACCGTGACCG 
CGAGTAGAAG 
AACGGACGTA 
TATGCAGTGC 



AGATAACTAT 
CCATGCTTAT 
TGGAATCATT 
AAGCTCTGCA 
CCTACACGGT 
TCGATGAAGA 
GCGTATCCGA 
TCACCGGAAC 
CTGAAGAGCC 
TCGGTAATAC 
ACGAAGTGGA 
TGACCAACAG 
GCAAGAAGAT 
CCGTGGCCGT 
GCTTCGATGT 



GAAACAGAAC 
TATGCCATTG 
GACTTTCGTT 
GGAGATAGTC 
CTATCGTGAT 
TGTTCCTGCC 
CAAAGTATGC 
TGCTTCCAAT 
GGCAAGTGAT 
AGCTGAAACT 
AGTAAAGTAT 
CTCATTGAGC 
TATTGCGGAA 
AGCCCCGAAT 
GGGGAATAAA 



TACTTCAAAA 
GAAGTAGCAG 
CCACCGGTTG 
GCTACTGAGG 
GGCGTGAAGA 
GGCGAACATA 
GTGGACGTAG 
GACGAAGTTT 
AAAGCAGTCA 
CATTATGTGG 
CCTGACGGTG 
AATGTAGATG 
GCCCATGGTA 
CGAITGGAAT 
CACCATGl'AT 



GAGTCTGCTC 
CTCAAGAGAT 
AGGAGACAGA 
AGATTGCGGA 
TTGCTTCAGG 
CCTACTGCGT 
AGGTGAAGGA 
CTTTGGACTG 
GCTACAACGT 
AGACCGGTGT 
TATCTCCGAA 
GACAGGCTCC 
TGATCACGCT 
ACATGGCGCA 
CGAAAATACA 



ACTGCTTTGG 
TATTCCGAAC 
CGCAATAGAG 
GCAGGCTGTT 
ATTGACTGAG 
AGAAGTACAG 
CTTCAAACCG 
GGACGGTGTG 
CTACAAGAAT 
AGCCAATGGT 
GGTGGCTGTA 
TTACACATTG 
CTACGACATC 
AACCGGTTTC 
AGTAAGA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
8 97 



(21 IHFORIIATION FOR SEQ ID HO: 4 2 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic t 
(111) HYPOTHETICAL; NO 



(ivj ANTI -SENSE: MO 

(Vi) ORIGIHAL SOURCE: 

(A) ORGANISM: PORYPHYROMWAS GINGIVALIS 

( ix ) FEATURE : 

(A) NAHE/KEY: mlscf eature 

(B) LOCATION 1...H31 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42 



TTGTTTAATC 
TCTGGCTTTA 
CCTGTGTCGG 
CCGl'ATGACA 
TGGAAGACCA 
GGACAGAGTG 
CCGGACAATT 
ATCTAITGGG 
TCTTCTACAG 
GCCAAACGCA 
CAAAGAAAGG 
ACGGATAATT 
AGAGCTCCGT 
ATAGCGAGTG 
GACTATTGTG 
GTGGTGGCAA 
GGCAACACCA 
CGGCTGATAG 
ATTAAAATAC 



ATAAAAAATC 
TTCTTTTGTG 
ATTTACGTGC 
ATCCGATGAT 
TTGACGCAGA 
Gl'CTCTGTGT 
ATCTGATAAC 
TATGTACTCA 
GCAATAATGC 
TACAATCCCC 
TGGTACTCCC 
TCTGGCTCAA 
GTCCGCATCC 
GA'I-TGTCGGC 
TCCAAGTCAA 
ATTCTGCAAA 
TTGTAGCGAG 
CTTCCGGCTG 
AGGTAAACGG 



ATGGTATGAA 
TGAAAATACC 
AGAAGCGTAC 
TCCTCTAAGC 
TGGCGATGGC 
CTCTTCGGCT 
ACCCGAATTA 
AGATCTCACT 
TGCTGACTTT 
CGAGTTGATC 
TAACGATACT 
TTTGGATGAA 
GGGTGGTTAC 
ATTGGCATAT 
TTATCTGCAA 
CJVTCTATGGG 
TGCTTTCAAA 
CGATACGCTT 
AACTGTCTAT 



ACATTTCAAT 
CTTGCACAAC 
GGCTCTACCG 
GAGAGTTTTG 
TATAATTGGA 
TCATACATAG 
AAACTACCCA 
GCTCCATCGG 
GTTAATCTCT 
CGCGGAAATC 
AAATATGTTG 
GTATCTATCC 
ACTTATTCTG 
ATCGATACGG 
GGAGACTCGT 
GCGGATAAGC 
GGAGAGATCA 
AGGTACAAAG 
ACTGAGAAAA 



TTTATCTCGT 
AAAAAACAGA 
TTTTCCTCCA 
AATCAGGTAT 
TGCATTTGAC 
GCGGCGTCGG 
CAGACGCGTT 
AGCACTATGC 
TATATGAAGA 
GGACACAAGG 
CTTTCCGCCA 
TGTATACCCC 
TATTCCGTGA 
ATGTACCGTA 
ATAAAGTCTG 
CTTTTGCGTT 
CTCTTTATGA 
CGGAAAATGG 
TCCAAATCCA 



TGTTTTCCGC 
GGAGTTTGCA 
CTGGACTCCG 
TCCAGCTATA 
CAATTTCACG 
AGCTTTGACT 
GGTGGAAATA 
CGTTTATTCC 
GACTTTGACT 
TGTTTGGTAT 
TTTTAATTCC 
TCTTCCCCGA 
TGGACA7VAAG 
TGGGACTCAA 
CAAAAATATA 
GACCGTGGTT 
CATTCGTGGC 
TTTTTACCTC 
A 



€0 
120 
180 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1131 



(2) iriFORIlATION FOR SEQ ID NO: 43 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2547 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED! I ESS : double 

(D) TOPOLOGY: circular 
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( i i ) MOLECULE PE : DMA | genotni c ) 

{ilil HYPOTHETICAL: NO 

liv) AUTI -SENSE: NO 

(vi) ORIGIHAL SOURCE: 

lAl ORGAJJISH: PORYPHYROMOHAS GIMGIVALIS 

(ix) FEATURE: 

(A) my^E/y.XY: TOisc_£eature 

(B) LOCATION 1 . . .2547 

Ixi> SF.QIJEMCE DESCRIPTION: SEQ ID NO:43 



TTCGGAATAT 
ATTGCCATGC 
CTGGCTTTGC 
ATCGATTACG 
TCTCCTGCAT 
GCAGATGACA 
AGTATGCCGG 
CTTTCCGGAA 
GTACA'l'GCTC 
TACCCATGGA 
GTTGCTACTG 
TCTTTCGATT 
TACGACTGGA 
GATGCCTACG 
AATGGAAGTG 
AAGCGTTCAC 
ATCCGCGGGG 
GGACATGCl'T 
TGGGGAGGTG 
ATCGGAGGTG 
GCTAAGACTC 
GAAGCCOAGT 
GAAGAGCAAT 
GAGGTGAAAA 
TTCTCATTGG 
CGCACAGGCA 
ATTAAAGTAA 
AAGCTCAGTA 
ATTACAGTAC 
CTATCTACAG 
CCGG6CGGTA 
GGCGACTATA 
ATCGGAAGCG 
CATAATCAAA 
ACACTCAAAA 
CAATCCTCAT 
ACTTTCGTAT 
CTCCATGCAT 
GTGAAGATCG 
TTCCCTAATC 
TCTATCATTC 
CATGGCAGAA 
TATACGACGA 



CACCTTCAAT 
AGGGACATTC 
GACAGGTATC 
TTTATCGGCA 
ATTTTTATGT 
GAATACCGAC 
ACAATCTTCG 
AAGCTCAGCT 
TGATGGATAA 
ACAATAAGGA 
CTGCAGCACA 
ATCATGCAGG 
TCAATATGCC 
CCACACTGAT 
GTACGTACAG 
TGCAGCTACA 
AACTTGCCTC 
TCGTTTGCGA 
TTTCCAACGG 
AGGGAATAGG 
CCGCTGAAGC 
ATAAAAGTGA 
CAAATCTTGA 
CTTCATCTAT 
CACCTAATCA 
CCGAACAGTG 
ATACGACAGA 
TCGTCCCCAA 
AGTTCAATAG 
GAGCTACTGC 
GCAGCAACTA 
CATTGTGGTA 
TGTCAGTAAA 
CTTCTACCTA 
ATCTCGGATT 
CGGGGTCTTT 
ATAAACCTGT 
TCGTAAACGG 
TCAATGGTAC 
CGGCACGCGA 
TTTTCGATCT 
TGGATGTCAG 
AAATAAATAT 



GAAAAAAAGT 
TGCTCCGGTT 
CTTGCGAATG 
AGGAGATGCT 
AGCTAATCGT 
AATTTTAGCC 
CATGTQGCTA 
CAATGAAGAG 
CGGTCATTTT 
ACCACTGCTT 
AATCATGCGC 
•ITCATTAGTT 
CGGAAATCCC 
GCGTGATGTG 
CGTTTATGTA 
TGTACGCGCC 
CGGAAGGCCG 
TGGTTATGCT 
CTTCTACAAA 
TTTTACCATT 
CGGTACAGAT 
ATCCGGATTG 
CCTCGGATAC 
CAATATCTCT 
GTTGTCACAA 
GGAGCCGGTA 
CCCGAACAAT 
CAGCTTTGTC 
CGACAGCCCr 
GGACGATGTA 
TCCGGTGGTT 
TAGATTTTCC 
AACACCGACA 
TACGCTGGAT 
GCCTTTCAAT 
ATGGGCAGCT 
TGTCGAAGGC 
ACAACAACAG 
AGCCGTAGAA 
TTATGTGGAA 
GTCAGGCAAG 
CCGACTTCCT 
AGTGCAC 



TTTCTTTTAG 
ACGAAAGAGC 
GGACAAACAG 
GAGAGGGGTA 
GGAAATAATG 
TATTCACCCA 
CAAATTTACG 
ATATTACGTA 
GCCAACGATC 
CCTAATGGCA 
TACCATAGCT 
GGCAACTGGT 
GACCTTGATA 
AGTGCATCTG 
GTAGGAGCCT 
TTATATACCT 
GTCTATTATG 
TCGGATGGTA 
CTAACACTCC 
TATCAAGAGA 
GCCTTGCCGA 
AACGTAGGGT 
AGATTGAACA 
TGGTACGGAT 
GGAATCAACA 
CGGCATGCAC 
GTCGTAGTCA 
GCAGATCTGA 
GATGAGATCC 
ATATCTTTGG 
TGGTCTAAAG 
ATCAACAACC 
GAGTATACGC 
ATGGCACACA 
GGTGAGTTGG 
CAAGAAACAG 
CCTATACCTG 
TTGTACCTCA 
GCAATAGAAT 
ATATCGGCAC 
ATTGTCATGA 
AATGGGGCCT 



CCATAGTAAT 
GAGCTTTGAG 
CAGTATCTGA 
TCACATCACA 
AGGGCTATGC 
TTGGCCGTTT 
ATCAGGAAAT 
CCGAGGGCGT 
CCATGCGATG 
ATCATGCCTA 
GGCCGCTTCA 
CCGGCACATT 
ATCi'GACTCA 
TTTCGATGAG 
TGCGAAACAA 
CACAGGAGTG 
CAGGGAATAA 
CTTTCCATTT 
TCTCGCCGAC 
TCATCACCGG 
TCTTGGCACT 
ATTCGATATA 
AGGCTGACGG 
ACGGAGAGCA 
CCATCACCCT 
AGGGAGGATA 
CGGTAGATAA 
ATTCTTATGA 
GTACACCCGT 
GCTGGGTAAT 
ACGTTCTCAC 
AAAAGGATGA 
ACCCCTTATT 
ACAGAGTATT 
TTGTTGTTTT 
TACATATCAA 
ATGGATCCTA 
AGGGGAAAAG 
CGTCAGAAGA 
CTTGCATTCC 
AGAATAGTTT 
ACATCCTTAA 



GCTCl-rTGGC 
TCTGGCTCGG 
CAArj\TTTCC 
AGAGGAAGGC 
TGTTGTAGCA 
CGACATGGAC 
AGGCCTGATA 
ACCGGCTGAA 
GAATCAAGGT 
TACCGGCTGT 
AGGTGAAGGC 
TGGTGAAATG 
ATCTCAAGTG 
TTTTTATGAA 
CTTTCGCtAC 
GCACGATATG 
CCAGAGCATA 
CAACTGGGGT 
*TTCGT*l'GGGT 
TATCGAACCG 
GAAAGACATA 
TAATACAGGT 
AGAAGTCATA 
TCCCGAGAGT 
ACTTTATCGT 
TGTCAATAGC 
TAACGAAGGC 
ACATAGTACG 
AGCCTTTGCT 
GGCrGAAGTT 
TCTCTCGGAA 
ATGGAAAAAG 
CGAAGTGGCC 
GCCCGACTTT 
CCGCCAAACA 
GCAAGGAGAA 
TCGTGCGACC 
GAACTACACG 
GATCAGAGTA 
CCAAGAAACA 
ATCAGCGGGG 
GGTGGATGGA 



(2) IMFORTIATIOW FOR SEQ ID NO: 4 4 

(1) SEQUENCE CHARACTERT5TICS: 

(A) LENGTH: HSS base pale? 
(E) TYPE; nuclei = acid 
(C) STRANDEDNESS : double 
IV) TOPOl^Y: circular 

(ii) HOLECUT.F. TYPE: DHA t genomic) 

(iii) HYPOTHETICAL: NO 

tiv) AIJtl-SENSE: HO 

(v-i) ORIGINAL SOliRCE: 



60 
120 
190 
240 
300 
360 
420 
400 
540 
600 
660 
720 
760 
640 
900 
960 
1020 
1080 
ll-lO 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1960 
1920 
1900 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2547 
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(A) ORGWJISM: PORYPHYRWOIJAS GINGIVALIS 

(ix) FEATURE: 

fA) NAME/KEY: tnisc_f ea t ure 
5 13) LOCATION 

{Xi) SEQI.IEJ>ICE DESCRIPTION: SEQ ID WO: -14 

ACGAAGAAAA CACTGATGAA AAAGCTATTT CTCTCGCTCA CGAGTCTTGT AATGGTCTTC 60 

10 GCTGTTGCAA GTTGCGATAT AATCGACAAG GATCAAACCC TCTTGCCGGC TCCGACCAAT 120 

GTGACACCCG ATAATCCGGA TGACAATCCT TCGGAGATCG ACATrACGCA GACGCACACA 180 

GAAAAATATG TTTTGGCTGA AGAATTTACC GGCCAAAAAT GTCTCAACTG TCCGAAAGGT 24 0 

CATCaCAAAC T6GCGGCTC1' CAAGGAGCAA TACGGTAAGA GATTGACTCT TGTCGGTATA 300 

CATGCCGGCC CTGGATCTCT CGTGCCACCT CTTTTCCGTA CAGAAGCCGG AGACGCATAT 360 

15 TATAGCAAGT TCGCCAATAA TACCCCTCTC CCTGCGCTGA TGGTTTCGCG CAAAAAGTTC 420 

GGCTCri'CCT ACGTTTATGA TAAGAGCTAC AAAACGTGGG ACGTGCCTAT TGCCGAGCAG 480 

ATGGAGCAAA AGGCGAAGAT CAATATCTTT GCCGTGGCCC AATACACCGA TACCCAAAAG 5^0 

ATCAAGGTGA CTGTAAAGGG TAAAAtACTG GAGGGGAATA CACTCCCGAA GTCCATGGTT 600 

CAGGTGTA7C TGTTGGAGGA TAAGCTGATC GCTCCGCAGG TGGATGGCAA TACGACAGTC 660 

20 GA6AATTACG AGCACAATCA CGTGTTGCGT GGAGCCGITA ATGGTATTTG GGGCGAAGAA 720 

TTTGTGAATC TCAAAGATTA TTTGTATACT TACGCCGTTG AACCGCTCTC GGGTATGTCC 7 90 

TTCGTAGCCG AGAATTATTC GATTQTGGCT TTTGTATACG ATGTGCAGAC CTTCGAAGTG 84 0 

TATGACGTTG TGCATGTAAA GATCAATCCG CAATCCGATG GCAAA 885 



25 



35 



(2) rwroRHATIOJJ FOR SEQ ID NO: 'IS 



(i) SEQL'ENCE CHARACTERISTICS: 

(A) LENGTH: 690 base pairs 
30 IB( TYPE: nucleic acid 

tCt STR7UJDEDNESS: double 
(Di TOPOLOGY: circular 



till HOT.ECULE TYPE: UHA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AMTT -SENSE: NO 



40 fvi) ORIGINAL SOURCE: 

(AJ ORGANISM: PORYPHYROMONAS GXNGIVALIS 

Ux) FEATURE: 

(A) NAME/ KEY: mi3c_featur- 
45 <B} LOCATION L...690 

(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 4S 

ACGAATAAAA AAGAAGAGAC AATGAAGAAA TCAAGTGTAG TAGCCTCAGT TTTGGCCGTG SO 

50 GCTCTCGT(3T TC6CCGGTTG CGGACTGAAC AATATGGCAA AAGGCGGCCT TATCGGCGCC 120 

GGAGTAGGAG GTGCCATTGG TGCCGGAGTA GGTAACGTAG CCGGAAATAC GGCTGTCGGT 180 

GCCATCGTCG GTACTGCAGT CGGTGGAGCA GCCGGTGCTC TCATCGGAAA GAAGATGGAC 24 0 

AAGCAGAAAA AAGAACTGGA GGCCGCAGTA CCCGATGCTA CGATTCAGAC AGTAAATGAC 300 

GGAGAGGCTA TrCTGGTTAC TTTCGATAGC GGTATCCTCT TTGCGACGAA CTCCAGCACT 360 

55 CTGAGTCCCA ACTCACGCAC TGCGCTGACG AAGTTTGCTG CAAACATGAA CAAAAACCCC 420 

GACACGGATA TTCGTATCGT AGGCCATACG GACAATACCG GCTCCGACAA GATCAACGAT 480 

CCTCTGTCTG AGAGACGTGC AGCCAGQGTA TATTCTTTCC TGAATTCTCA GGGTGTGA6T 540 

ATGTCGCGCA TGGCAGCCGA AGGGCGTGGG AGCCATGAAC CGGTTGCAGA CAATAGCACA 600 

GTTGCCCGAC GTTCGGCCAA CCGCCGTGTG GAGGTTTATA TCTTGCCGAA TGCCAAGATG 660 

60 ATCGAACAAG CACAGCAAGG TACGCTGAAG 690 

(2) INFORMATIOr^ FOR SEQ ID NO: 46 

65 (il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1026 base pairs 
(BJ TYPE: nucleic acid 
(CI STRAHDEDNESS: double 
(D) TOPOLOGY: circular 

70 

(11) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
75 (Iv) ANTI-SENSE: NO 
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ivi) ORIGINAL SOURCE; 

(A) ORGAMISW: PORYPHYROMOMAS GIMGIV7U,IS 

{ ix ) FEATURE : 

(A) rOAIIE/KEY: raise feature 

(B) LOCATION 1...102 6 

SEQUENCE DESCRIPTION: SEQ ID NO: 4 6 

AACAGGAACA GAAATATGTC GAAAAAATCG ATCCTTCTGC TTTGCTGTTC GCTGTGCTTC CO 

ATTTCTGCTA CGAAGGCTGT GACCCCCGTC AGAAATGTGC GCAATAGCCA ACTG^S^C Jo 

AAAGCAAAGA CCGAACGTAC AAAGCCCTCG GACTCTGTAC GGTA^G ^^GA^^ IBO 

GCAGATCGGC TGGAGTTCCG CAACAAGATT TCTTCCGAAA AAGAGGTAAG AAAAGCCGAA 24 0 

TATGAAAATC GGCTGGCGAT GGAAGCACTC AATTACCCTG CCATAGATTT AT^^5^ 3o2 

SJJS^IJ^ fSfSS^^^^ AAACCCTTTC GTGGGTGCAG GAAC^S? C^^^ 360 

AACTCCTATG ACATTGATTG CTCTTCGTTC GTGATGCCCG TCGAAGATAA GCAGGTCACC 420 

TCTCAATTTG GCTACCGTCG GCGTTTCGGA CGGATGCACT ATGCTATTGA ?S??^CTG ilo 

AATCGTGGCG ATACGATACG AGCAGCCTTT GACGGGAAAG TTCGTGTACG cSct^SI 540 

^^^^ r?™^!:S CATAGTCTTG CGCCATCCGA ACGGACTGGA S^OT^ toO 

SS^™?^^^ GTCRCCAATT GGTAGACGAG AATCAGATCG TTCGAGCAGG ACAACCGATC S60 

GGATTAGGAG GCAGCACGGG TCGAAGCACC GGTCCTCATC TTCACTTCGA GACCCGCTTC llo 

ATGGGTATTC CCATCAATCC GAGTACCATT ATAGACTTCG ATAaSgAGT ^C^^A llo 

GACATTTACA CATTCAAACG AGGGAGCAAT TCTCGCTATG CAA^GCCTC tS^^^S 8^0 

tctcgctatg caaaaaaagg gaagaaaggc agacaagctt cttctcctat ga^ctSIS laa 

GCGATACTTT GGAAACAATA GCCAAAAGGC S^gSS?? ?^55Sg^^ 060 

CCAATGGCAT TGGCAAGAGT AAAATTTTCA CTCCGGGCAA iSJ^GA^ 1020 



1026 



(2) TKFORHATION FOR SEQ ID NO: 47 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 477 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA {genomic} 

(ill) HYPOTHETICAL: NO 

(iv) Al^TI-SEMSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROT-tONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAl-lE/KEY: misc feature 

(B) LOCATION 1 . . . 477 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47 

S?CCT^ ?Sr??rI?? ^n^"""^ AAAATCAATT TCTATGCTGA AGGCGTCAGC 60 
CTTCCTCuGA TCAGAAGACG GATCGTCGGT AAGTGGATAG CCGAAGTATG CAGCCGATAT l 
GGGAAAGCGG TGGGAGAAAT CTCCTATCTT TTCTGTGATG ACGAaStI? OT^^GC? llo 

^^J^JSf^'" ^-^^ATCA tgactactac accgacatS tSccttcgI ?5?^g^ 2!o 

^S^^^^' TGAATGGCGA CCTGCTTATC AGTCTCGATA CCGTACGCTC GAA^G^OT 300 
GCTCTTGATC TTCGATACGA AGACGAACTG CATCGTGTCA TTATCCACGG CATACTGCAT 
CTTPGCGGAT TGAAAGACAA GAGCAAAAAG gatgaagccc aaatgottgc ag^cSSSg 
AAAGCCCTTG TCATGCTGCG AGAAACCATC GGATCGGaSc ^CC^l^ S^TA^ 



360 
420 
477 



(2) INFORltATION FOR SEQ ID NO:48 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1185 base pairs 

(B) TYPE: nucleic acid 

(C) STRAigDEDNESS: double 
(Dl TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill I HYPOTHETICAL: NO 
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(iv) WITI-SEMSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROHONAS GINGIVALIS 

( i^c I FEATURE : 

(A) NAME/KEY: misc featvjr- 

(B) LOCATION 1. . .lies 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECINESS: double 

( D ) TOPOLOGY : c i r cu 1 a r 

(11) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv» AUri- SENSE: NO 



PCT/AU98/01023 



SEQUENCE DESCRIPTION: SEQ ID NO: 4 8 

AAAAGTAAAA Cl'ATGAAGGT AAAGTACTTA ATGCTCACAT TGGTTGGAGC AATTCTArrr At 
AACC^GTG CACAGGAGAA TACTGTACCG GCAACGGGTC AGtJJc^GC ^SJ^S?? J 
GCTTTTGCTC GCAATAAAGC AGGCAGCAAT TGGTTTGTAA CACrrrl^^ J^^l^ 

TCTCT-IICTG TCGGAAAGTA TCACAGCCCT TTCTTTGCAA CTCGTTTGCA AATTAawa ^ru 

^^I™^ ACACTTTCCT CGGAAAAAAT GGCGAAcSS ^^^JJ??^? 36 

GCAGCTCACT TCGACTTTAT GTTTGATGTG GTTAACTACT TTGCA^CATA ^rili^l a^c 

CGTTTCTTCC ATTTAATTCC ATGGGTAGGT GTTGgSaS!. AAC^^?? ttr 

??^GC^GA GTAOAATGGA ACCCCgJct? tS^^ 

HL^ut-AAtFA AAGCTTGGAA TCGTGTTGTA ATCGTTCGCT CCAAG 1185 

(2) IHFOar-lATTOI^ FOR SEQ ID NO:49 

do 



(vi) ORIGINAL SOURCE: 
50 ORGAHISM: PORYPHYRO^^OI^^AS GINGIVALIS 

( J X ) FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATION 1...1161 

f^i> SEQUENCE DESCRIPTION: SEQ ID NO: 49 

TATAAAATGA CATACAGAAT TATGAAAGCT AAATCTTTAT TATTAGOVCT Tr^rnr-T-r^n 
GCATGCACAT Tr'afrnr-nn/^ n/^r«<^«.». - ™-w-»a»-4 ^ i/m xt\i JMlyf^CTT TGCGGGTCTC 60 

GCCGGAaJI^ CgS^^^ GTAGACTTCA ATATTGAAGG ACAAGCTTTT 600 
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GTAGTAATGA CTGCAGCGGA A 1161 
(2) IHFORl'lATION FOR SEQ ID NO: 50 

5 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SB5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAh^DEONESS: double 
10 (D) TOPOIXJGY: circular 

(ii) MOLECULE TYPE: DMA | genomic) 

(iil) HYPOTHETICAL: NO 

(iv) AIlTl-SENSE: NO 



15 



20 



(vi) ORIGIN7VL SOURCE: 

(A) ORGAf4ISH: PORYPHYRO»IO?IAS GIHGIVALIS 

tiX) fEATURE: 

(A) NAIIF./KEY: misc_feature 

(B) LOCATION 3 . . .585 

25 <xi) SEQUENCE DESCRIPTIOrJ: SEQ ID NO: 50 

GTAATTGGCA TCATTATGGA ATTTTTCATG TTATTCATAG CGGCGGTTTT CGTTAATAAC 60 

GTCGTGCTGT CGCAGTTCCT CGGTATATGC CCATTCTTAG GCGTATCGAA GAAGGTAGAC 120 

ACCTCAATCG GTATGGGTGC AGCCGTGACA TTCGTATTGG CACTGGCTAC CTTGGTTACC 180 

30 TTCCTGATTC AG/^GTTCGT TTTGGATCGT TTCGGATTGG GCTTTATGCA GACCATTGCA 24 0 

TTTATTTTGG TCATTGCCGC CTTGGTGCAG ATGGTGGAGA TCATACTCAA GAAAGTATCT 300 

CCTCCCCICT ATCAGGCACT GGGTGTATTC TTCCCCTTGA TTACGACGAA CTGCTGTGTG 360 

CTCGGTGTCG CTATTTTGGT TATCCAGAAG GATTATACCC TGCTCCAGAG CTTCGTCTAT 4 20 

GCAATATCCA CGGCTATCGG TTTCACCTTG GCAATGGTTA CTTTCGCAGG TATTCGAGAG 4 80 

35 CAACTCGATA TGACCAATCT CCCCAAAGCT ATGAAGGGAA TACCTTCGGC ACTCTTGGCT 54 0 

GCCGGTATAT TGGCTATGGC TTTCATGGGC TTCAGCGGTA TCGCC 585 



40 



50 



55 



(2) INFORMATION FOR SEQ ID NO: 51 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2628 base pairs 
(BJ TYPE: nucleic acid 
(CI STRAI^DEDNESS: double 
45 (Dl TOPOLOGY: circular 



(ii) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv> AI IT 1 -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROi^ONAS GIHGIVALIS 



( ix ) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...262B 

60 (xij SEQUENCE DESCRIPTION: SEQ ID NO: 51 

TACCGATCTT ATCGTGGGAT AGGGAGTGGG ACACACTCTC CTAACCTCAA AAACCGACTA 60 

AAAAGGATCG GAATAAGGAT ACCGAACAGA CACTATATCC ATATCAAGCC AATCAAACCA 120 

AAAAATAAAA TGAAACAACT AAACATTATC AGCTTCATCA TTGCTTTCCT ATTCTTAGGA 180 

65 ACGAGCGCAT CGGCTCAGCA ATCGGGCGGA TCCGTTACAG GTACCGTAGT GGACAAAAGC 24 0 

TCAAAAGAAC CTATCGCATA CGTACAAGTA TTCGTCAAAG GAACCACTCT CGGAACTTCC 300 

ACGGATGCAA ACGGAAACTA CTCGATCAAG GGAATCCCTT CGGGTAATCA AACTATCGTA 360 

GCCCGACTCA TGGGTTACTC CACTTGCGAA GAAAAAGTAC ATATAGAAAA GGGTGGTTCC 4 20 

CGCCACGTAG ACCTCTATCT GACCGAAGAG ATTCTCTCTC TCGATGGGGT AGTGGTATCT 4 80 

70 GCCAATAGAA ACGAGACTTT CCGCCGTCAA GCACCCTCGT TGGTAACGGT ACTGTCGCCG 54 0 

GAACTTTTCC TCAAAACCAA CTCTACCAAC CTGAGTCAGG GACTTAAGTT CCAGCCCGGT 600 

CTGCGCGTGG AGGACAACTG TCAGAACTGC G(3TTTCAACC AAGTTCGTAT CAATGGACTC 660 

GAAGGAGCCT ATTCGCAAAT TCTTATCGAC AGCCATCCXTA TCTTCAGTTC GCTTGCCGGT 720 

GTCTATGGCT TGGAGCAGAT GCCrCCCAAT ATGATCGAAC GTGTAGAAGT AATTCGCGGT 78 0 

75 GGAGGTTCGG CTCTGTTCGC CTCTAATGCT GTGGGAGGCG TTATCAACGT AATTACGAAA 84 0 
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GAACCGCTTC 
GGGTGGGGGA 
AAAGCCGGTG 
GACAACTTTA 
AAGACCGGTC 
GGTGGCGACA 
TATATCAATG 
ACTCTGTATG 
ACCGAAAATC 
GATGCTTTCA 
GGTATGTACC 
TACGGCCAAC 
ACTTTTAGTC 
GCACGTATCG 
ATTTTCAGTC 
TCATACAGCG 
GCCGGTGGTA 
AGCATCAGTG 
GGAQAAGCCT 
ACGACCGATG 
AAGGTATATG 
CTCGGCGGTA 
AAAACAACGG 
GGCTATTTCG 
TTCACGGGCA 
ATAGCTCCGG 
GAAGGTCATG 
CACGACTTCC 
TTCAACAGCT 
GGTCCTATGC 



GCAATTCGCC 
GCTTCCAAAA 
TCATGGTATT 
CCGAACTACC 
TCTACAGCAA 
GACTGGACAA 
GCGGAAGTTT 
CTTCAGCACA 
TGCTGAACGG 
CGGCTCTTAC 
GTCATACCTT 
TCGATGACAG 
AGTACGACCA 
ACTATGTTCT 
CTAGAGCCAA 
AAGGATTCCG 
CTCCTATCAG 
CTTCTTTCGA 
TCTCCACCTT 
GCAAAGAATG 
GTGTGAATCT 
CATGGrjVGAG 
GACAAGCCGA 
TTGCTACGGT 
AAATGGATGT 
ACGGATCGTT 
CCAAGCTCGT 
ACCTTGCTTC 
ATCAGAAAGA 
AGCCCAGAAG 



CGAGATCAGC 
TACGACCCAG 
CGGCCAACAC 
CAATCTGCGC 
AGCAACCCTC 
TCCTCCTTTC 
CAAATTCGAT 
AGACGTTCAG 
AGCAGTTCAG 
TTCCTACGGG 
CGGAGAAAAC 
AAGCGGCTAC 
GCTCGAATAT 
CCTCAATCAG 
CGTACGATAC 
CGCTCCTCAG 
CCGTGTCCTT 
TTATTACCAC 
TATCAGCAAT 
GATCATTCGT 
GGAGGG7\AGA 
AAGCCGCTAC 
GATCTCTGTG 
ACGTCCTACC 
AGTACACGAA 
CGACITTGAA 
CAAGACTCCG 
CACTATGACC 
CACGGACAAG 
GATTTTCGTC 



CATTCTACCA 
TTCAACGGTT 
AACTACCGTC 
AACCGCTCGC 
GAATATCACA 
GAAGCCCAGA 
CAGGGCTTCA 
CGTCGTAGCT 
AGTGGAAGCA 
ACTACCAAGG 
T6GGACTTTA 
AGACCGAGCA 
AAGACGGAGA 
GATCGCAAAC 
AATCCCAATA 
TATTTCGATG 
TCCCCCAATC 
AGAGCCGACG 
CAGTTCAAAC 
ACCATCTACA 
ATCGCCTACA 
GGAAGCATCT 
AAAGACTATG 
GAGCACTTCG 
GCCTATGAAG 
ATGAATGGTC 
GCCTTCGCCG 
•ITGGAATTGA 
GGACCGGGTA 
GGTACAAAGA 



TGACCl'TCGA 
CTATGCTGAC 
CCGGACAGGA 
TCGGTTTCCG 
GCATGCAGGA 
TAGCGGAATA 
GCGGTGGCAA 
ACTACGGGGC 
CCGAATCGGA 
GATTCGATTT 
CCGGCGGACT 
AAATAGATCA 
AGTTAAGTGC 
GCTATATCGA 
AGAATCTCAG 
AAGATCTGCA 
TGAAAGAAGA 
AATGGCAATT 
CATOCGATAA 
ACGACAAGGA 
ACAAATCGTT 
ATACCGCTGT 
TACGCACTCC 
CCATCAATCT 
GCGATATTCC 
AGCAATTCAA 
ATATAGACCT 
ATGCCGGAAT 
GAGCTTCTAC 
rcAATTTC 



CCACGCGAAA 
GGAAGACCGC 
TATAGACGGC 
CTCATACTAT 
GTACCGTCGT 
TCTCCAGCAC 
GGATTTCTTC 
TGGCGACTAT 
CGAATACAAC 
GCAAGGAGGA 
CGAATATATC 
GAATACCTCr 
CCTTATCGGA 
TCCGCTCTTC 
CTTCCGACTC 
CGTAGAGTTG 
ACGTTCACGA 
CAATATCATG 
GGTCGAAACC 
TGGAGTATCG 
CGACCTCCAG 
GGAAGCGGAC 
GAATCTGTAC 
CTCCGGTACA 
CGCAGAACAC 
AGGTTTGGCC 
CAAGCTGAGC 
ACAGAACATA 
TTACGTATAC 



900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1140 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2628 



(2) INFORMATION FOR SEQ ID NO: 52 

35 ix) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2697 base pairs 
{B) TYPE: nucloic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 

40 

(11) MOLECULE TYPE; DNA (genomic) 

(ill) HYPOTHETICAL: NO 

45 (Av) AtJTI -SENSE: NO 

tvi) ORIGINAL SOURCE: 

iA) ORGANISM: PORYtHYROHOt^AS GINGIVALIS 

50 Ux) FEATURE: 

(A) NAME/KEY: misc_featucft 
iB] LOCATION 1. . .2697 



55 



ixi) SEQUENCE DESCRIPTION: SEQ lU NO: 52 



ATATTGAATC ATTTGAGAAA GACTATGTAC AAAAAGATTA TTGCCGl'AGC AGCTCTCTTC 60 

TGCGCCAGCA TAGGGATCCT GAAAGGACAG TCCTCGGATC TGACCCCTCA GGATACTATA 120 

TATAGCCCTG AAATATCCTA TGCCAAGCCT ATrCATAAGA CCATAGCATC TATTGAGATC 180 

GAGGGAATGA CGTCTTTCGA TGACTTTGTC TTGCGCAATC TTTCAGGCTT GGCTGTAGCT 24 0 

60 GATGAAGTCC TGATTCCTGG AGATGCCATG TCTGCTCCCG TGAATAGAAT TATGCGTCAG 300 

GGCTACrrCT CAAATGTGCG AATCATCGCG GATAAATATG TCGGCAATAA AGTCTATCTG 360 

AAAATCATTG TCACTGAACG TCCTCGGATC AGTAAGGTTA CTTTTAGCGG GGTAAAGAAG 4 20 

TCTGAGAGAG AAGATCTTGA AATGAAAATC GGTCTTCGCG AGGGGATTCA GATGACCAGA 4 80 

AATAATGAAG ACAAGGTCAG GCAAATCGTA CAGAAGTATT TTAGTGAGAA AGGTTATCGC 54 0 

65 GATGCCAGCA TACGGATAAC GCAGGAACCG GATCTTTCCA AAGATGGCrT TGTCAATGTG 600 

CTTATCTCGA TTGAGAAGAA AAGCAAAACC AAGGTGAATG AAATTTATTT rPCCGGCAAC 660 

AAGGCCCTTA GCAATCATAA GCTAAGAATG GCGATGAAGA ACACCAATGC CAAATTCAGT 720 

CTTAGAAAOC ATATTCGCTC ATCTTTCTTG AAACTTFTTA GTACTCATAA GTTTGTGGAA 7 80 

GAGAGCTACC GTGAAGATTT GGTCCGATTG ATAGAGAAGT ATCAGGAATA TGGATATCGT 84 0 

70 GATGCTGAAA TACTGACCGA CAGTGTCGTG AAGGCTCCTG ACGGCAAAAG AGTGGATATT 900 

TATCTCAACA TCGAAGAGGG GCAGAAGTAT TATATTAAGG ATGTCAACTT TGTGGGCAAT 960 

TCACAATATC CATCGGAGTA TTTGGAACGA GTGCTCGGAA TAAAATCCGG AGATGTGTAC 1020 

AATCAGAGAC GATTGGCTAA GCGTCTCAAT GAAGATGAAG ATGCTGTGGG GAACCTGTAC lOBO 

TATAACAATG GCTATATTl^T TGCGTGGGTC GATCCCGTGG AAACAAATGT AGTGGGGGAT 114 0 

75 TCTGTTTCGC TTCATATTCG TATAGCGGAG GGGAAGCAGG CCAATATCA^^ TAAGCT6ATC 1200 



SUBSTITUTE SHEET (Role 26) (RO/AXJ) 



ir=?\ :«=--H ii™ 

iUii r:ii oqj 




i^z C3i Oi lo ^al ic. o tlj o 



wo 99/29870 



PCT/AU98/01023 



35 / 490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



ATCAAAGGAA ATACTGTCGT GTACGAAGAC GTAGTACGCC GAGAGCTTTA CACAAAGCCC 12(j0 

GGCCAGCTCT TTAGTCGCGA GGATATCATT AACTCTATTC GTCTCATCAA TCAGCTTGGG 1320 

CATTTCGATG CCGAAAAATC TATTCCCCGT CCGA'ITCCCA ATCCCGAAAC AGGAACAGTG 1380 

GATATAGAGT ATGATTTGGT GCCGCGTAGC AGTGACCAAT TGGAGCTTTC TGTCGGTTGG 14 40 

AGTCAGTCCG GACTTCTGTT CCGAGGAGCC ATTAAGTTCA CGAACTTCTC TGTCGGCAAC L500 

TTGCTCCATC CCTCGATGTA TAAGAAAGGG ATCATTCCGC AAGGGGATGG GCAAACACTA 1560 

TCACTGAGTG CTCAGACCAA TGGAAAGTAC TATCAGCAGT ATAGTGTCAC ATTTATGGAT 1620 

CCATGGTTTG GGGGCAAGCG GCCGGATATG TTCAGCTTCA 6TGCATTCTA TTCCAAGACT 1680 

ACGGCGATTG ACTCCAAGTT CTACAATAGC AATGCCGGCA ACTACTATAA TGCCTACTAT IZ-lO 

AATAGCTACT ACAACAACTA TAATAGTTAT TACAACGGTA TGTCGAACTA TACCGGCGAC 1800 

CTCTATACTC AGGCCAGCGA TCCGGATCGT TCGCTTCAGA TGTTAGGTAC TTCGATCGGT 1860 

TACGGTAAGC GTTTGACTTG GCCGGACAAT TGGTTCCAGA TITATACTTC TCTGAACTAC 1920 

ACCTACTATA GACTGCGAAA TTGGAGCTAC AATACCTTCC AAAATTTCCA TCATGGCTCG 1980 

GCTAATGATC TCAACTTGGA GCTGCGTCTC TCTCGTACTT CCATCGATAA TCCTATTTAT 204 0 

ACCAGAAGCG GATCGGATTT CATGGTTTCT GTrGCTGCTA CTCTTCCTTA TTCTTTGTGG 2100 

GACAATCATG ACTATGCCAG CXAGAACCTC AGCCTAACCG ATCCTTACAG ATTTATCGAG 2160 

TATCACAAGT GGAAGTTTAG AGGACGAGTT TTTACTCCAT TGCTCAATCC TGCTACGCAT 2220 

AAATATACAC CGGTGCTCAT GAGTCGAGTG GAAGGAGCAG TTCTTGGTTC GTATAATTCC 2280 

AATAAGAAAT CTCCTTTCGG TACTTTCTAT ATGGGAGGTG ATGGTATGTC CAGCTATTAT 2340 

GGTGGCTACA TGAATGAGAC TATAGGTTTG CCTGGTTATA AGAACGGATC TATTGCCGGT 2400 

AATAACTACG ACTATGCATA TGCTTATATG CGGCTTACGA TGGAACTACG TTTCCCGATT 24 60 

CTGTTTGAAA ACTCATTCAA T6CGTGGCTC TTAGCTTTTG CCGAAGCAGG CAATGCGTGG 2520 

CGCAGl'ATCG ACAATTATAA TCCCTTTAAC Cl'GAAGCGAT CGGCCGGTGT AGGATTGCGT 2580 

GTAACGTTAC CGATGGTCGG AATGCTCGGT ATCCATTGGG GATATGGCTT TGACCGTCCG 2640 

GACAATTCTC TACAGCGAGG AGGAAGCAAT GTCCACTTTG TGCTCGGACA GGAGTTC 2697 



(2) INFORIIATION FOR SEQ ID NO: 53 

ti) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 531 base pairs 
IB) TYPE: nucleic acid 
to STRAIJDEDNESS : double 
(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DtIA (genomic} 

( i i i ) H Y POTI lETICAL : MO 

(Iv) AlITI -SENSE: NO 

(vl } ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(Ix) FEATURE: 

(A! HAHE/KEY: misc_feature 
(BJ TOCATIOH 1...531 

I:tl) SEQUEnCE DESCRIPTIOW: SEQ ID HO: 53 

CAAATAAACA TCAATGGCGA TATGAAACGG TTTTTGATTT TGATCGGCTT TGCACTGGCG 60 

GTAGCTTTCT CCGGTTTTTC CCAAAAGTTC GCTTTGGTAG ATATGGAATA TATCCTCAGG 120 

AATATTCCTG ACTATGAGAT GATGAACGAA CAGCTGGAAC AGGTGTCCAA GAAATGGCAA 180 

AATGAAATCG AAGCTCT«3A AAATGAAGCC CAATCTATGT ATAAGAAGTA TCAGAGCGAT 24 0 

CTCGTATTCT TGTCTGCTGC ACAGAAGAAA ACCCAAGAAG AGGCTATCGT AAAGAAAGAG 300 

CAGCAAGCAT CCGAGCTCAA GCGGAAGTAT TTCGGCCCGG AGGGGGAGCT GTATAAGAAA 360 

CGCTCCGATC TGATGAAGCC TATTCAGGAT GAGATTTGGA ATGCTATCAA AGAGATTGCC 420 

AAGCGTAACA ACl'ATCAGAT GGTGCTTGAT AGAGGTACGT CCGGAATTAT CTTTGCCAGT 4 80 

CCGTCTATTG ACATTAGCGA CCTTGTACTG AGCAAGATGG GCTTTAGCAA G 531 



(21 IHFOR/^ATION FOR SEQ ID NO: 54 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 510 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: douhle 

(D) TOPOLOGY: circular 

(ill MOLECULE TYPE: CltiA (genondc) 
till.) HYPOTHETICAL: NO 
(iv) AHTI-SEHSE: NO 
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(Vl) ORIGINAL SOURCE: 

(A) ORGAWISM: PORVPHYROMONAS GINGIVALIS 
(ix) EtATURE: 

(A) NAME/ KEY: misc teatuire 

(B) LOCATION 1 . . .510 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:54 



CGAATAAATA AACAACACGA 7UVTGAAGAAA TTTTrrCTCA 
TTGAGCCTCT TGGCACAAAA GGTGGCAGTG GTAAACACTC 
CCGGAACAAG TAGCTGCTAC CAAACAGCTC AACGAATTGG 
CTCAAGAGTA TGGACGATCA GTTTGCCAAA AAGACAGAAG 
TCTCTACTGG AGAACATCCG CAATCGTCGT CAGCAGGAAC 
TATCAGCAGT CATACCAAAC GATGCAGGAG GATTTGCAAA 
GCTCCTATCC AACAAAAGGT GGCTGATGCC ATCAAGAAAG 
GCCTACATCA TGGAGGCCGG TATGATGCTT TACACCGGAG 
GCAAAGGTAA AAGCGAAACT CGGAATCAAG 



TGCTTCTGAT 
AGGAGATCAT 
CCGAAAAGTA 
AATTTGTAAA 
TTCAGGATAT 
AGCGCCAACA 
TGGGTGACGA 
CTACTGCTAT 



GGCTCTTCCT 
TTCCAAAATG 
TCCCCTTGAT 
GGAAAAAGAC 
TCAAACTCGT 
ACAGCTTTTT 
AGAAAACTGT 
TGACTTGACC 



i2) INFORilATIOt^ FOR SEQ ID NO: 55 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDI-JESS : double 

(D) TOPOLOGY: circular 

(ii> HOLECULE TYPE: Df^^A (genomic) 

(iii) HYPOTHETICAL: NO 

(Iv) AtJTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( ix J FEATURE : 

(A) NA>fE/KEY: misc feature 

(B) LOCATION 1 . . .2T84 

(xi) SEQUENCE DESCRIp-riON: SEQ ID NO: 53 



ATCATGAAGG 
TTGTCATTCA 
AGGGTATCGG 
GTGCGTGGTA 
AAGGCCACGA 
CGCACCTTTC 
ATGGAGCTTT 
GTCAATACCC 
ACTACCTACG 
GGAAGCTACG 
GTTCGCTCTG 
CAGTTCTCCG 
ATTCGCTACA 
AGTGCCTACT 
AGTGCCAAAT 
GACGGACAGA 
AATATTTCGC 
CTGAGCGATG 
ACCTACTTCG 
ACGCTTT CGG 
CTGAACGATG 
TCCAACGGCT 
GTGCTGAACA 
GGCGTATCGG 
TCGGTAG<3AT 
GCCGATACGC 
AGCATGGGAG 
AACAAGGAGT 
GCTTTGGTAC 
AGGCAGACGC 
TCTCAGGGAG 
AAATACAACT 



AAGCTATTCC 
TTCTGCl'ATG 
GCTATGTGCT 
CGGGGACAGG 
CGGACAGCAT 
CGTCTCTGAC 
CGAGCGTGAC 
GAGACCTTCG 
CAGGAGTAAC 
ATGAGAATAT 
CACAGCAGGA 
CCGGAGCGTT 
AGCAACCGCA 
ATGGCAGCAG 
CGCTCTTGGG 
CATTCATGAC 
AAACTCGCTA 
CCAAAAAGTT 
GTGCCTTCAG 
CCTTCAACAG 
TGCAGCTGGG 
TGGGCATCGG 
TGGGTTACAG 
CACAGATGGA 
ACAACCTACC 
AGATGAGGGG 
GAGGTACATT 
TGCTCGTCAG 
TGCGTGCAGC 
ATAAGGATGC 
CTTTTCACAT 
TTACGGCAGA 



CCG7\AAGAAC 
CTGCCTGCTA 
CGACCGTGGG 
CGCAACGACG 
CACGATCGAG 
CAAGGACACT 
GGTACAGGCC 
TGTCAATGCA 
GCAGAACAAT 
GGTCTATGTA 
AGGTCTGAGC 
CACGGCCGAC 
GGAGAAGGAA 
TGCCGGAGCC 
CACTACGGAC 
GTACCGTTTC 
CAAGTTTGTC 
GAAGATCTTT 
CATGAACTTC 
TAACGAACGG 
GGCGGACGGA 
GCGCAATCAC 
AGGGGAGATG 
GAAAATAGCC 
TCACTCGGAG 
AACGCGCTTG 
TTCTCTCATT 
CCCACGTATC 
CGCCGGACTT 
CGAAGGCAAT 
TCTCGCAGGA 
GGCTTACTAC 



AAGTATATAA 
TGCTCTCAGG 
GAAAAGCCGA 
AATCTGAAAG 
TTCAGCTCCA 
CGGCTGAATG 
ACAAAACGCA 
GGCCCTACGG 
GAACTAAGCT 
AACGGAGTGG 
TTCGTCAATC 
TATGGCGACA 
GGAGCGGTAC 
TTCAGCCAAA 
ACGAAAGCCG 
AGCCCCAAGC 
CCTCAGACCC 
TTCGACGGTC 
GTGCCGGACG 
GAGACCTACG 
ACTGCTTCGA 
GAGCATGCGC 
AAGCTGAACG 
GACCATATCA 
ACCGTATTGC 
TCGGCATTCG 
CCGGGTATCA 
AGCGTGGGTT 
TATTATCAGG 
AACGTGGTTG 
GCAGA'CTATA 
AAGAGCCTGT 



AGCTCAACGG 
CAGCTATGGC 
TCCCGTTCGC 
GATACTACGA 
TGGGGTACCA 
TTCGTTTGGC 
GACTCAACAC 
GAGGGGTGGA 
CGCAATACTC 
AGGTTTATCG 
CGGATCTGAC 
AGATGTCCTC 
TCCTCGGGAT 
TCACGGGTGT 
AATACGATCC 
TGTCGGTTAG 
GTGA6ACGAG 
AGGAACAAGA 
ACAAACAGCG 
ATATTCAGGG 
TGGCTTCGGG 
GCAACAGGCT 
AGAAGCATCG 
CCGAATGGGA 
TGATGTACAA 
TACAGGATCG 
GAGCl'TCGTG 
ATTCTCCCGA 
CACCGTTTTA 
TCCTCAACGA 
CCTTCGAAAT 
TCAACATCAA 



TATATACAGA 
ACAAGGCGTC 
CGGAGTCAAA 
GTTTCGGATG 
AGGGGTAAGT 
AGAGGCCGAG 
GATGGAGCGC 
ATCGCTCATC 
GGTTCGCGGA 
CCCGCTGCTG 
ACAATCCGTA 
CGTACTGGAT 
GCTACAATCG 
ACGCTACAAG 
GATCTATGCG 
TTTCCTCGGC 
CTTCGGTACA 
TCGTTTCCTG 
GCATACGGTT 
AGAATACTTT 
CTCAGAGAAC 
GAGCTACCGC 
CCTGCAAGCC 
ACGGAGGGAT 
TAACCTATAT 
ATTCAACTTC 
GTGGAGCTTC 
AAGCAACCCG 
CAAAGAGCTA 
GAAGATCCGC 
GGGGGGGCGA 
CCCGTATATA 



60 
120 
IQO 
240 
300 
360 
420 
480 
51 0 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 

loeo 

1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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ATAGAGAACG 
ATCGATCTCA 
ATTATAAAAG 
ACTTACAATT 
ACCCTGCGGG 
TCGCCGGCCT 
CTCGACCCGG 
TACATAGGGG 
GTGTCGGATG 
AACCTGCGTC 



TGAAGATCCG 
AGCTCTTCGG 
CCCGTCAGAA 
TCTCCTTCTT 
CTGCACTAAG 
TTACCGCACC 

ATGACTccrrr 

CTGACCTCTT 
CCTACCAACA 
TCCTCGTCGA 



CTATCTGGGC 
CGAACTGGTA 
ACTGGATGGC 
CCTTCAGGAG 
CGGAGGATTC 
GGCCTATAAG 
TGCCGGCCGA 
CAATCTGTTC 
GCAATACCCC 
ATTC 



GAAAACATCG 
CCCGGAGTGG 
TACGGTTCTT 
TACGTGCCGG 
CCCCAGCTCA 
CGTGTCGATC 
AGCAAATGGC 
GACATGACCA 
GTACCGAACT 



GTTCGGGTTA 
ATTCGTGGCT 
TACCACTGAT 
GC AATAAACG 
ATCCGAGCAA 
TGGGGGTAAT 
TAATGGGAGT 
ACGTCAATTC 
ACCTGACACG 



TGCl'GCGGGT 
GACGGCTTCC 
GAACGCACCC 
CATCACAGCC 
AGGGCTTAGC 
GTACAAATGG 
AAAAGGGGCC 
TTACTACTGG 
CCGCCAATTC 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2464 



15 



20 



25 



(2) INFORMATIOW FOR SEQ ID NO: 56 

(1) SKQIIEMCE CHARACTERISTICS: 

(A) LENGTH: 2037 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMUEDNESS : double 

(D) TOPOLOGY: circular 

(ii) HOLECULE TYPE: DNA (genomic) 
(iiil HYPOTHETICAL: NO 
{iv| AflTI-SEHSE: WO 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(vi) ORIGINAL SOURCE: 

fA) ORGANISTS: PORY PHYROMOI-) AS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
iB) LOCATION 1...2037 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56 



CCTATCCCTT 
ATCCTGTTGG 
AAGGCACATG 
CGTACCCGAC 
TATCGTGCCG 
GGGTATCCGG 
TATAAGGAAG 
GCAGCTATCG 
TACCGGATAC 
GCACCCGATG 
AATAGCAGCA 
GGACGATGGG 
GTGCCAACGA 
GATTACGACC 
AACGGTTCGC 
TCGGCATCCG 
GATATTTATC 
GATATCAATA 
TTCGCTTCGG 
TCTACCGGCC 
TTCGGCTTGG 
GTCAAAGGTG 
TTCGAACTGC 
CCCATAGCCG 
GTGACTACTC 
ATGCTTGCCG 
AAGCAGAGTG 
TTGCAAAATA 
TTGGACGAAC 
CATGCCGACA 
TCCGTGGTGG 
TACGGTAAGT 
AAGGAAGGCG 
GTCTGCGACC 



TTCTTCTTAA 
TATGCCTTGC 
ATCGCCAAGA 
GAAAGCAGGT 
CCGGCAGACA 
ATTCTGTGGT 
CCGAGGTACT 
GTTTGGAGGG 
GGCGAGCTGC 
CTTCGGCTCT 
TAACGGGACT 
GACGTCCCGA 
TCACGCCCGA 
GTACGGTACA 
TCGTGGACAT 
GCGATTACCT 
GTGTCAAGGT 
CGCCGGGGGA 
ACGGACACGC 
AATGGCATGT 
CTGTGGAGCC 
TATTTTGTTC 
CGGCTATCTA 
GAGCCACTGT 
GTGACGATGG 
GAGCATCGGG 
AGACCTACTA 
TTTTCTATGA 
TGATTCGTAT 
GGAAAGGCCC 
ATTACCTCAC 
CTGTCrCCAA 
ATGTGCTCAC 
AACTGAACCG 



TATGTACAGC 
TTTTGCTGCC 
GTATACCAAG 
GGAGATGAAA 
AGCCAAAGCT 
ACTGCTCCGl' 
CTTCCGTGGA 
GTGTCTCTTT 
CGAGTGGAAT 
CTATTTCACA 
GAAACCCAAC 
TAGCGTGTCC 
TGGTAGTACC 
GATCTATTCC 
TATGCGC(1AT 
GTATTTCGTC 
GTCGGATCGT 
CGAAATGTTT 
CGGTCTGGGA 
AGTCAATATC 
TAAAGGCAAA 
CAACCGAGGC 
CACCGAGATT 
CAGGATCGTA 
CTCCTATAAG 
TTATTTGAAT 
TGTGGACTTT 
TTTCGATAAA 
CCTCACGGAC 
CGATGCTTAC 
GAGTCGTGGC 
GACGGTGACA 
CGAGGAATTC 
TCGTACCGAG 



GGACATCATA 
TGCAAGAGCG 
GCTGCCGATA 
GCTTATACGG 
TTGCGTGGCT 
TTGGCACAGA 
TATCTGGAAG 
GCCCGCCAGC 
TCGGCACGGG 
TCGAGCAGAA 
GACATTTATA 
GGAGGAATCA 
ATATATTATA 
GCCGCTCGGA 
TCGCTCCGTA 
AGCAATATAG 
TCTTATGGTT 
CCCTTCATAG 
GGACTGGATA 
GGACAACCGG 
AACAAAGAAG 
GATGCACGCG 
CAAGGTTATG 
GGCGAACGCG 
ATGAGCGTGC 
CAGTACGTAG 
TTCCTTGCAT 
GCTACTCTTC 
AATCCGGATA 
AACCTCGGAC 
ATAGCGGCCG 
3CCAAAATTG 
GTAGCACCTT 
TTCCGTGTGA 



AAATCCATTA 
TGAAGTTGAA 
TGTACAATAC 
CTTTCCGATC 
ATCTGAATGC 
CTTATCAGCA 
CTTATCCGAA 
AAAAGGAATA 
GCGACTTCGG 
GCAAAGACGA 
TCATCAAACG 
ACACTCCATG 
CGTTGGCGCA 
GCGGAGAAGG 
TGGCTGCTCA 
GCGGTAGCTA 
CACCGGAGAA 
ATGGGGATAG 
TTTTCAAAGC 
TCAATTCCTC 
AAGCTTTGCC 
GATGGCCGCA 
TGATGGACAG 
GCCCCGl'AGG 
AGGGCGATAC 
AACTCAAGAC 
CGCGTGAGAA 
GCCCCGAAAG 
TTCGGATCGA 
TATCTGACCG 
ACAGGCTTAC 
CCGAACGGCA 
TGACCGAGGA 
TCGAAGAAGA 



TCCTTTTCTT 
AGATGCGGAG 
ATTATACAGG 
CGGTGAAAAC 
CAGACGCTAC 
AGGAGGTAAC 
AAGTrATT-l-r 
TCCTACACGT 
CCCGGCCTAT 
CGGTTTGGAT 
AGATGCACAA 
GGATGAAGGC 
GCAAGGAGCC 
CGGTTGGAGC 
TCCCTCTATG 
TGGCGGCAAG 
TTTGGGGCCT 
TACCCTTTTC 
CACGCTGGAC 
TGCCGATGAT 
GGACAACGGA 
CCTCTTCCAT 
A6AAGAAAAT 
ACAGGGATTC 
TCGCTATGTA 
CGATACCGCC 
AGCCGAGGGC 
CATGAAGAGC 
ATTGGGTTCG 
CAGAGCCAAA 
GTGGAAAGGC 
CGATTTCCTG 
GCAGCAGTCA 
GTTGCGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
114 0 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1000 
3 860 
1920 
1980 
2037 



75 



{21 INFORHATTOH FOR SEQ ID HO: 57 

ti) SEOUEIICE CHARACTERISTICS: 

(A) LENGTH: 2316 bare pairs 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(BJ TYPE: nucleic acid 

(C) STRANDEDI-fESS; double 

(D) TOPOLOGY: circular 

{XX) MOLECULE TYPE: DHA (qsnomic) 

(iii) HYPOTHETICAL: NO 

(iv) AHTI-SEHSE; HO 

(vi) ORIGINAL SOURCE: 

(A) 0RGA14IS11: PORYPHYRa-JOHAS GINGIVALIS 

(ix) FEATURE: 

(A) IWIE/KEY: misc_feature 

(B) LOCATION 1 2316 

txl) SEQUEMCE DESCRXPTIOH: SEQ ID HO: 57 

CCTGCACAGC CGAAAACATA TTGCATACGA TACTITCGGA GAGAGGTATC TCCGAGGAGG 60 

CAAACAGAAA GGACATTGTA TGCGGACAAG ATGCGCCGGC ATATCGTCAA GTGCTGTCCC 120 

TCTGCCTCCA TACTCTTGCA AGAATACTCC ACAACAGAAT TATfCATTCA TCACTTTAAC 180 

ATATCAATAA TTATGAAAAA GTTTTTCTTC GCGCTACTAT CGATTGGTAT TTCAGCGCAG 2^0 

GCTTTTGCCA AGACGGACAA CGTCCCGACA GATTCGCTAC GAGTACACAA TCTTCAGACC 300 

GTCACGGTCT ATTCTACACG CACGGCCGTA CCTCTGAAAA AGATACCGGC CAACATGGAA 3C'0 

CTCATCTCAT CGCGCAACAT CAAGCAGTCC GGCTTTAACA ACATGACCGA CATCCTCAAG 4 20 

ACGCAAAGTT CGCTCGATGT CATACAATAC CCGGGCTTTA GTTCGAACAT CGGTATCCGC 4 80 

GGTTTCAAGC CCTCCGGCAA GTATGTAACC GTATTGGTAA ACGGCATCCC TGCGGGAACG 54 0 

GACAATATCT CTACGCTCAA CACGAGCAAC ATCGAACAAA TCGAGATCCT CAAAGGCCCG 600 

TTCTCTTCCA TCTACGGCAC CAATGCCATG GGCGGTGTGG TGAACATCAT CACCCACAAA 660 

TCCTWVGGACA AGATCCATG6 CAACGTTTCT CTCTTCGGCG GTAGCTACCA GACCATGGCC 7 20 

GGATCATTCA ACTTGGGTGG CCGCTTCGAG GATATTTTCT CATTCGATCT TAGTCTGGSC 780 

TTGGACAAGC AGAACAAGGA CrATAAGACC GGATCAAACA ATTTCCTATC CCPGAGCAAA 84 0 

CTGGAAGAAG CTATAGTAGA TGTAAATGCT ACCAAAAACA AGAAAATGAA GGGGAGCGAC 900 

TATACTGTAG CAACGGGACG TCTGCGTTTC GGTATCGACT TCACGCCCGA ATGGTCGCTG 960 

AATGTGTATC AAAACGTAIT CCTCGGAGAT GCGATCCCCG TAGGAGGATC TATATGGGGC 1020 

GTTTACGGAG AATCCAAAAA AAATCT'GAAT CGTTCTTCGA CCTCTTTCGA GCTGCTCGGC 1080 

AAACATGGCT GCCACACGCT TCAATTCTCC CCCTACTTCA ACATAGAGAA ATCGGAGAAC 1140 

TATAACAATG CCGATCCCAC CGGTTTCATC AACTACAAAA GCGACTACTA CACCTATGGT 1200 

GCCCTACTCC AGGACAAGAT TTCCTTTGGA GGACAAAATA TCGTACTCGG TGTCGACAGC 1260 

CGAAACATGA CGATGGAGTC AGAAAGATTC GAGCAGGCAG GAGTGAATAC AAAGCCATAC 1320 

AACCCCGGAT ATGCCACGAA CAATATCGGT TTGTTCGGAC AGGCCAATTT CTACCTGCTG 136 0 

AACGATGCTC TATCGATATC TGCCGGTGCA CXJTGCCGACT TCATGTTCTT TGACCTGAAA 14 40 

GCGAACGAGT ATCTCAACAA TGAAGCCAAA CAGGAAACTC ATAACGTAAT CAATCCGAAT 1500 

GTCGGAATCA AATATGAGTT TGTGAAAGGC CTTACAGCTC ATGGTACATT CGGTAGTGCA 1560 

TTCAGTGrrrC CCGATGCTTT CCAAAAAGCA GGCCAATACG TAGGCCCGTT CGGCACGACC 1620 

ATAGGCAATC CTGACCTGAA ACCCGAAAAG TCCATGACCT GGGACTTCGG TATCGGATAC 1680 

AGCAATGCAC GCTGCGGGAT CC'AAGCCGAC GTAACCTTAA CCTATTTCCA CACCGACCAC 174 0 

AAAGATCTGA TCTTGTCCAG CCCTGACTAT GCTAATAATA TCACCACATA CATCAATGCC 1800 

GACAAGGCTC GTATGAGCGG TATCGAGGCC CTTn'GTCTT ATGACTTCGG CAGCCTCTTT 1860 

GCCAACAAGT TCTCTCTCCG CGCATTTGCG AATGCCACGA TCATGCTCAA TTCCGAGATG 1920 

AAGAAAAGCC AGACCGATGC CCCTTGGAGC GAAATGTACT ACGTTCGCAA GCAGAACATC 1 990 

ACCrrCGGTA TCGAATATCG TGGCAAAGAA GGACTTGAAG TGATGCTCAA CGGTCGCTTC 2040 

ATGGGACGCA GGATCGAGCA AAACTGGTAT GCTTACTACC CCGAAGTTCG CCCCGAACTC 2100 

CAGCAACTGC TTGCAGCAGA AGAGCCTGAA TTGGCTGCTC AGGGACTGCT CCCTCATCCG 2160 

CAAGCAATGG TGTTCAATGC CTCTGCTTAC TACCACATGA ACAAGTATCT CACCTTCGGT 2220 

GTGAACTTGA ACAACATCTT GGATGAGCTT TATACGGAGA AAGACGGCTA CCACATGCCC 2280 

GGACGTAACA TCATGGGTAA GGTTATGGTC AACTTC 2316 



(2) I1IF0RJ1AT10I4 FOR SEQ ID NO: 58 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1452 base pairs 
(BI TYPE: nucleic acid 
(C) STRAimEDNESS: double 
(DJ TOPOLOGY: circular 

111) MOLECULE TYPE: DHA {genomic) 

(ill) IfYPOTKETICAL: NO 

(iv) AIJTI-SEDSE: HO 

Ivi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



tF«il -rfF "^11 -rfxit), 

11, iL J Q icr a tL;ii 
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(A) ORGANISM: PORYPHYROMONAS GIHGIVA1.IS 

(ix) FEATURE; 

(A) NAME/KEY; misc_feature 
tB) LOCATION 1...14 52 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUEHCE DESCRIPTION: SEQ ID NO: 58 



GGCCGGGCCT 
GCATCTATGA 
TTTGTATCG-S 
ACGGATACCG 
ACAGTCywVC 
GCCGATCTCT 
GTCTTATATA 
CACAATACGC 
AGCATTGCCA 
ATCGATTTGG 
TATGGCGTAT 
AAGTTCGATC 
AACATCGAGC 
AAGGTCCTGA 
TATAAAGAAC 
TCCCTGCGTC 
TACAGCTTCC 
GACATCAAAT 
CTGTACATTC 
TCGATCCGTC 
AAGAACCAAA 
GTCCGAAGTG 
ACTCTCGTCG 
CAGGCCATAT 
ATTCCCGAAC 



CTTCCCCITA 

ACAGGTTTTC 

CGAGTCGGAC 

CCGTACTCTC 

TGGCCGATAT 

TCCCGAAAGT 

TAGATATGCC 

AAGGAGGGGT 

TGACCGCiAGA 

TGGCAGAGGT 

TCAAGCGCAG 

GTGGACTTGT 

CTAACCTCTT 

TGAGCATGGA 

AAGTCTATAC 

AGCTCGATAT 

TGCCTACACT 

TCTGGGGCGA 

CTATATTCAA 

AGATCGATCT 

ATGACCGTCT 

CAGAAAAGGG 

AGCTCAACGA 

TCGACTTTAT 

AA 



CAGGCAAATG 
AAATCATTGG 
TGTGGCACAA 
CGAAGCCGAT 
GGATGTGCGC 
AGACCTCAAT 
CCGTTTCAGC 
GAACGTCTCC 
ACAGCTCGAT 
GAAGAAGGCT 
CTATGACAAT 
GGCCGAGTAT 
GCAAGCGCAG 
AGTGGAAACT 
CGGCTATTTT 
ACAGCGTCGT 
CAATCTGGGA 
GGGACAACGC 
TGGAGGCAAA 
GCAACGACGC 
GCGTACCTGT 
CTATCAGATA 
TGCCGATGTG 
GACCGCAAAG 



GATGGCATAC 
CCCTGCATCC 
AACGCCTCCG 
GTGCTTCGGA 
AAACAGGAAT 
GGCGTTTACA 
AGTAGCGAAG 
ATGCCATTGG 
CTGGCTCTCG 
TACCTCAGTG 
GCTCTGGCCA 
GATAAGATTC 
AACTCCGTAG 
CCGATCAGAC 
GCCGCCGATA 
CTGGCTGTCA 
GGGCAGTACA 
TGGACGCCTT 
CGTCTGTACA 
CACATAGAGC 
ATGCAGAGAT 
GCAGAGAAAC 
GCTCTTTTGC 
GCCGAATTGG 



TAAACGATGA 

TCGTGGGGTT 

AAACGACGGT 

TCGCTCTTAG 

ATGCACGTAG 

GCCATACGCT 

GTATCGAAAT 

TGTCGGCACA 

AGAAAGCTCG 

TATTGTTGGC 

ATTATAAGAA 

GAGCCAATGT 

CCCTTGCTCT 

TCTCCGGTTC 

CGCTTATTTC 

GTGCAGACAA 

CCTATTCGCT 

TCTCCACCAT 

ACGTGAAGCA 

AATCCATCCG 

TTGTGGCCTC 

GCTATCAGAC 

AGGCTCGACT 

ACAAGATGAA 



ATATAGACAA 

TCTACTCTGG 

ATCGTACGAT 

TGAGAATGCC 

GGCAGCACGT 

AAAGAAGCAG 

GGGGCGTACA 

GCTTTGGAAA 

CAGCTCCCGA 

CGACGACTCT 

CATATCCGAC 

ACAGGTACGC 

CTGGCAGCTC 

ATTGTCCGAC 

CAACAACTCC 

GCTGAACAAG 

CAACAGCAAC 

ATCGCTCAGC 

AAGTGCTTTA 

AATGGGAATC 

GGAAGAGGCT 

AGGCGAAGGC 

CAATTATAAT 

CGGCATGGGG 



(2) INFORMATIOI^I FOR SEQ ID NO: 59 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1620 base pairs 
(Bl TYPE: nucleic acid 
(C) STRAI4DEDNESS: double 
(O) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(iil} HYPOTHETICAL: NO 

(iv) AI-iTI-SENSE: NO 

(vi) ORIGINAL SOURCE .- 

(A) ORGAI^ISM: PCRYPHYROt>IONAS GTNGIVALIS 

(Ix) FEATURE: 

(AJ MAHE/KEY: raisc_f eatur- 

(B) LOCATIOtJ 1-..1620 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:S9 



TTTCATAACT 
TTGGCGAAGA 
CACCTCTCTG 
GTGCGCCGAA 
ATTCGTATCG 
CTTGTGAGTG 
GATAGCTTCC 
GGACGCCGCT 
GATACGCTGA 
GAGCTGTCGC 
GATGGCGGAT 
ATAGCGATGG 
GTCATGCG'3C 
GGA(3AGGGGC 
CTCCTGATGA 
CGCACTTATA 
ACCCATTTCA 



TTGACTTCCT 
CTCGACGTTA 
GGCAGGAAGT 
CGCTGGTGAA 
GATTCTCTCT 
ATTCTTTAGC 
CGATCTCAAA 
TCTTTTATTT 
CTGCCCGTGT 
CTGCCTCCCG 
CGCCGAACTA 
CCGACAGACG 
GCAGTACGGA 
GCAATCATGG 
TCTTTGTCGG 
TATCGGAAAG 
TCTTCGGCAA 



AAACGGTATA 
TGTCTGCCTT 
CACTATGTGG 
6GTAGCCTTA 
TCCGAAAGAA 
GGl'GCGCGAC 
GGAAGATCGT 
GGCTGCGGAT 
GGAAGAGGTG 
TCGTCTGTAT 
TCGTATCCCG 
AAAATATAAT 
CGGAGGGAAA 
CTTTGGCGAT 
TGGAGTAGGC 
TCGGGACGAA 
GGATTGTGCC 



AAATTGTTTa' 
GTCGGTTTCT 
GGGGACAGCC 
AGTGAATCCC 
ACGGAGGAAA 
TTGCCGGACT 
ACCACAGCCC 
ATAGGGCCTG 
GCTGTCGATG 
AGGGGGTATG 
GCCATTTTGA 
CAGACGGATC 
TCGTGGAGCG 
GTAGCCCTGG 
CTGTGGCAGT 
GGACTGACTT 
GATCCGGGAC 



CGATGGCAAA 
GTTGGCTCAT 
ATGGAGTGGC 
TTCCTCCGGG 
AAGTCACCGC 
ACAAAGGGCG 
TTTCTGCGGA 
TTGCTTCTTT 
GCCGCCCTTT 
AGGCCCTCm* 
AAACGGCTAA 
TGCCGGAGGA 
ATCCCAGGAT 
TGCAAACCCA 
CTACCCCCGA 
GGTCGCCTCC 
GCAGTCGCTG 



TAATACTCTT 
GGCGATGATG 
GCCGAACCAA 
TGCAAAACAG 
CCTATATCTC 
AG'rCTCTTAC 
TTCGGTAGCC 
TTCCCGATCC 
GCCGTTGAAA 
TGTACCCGGT 
TGGAACACTC 
TATAGATATA 
TATCGTACAG 
AGCAGGAAAG 
TCGTCCTCAG 
TCGGGATATA 
GTTGGCCTCC 



60 
120 
180 
240 
300 
360 
420 
460 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1 260 
1320 
1380 
1440 
1452 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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TTTTGTGCTT CG<3GACAAGG GCTTGTGCTG CCATCCGGTC GTATCACGTT TGTGGCTGCC 1080 

ATCCGCGAAT CAGGGCAGGA GTACGTCCl'G AACAACTATG TCCTCTATAG CGACGATGAG 114 0 

GGCGATACAT GGCAGCTTTC CGACTGTGCA TACCGCCGTG GCGATGAGGC AAAGCTTTCA 1200 

TTGATGCCCG ATGGCAGGGT ACTGATGAGC ATACGCAATC AGGGACGGCA GGAGAGCCGA 1260 

5 CAGCGTTTCT TCGCTCTCTC CTCCGACGAT GGCCTTACTT GGGAGAGAGC CAAGCAGTTC 1320 

GAGGGCATCC ATGACCCCGG CTGTAATGGA GCTATGCTTC AAGTGAAAAG GAACGGAAGG 1380 

GATCAAGTGC TGCACTCCCT GCCTCTOSGC CCGGATGGGC GTCGCGATGG AGCTGTCTAT 14 40 

CTCrrCGATC ATGTCTCCGG CCGCTGGTCC GCTCCCGTTG TTGTCAATTC AGGATCGAGT 1500 

GCCTACTCGG ATATGACTCT GCa'GGCGGAT GGAACGATCG GTTATTTCGT CGAAGAGGGC 1560 

10 CATGAGATCT CATTGGTTTT CATTCGGTTC GTCCTTGACG ATCTCTTCGA TGTCCGGCAA 1620 

(2) INFORMATION FOR SEQ ID HC:60 

15 (l> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 9 base pairs 

(B) TYPE: nucleic acid 
IC) 3TRAMDEDMESS: double 
(D) TOPOLOGY: circular 

20 

lii) MOLKCULE TYPE: DHA J genomic) 
tiii) HYPOTHETICAL: NO 
25 (iv) AUTI-SEi^SE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISH: PORVFllYRa>lOttAS GIIIGIVALIS 

30 lix) FEATURE: 

{A» HAME/KEY: raise feature 

(B) LOCATIOM 1. , .879 



35 



(xil SEWENCE DESCRIPTION; SEQ ID 110:60 



AAGTCTCCGA GCGATTCGGC TTTGCCGTCG GTATGGAGCG CGAATACAAT ATCTGGACTC 60 

GTCGGTGGGA AACGCATTAC TTTGCTTATC CTGTATTCTA TGGCGATAAG AAGTAGTAAT 120 

ATAGAATCAA TACAGTGCTT TGTGATGAAA AAAGAAAAAC TTTGGATTGC GATCGTCGCC 180 

GGTTTGGCl'T TCGTATTGGG CCTTTATGCT CTTGGCCGCA GTGTCGCTCA GCrACGCCGC 24 0 

40 TCTCAGCCTT CGGTGACTGT GACCGGTATG GCCGAGCGTA ATTTCAAATC CGATCTGATC 300 

GTTTGGACTG CTTCGTACCA GCTCCAGATG ATGGATCTCG AATCGGCCTA CAAGGCTTTG 360 

AAGGAAAAAC AGATATTGGT AGCAGACTAT TTGAAAAACA AGCAGCTGCC CGATTCGTCT 4 20 

TATATCTTCT CAAGCGTAGC CATCTCTAAA GAATACAACT ACTATTACGA TCCTCGGCAG 4 BO 

GAACAAAACG I'CAGGACCTT TGCCGGGTAT CTGCTCAGCC AGACAGTTAC GGTSACCTCA 5 'JO 

45 CAGGACATCG AACATGTGGA GAAAATATCT CGCGATATAA CGGAGCTGAT CAATCAGGGG 600 

GTAGAGATTA CCTCCGACCG TCCGGCCTAT TACTACACCA AGCTCAATGA TCTGAAGGTC 660 

GAGATGCTGC GCAATGCCTC CGAAGACGCT TTCAATCGTG CTTCGGTCAT TGCGGAGGGG 720 

AGCGGTTCCT CCGTGGGTAA GATGCTATCT TCl'TCGATGG GCGTGTTCCA GATAGTGGGG 7 80 

CTCAACTCGA ACGAAGA'iTA TAGCTGGGGA GCTTCGTTCA ATACGTCTTC CAAGATGAAG 04 0 

50 ACGGCAAGCA TAACGGTTAA GGCTTCTTTC GCTTTGAAG 87 9 

(2) INFORI-IATION FOR SEQ ID HO: 61 

55 (i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 840 base pairs 

(B) TYPE: nucleic acid 

(C) STRAimEDWESS: double 
(0> TOPOLOGY: circular 

60 

(ii) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

65 (iv) ANTI-SEHSE: NO 

{vi> ORIGIWAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GIHGIVALIS 

70 UX> FEATURE: 

(A) NAIIE/KEY: misc_£eaturf» 

(B) LOCATION 1. . .840 



75 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 61 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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GGGAAACTCC AAATGAAAAA AACAATTGCA 
GCCGTCGGCl* GTAAGAAAAA TGCTGACACT 
TTGTCCATGG GTATTTTGTA CGGACAGGAT 
CAAGGCCAGC CGATTGATTC GGTAGCTTTC 
5 ACGCGCTTCT CGTACAATCT GGGAGCCATC 
CATGATTCCA TCGATATCGA CAAGTTCTAT 
ACCGTATCTA TCGCCATGAA GCCTGCAGAT 
AAAAAGCAGC GAGAAAAC/A TATGAAGCAG 
TACATCGATA CCTTTAAAAA AGAAGATGGT 
10 AAGACTCTTC AGGAAGGTAC GGGAGCTACT 
TATGTGGGTA CTCTGGTCGA TGG^rAAAGAG 
GCCGTTACCG GTGTGATTAA AGGCTGGACG 
AAAGTTCGCG TGGTAATCCC ACAGGAGCTG 
GAACCGTTCT CTACCCTGAC GTTCGAGATG 

15 



ATTATCGCCT CAGCCCTCTT GGCTTTAGGA 60 

ACCGCTGTCA GTGAAAAGGA TAGCATAGCC 120 

TTTGCCAATC AGTTCGAAAr GTCCCGCTTG IBO 

TTGGACGGTT TCAAATATGG TATCGATACG 240 

TATGCTTCCA ATATAGCTCG TCAGCTGGCT 300 

GCAGCCATGC GTGCGGCTCT TCTTAA-^GAC 360 

GCACAGGCTT TCATGCAACG AATCCAAGCC 420 

ITTGGCCAGA ACATCGAATUV GGGTAATG7KA 4 BO 

GTAACTGTTA CGACAACTGG TCTGGCATAC 54 0 

CCCTCTTTGG CCGATACTGT ACGTGTCAAG 600 

TTCGACAAAA ACGAAGAAGG AATCGAATTT 660 

GAGATGCTCC AACTCATCAA GGTCGGTCAG 720 

GCn'ATGGGG AGACCGGCAA CTATACCATC 760 

GAACTTATCG GGATCAAGCC CGGGAAAAAG 940 



(2) IH FORI lAT roil FOR SEQ ID HO: 62 



|i) SEQUENCE CHARACTERISTICS: 
20 iA> LENGTH: 2409 base pairs 

(B) TYPE: nucleic acid 
to STRAtlDEDHF.SS: double 
(Dl TOl?OLOGY: circular 

25 (ii) MOLECULE TYPF.i Oil A (genomic) 

(iiij H*i'PCTHt:TICAL: NO 

Uv) ANT T- SENSE: HO 



30 



tvl } ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRWONAS GIHGIVTUJS 



(ix» tllATURE: 
35 (A) l>IAt'lE/KEY: misc feature 

{B( UXZATION 1,,.2409 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62 

40 TGTCGAAAGC AGAAGGTCTC AAACCCGAAG AGGAGCCGGT ATCCGTGCAG ACGGATATCA 60 

TTCCGACAAA GCGATAAGAA TCCGATGAAA GTATTACGGC AAGTATTCCT CCCCATCCTT 120 

TTTGTCCTAC TGACAGGTGC CTGCTCCACC ACAAAGAATC TGCCGGAAGG CGAACAGCTG 100 

TATATCGGAA TGGGCAAGAC ACAGATACTC CGGCAGGACA AGAGCCACGC CGGCCAACAG 24 0 

GCTCTGACCG AAGTGGAGAG TACACTGAAA GTTACACCCA ATGGAGCTAT TTTCGGCAGT 300 

45 GCAAGTGCCT CCTTACCCAA GATACCATTC GGGCTATGGC TATACAACAG CTTCGTGGGG 360 

GATTCCACTG TCATTTCOAA ATGGATATTC GACAAGTTTG CAGCCAAGCC GGTTTTCATC 420 

AGTCAGGTCA AATCCGATAG CCGGGCTAAG GTGGCGACGA ACATCCTCCG CGAACACGGG 480 

TACTTCGATG CTAAAGTAAA AAGCAGTGTG ACCACTCTCA AAAAGGACTC GCTCAAAGCC 54 0 

AAAATCTCCT ATACGGTGGA TATGGOCTCT CCTTATCATT ACGACAGCAT CATTCCCTTA €00 

50 CCGATCAGCA CTTTCCCCGA CAGCATTCTG GCTTACAGGC AGACTCCGTC TTTGATCAGG 660 

AAAGGAGACC AGTTCAATTT GGCAAAGCTG CACGAAGAGC GTCAGACCAT CAGTGCCCTG 720 

CTGAGAGACA ATGGTTACTA CTACTTCCGC CCACAGGATA TTATCTACGA AGCCGATACC 7 80 

CTCCTCGTAA GAGGTGCCGT ATGCCTGCGA GCCAAGCTCT CGGAAGATAC TCCACCCCAA 84 0 

GCCATGCGCC CGTGGAGGAT AGGGAAACGG ACAGCAGTCC TGCTCGGAAT GAACGGAGAA 900 

55 AGCCCGACAG ACTCGCTCGA AGTGGAGGAT ATGAAAGTCC TTTACTATCG TAAAATGCCG 960 

GTTCGCCCCA AGATTTT6GC CAAACGCTTT CGTTTCTTCT CCGGCAATCT GTATCGGCAG 1020 

AAAGACGATG AGACGACACG CAAATCCTTG GCTCGTTTGG GAGCCri'CTC CGTTATCGAT 1080 

CTCAATTTTT TGCAACGCGA TTCCATTTCC GGCCTTTTGG ATGTGCGACT GCTAACCACC 1140 

CTCGACAAAC CTTGGGATGC ATCATTAGAG ACCrTGTTCA CGAGCAAAAG CAATGACTTC 1200 

60 ATCGGTCCCG GACTGAATTT TGCTCTTCCT CGGCGCAATG TATTCGGCGG AGGAGAAAAT 1260 

CTTTCTTGGA ATATCGGTGG ATCGTATGAG TGGGAGACCG GCAATCGTCC CGAAAATAGC 1320 

AGCAATCGGC TGATCGATAT AAATTCGTAC AACATGAATA CGGCCGTGAA CCTCTCGTTT 1380 

CCCTCGATTG TATTTCCCGG TCTGCTGGAT AAATACTATT ACTACCCCAC GACTACGACT 14 40 

TTTCAGGCTT CTGCCACCGC GCTGAACAGG GCACACTACT TTAGCATGTA CTCTTTCGGC 1500 

65 TTTTCGACCA CCTACGAATT TCAGCCCTCC AAGGAACACC GGCAT6CTAT TTTCCCGCTC 1560 

AAGCTCAACT ACAACCTCCT GGGGCATCAG ACAGAAACTT TCCAGGCCAT TACGGCGAAC 1620 

AATCCGCCCC TGCTGCTCAG CCTTCAGAGT CAGTTCCTTG CTCAAATGGC GTATATCTAT 1680 

ACGTTCAACA AATCCGTTTC AGAGAAAAGT CCTCATCATC TTTGGATGCA ATTCGGACTA 174 0 

TCCGAGGCAG GCAATCTCCT GAATCTGATC TATCTGGCAG CCGGCAAGAA GTACAGCGAC 1800 

70 ACCAAGAATT TCGTCGGCGT CCCCTTCTCT CAGTTCATCA AAGCCACGGG AGAACTGCGC 1860 

TATTCCTATA CCATACACCG CAATCAGTCA CTGGCAACCC GTTTCGGCAC AGGCGTGATA 1920 

TATAGO^ATG GCAATATGCG AGTGGCACCC TATAGCGAGC AGTTCTATGT AGGCGGTGCC 1980 

AATAGTATCA GAGCTTTCAC CGTCCGTAGC ATCGGCCCCG GACGGTTCAA TCCGGATTCC 204 0 

GACAATCAGT ATTCCTATTT GGATCAGGTG GGCGAATTCA AACTCGAAGC CAACGTGGAA 2100 

75 TATAGAGGCA AGCTTTTCGG GGATCTCCAC GCAGCCGTTT TCCTCGATGC GGGCAACGTT 2160 



SUBSTmrrE sheet (Rule 26) (RO/AV) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AGC^???cc ?^?^St cg^^J^? ^^S^^??"^ r.rccGAA<.r ggcatcogto 

TTTCTCGTGG TTCGTGTCGA rrSS^^S ^CCGGTGTCG GCCTTCGCTA CGATCTGGCA 
AAAGG-rrA-? ACWA?TCC l^rn^^^ GGTCTCCACC TTCCTTACAA TACGGGTAAG 
TATCCCTTC ^^^^^^^^ ACGCTTTAAG GATGCCATCG GTTTCCATTT GGCTGTCGGC 



(2) XNFORHATIOM FOR SEQ ID NO: 63 

U) SEQUEWCE CHARACTERISTICS: 

(A) LENGTH: 23 4 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIJDEDNESS: doubl- 

(D) TOfOLOGY: circular 

tii} MOLECULE TYPE: W/A (genomic) 
(ill) HY POTKETi CAL : NO 
(iv) AIJTI-SEJ'ISE: HO 

(vi) ORIGIMAl, SOURCE: 

(A) ORGAJJISr-l: PORVPHYRaiOlIAS GTHGIVALIS 
(ix) FEATURE: 

(A) HAJIE/KEY: mif^c featur- 

(D) LOCATION 1...234 9 



2220 
2280 
2340 
2400 
2409 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63 



TCTCTCTCCC 
ATTGCCGOCT 
GACGG'i'AGCA 
CCGGAAGATA 
CGCTGGCTAC 
TCGCTCCGGA 
GCCAACCGTC 
GTGGTAGACA 
CGTTATTATA 
CTTGGCAATT 
CCCATCGTAC 
TGGAAGTTCT 
GGTACGAAAT 
ATCGGCAGGG 
GAGCTGCCAC 
GGAC&-1TATA 
TTCTGCGAGG 
AACGTGAATA 
TCTCGCCTTG 
GAAGTCCTCG 
QATCGCAAl-T 
GCCATTCGCA 
CCTCGTCTCC 
GAATGGAAGA 
GCTCAACTCA 
CTGCTGGATG 
CCGCCTACGA 
CTGAACTATA 
TTCAGTATCC 
AAAGACGAAC 
GATCTCGATC 
G'jTTTCGGAC 
TTTGCCGGAG 
ATGAAGATGA 
AATGTCGAAT 
GGCAATGTCT 
CGCTTCTACA 
TTCCTTGTGC 
AAATGGGCTA 
TATCCGTTC 



CGTATATCCG 
GCTTGCTCCT 
GACTATTAGA 
Tl-CGGGACTA 
TGCGCGTCTA 
AAATGGCCGA 
TGGCAAAGGC 
CCGGCTTGTA 
TACGCAATAT 
CGCTTCCTTC 
TCGATGAAGA 
CCGCCGAGGA 
CTGCCGATCT 
TATTCTTTCA 
GTATCGATTC 
TCCGGGCATC 
ATGATGTGGA 
TCCGAT'ITGT 
TGGACTGCTA 
GCACCAATTC 
TGTTTCGTGG 
AGGGTTCGCA 
TCTTCCCATT 
TCGGGTAT7UV 
ATTATTCATG 
TCGATTATCT 
CTGCACTGTA 
CCACGGCTTC 
AGACAGCCGG 
ACGGGTTGTA 
TGGCTAAAAC 
TGGCTTTCCC 
GATCGAACAG 
CTCCGGACAA 
ACAGGACAAA 
GGACGATAAA 
AGGAAATAGC 
GGCTGGATGC 
TCACACGCCC 



TTTTCCTATG 
CATGCTTGCT 
CAGGGTAACG 
TACCCTCCAG 
TAGCAGCTCG 
ACCGCCTGTC 
GATGGCCGGC 
CAAGAAAGCT 
GGCTTTGGAT 
GGCATACAAG 
GAGAAAGGCG 
TGTTTATTAT 
GAAATTAGTG 
TGCCGATTAT 
GATTTCGCGT 
GGCTCTCACG 
ACGCTCTTAT 
GGAGCACAAT 
TATTCTTACC 
CGCTGGAGAC 
GGCGGAGATG 
CAGCTTCATG 
CATITCTGAC 
TTACCAGACA 
GCAGACCTAC 
CCATCTCCCG 
TAACTACACG 
GTCCATGGAG 
CAACCTGCTG 
TAAAATGTTC 
CGTTCTTCTC 
TTATGGCAAT 
CGTTCGCGGC 
GACCTTCTTC 
GCTGTTCTGG 
GGAGTATGAG 
TTTGGCCTAC 
CGGACTGAAA 
AAACCTTTCT 



TCCTCCCATT 
TCCTCCTGCT 
ATCGCAAGCG 
CAACCCAATT 
AATCCGAACA 
CTCATCGATT 
GATGGCTl-TC 
CGCATTACTT 
GTGAAGAATC 
GTCGGGATCA 
ATAGCTCGTC 
GAAGCAGATA 
GTCAATGGCA 
GATCCTCT CG 
GGCGATTACA 
CGGTCGGTGT 
ATCAAGC-i'GA 
GGTAAGGATG 
GTTCCGGCCA 
TTCGGGGCGG 
TTCAATATCA 
GAATATGGGG 
GAAACGCGCC 
CGTCCGGAGT 
CTGCACAATC 
TACATCGATC 
GAGCAGTTTA 
CGTACCGTAT 
CAAGCCATTT 
GGTCTGCACT 
GAAAAGGACA 
GCTCGCCATA 
TGGAGTGTCC 
GATCAGATGG 
AAGTTTCGCG 
AATCAGGAGG 
GGTCTGGGGC 
GCCTACGATC 
TCCAATTTCG 



CCGTTCGGTA 
CGGTCACCCG 
AAACGGGCAG 
ACAGACl'GTT 
GCAACAGCTG 
CTGTCCTCAC 
TCGATGCTAC 
ATCTGATTCA 
CACTCCTTCC 
GCGAGGGTTC 
ATATGCGCAA 
CTACCGnTC 
TCGGGCGTTA 
AATCGGACTT 
CTGTTTACTA 
CCGTTACACC 
ATGCGCTCCC 
AGATTGCTCl' 
AGAGCAAATC 
CTTTGTCTCT 
AACTCAAGGG 
TGGAAAGCTC 
GGCGGCTACG 
TTGATCGGGT 
GTCTGCGTCA 
CCGACTTCGC 
TCCTCGGCTC 
CCAATCCTTT 
CTTATCTGAC 
ATGCTCAGTT 
ATACTTTGGC 
TACCCTTTGA 
GTACCCTCGG 
GTGATATTCG 
CAGCAGCTTT 
ACGGTCTCTT 
TTCGTCTCGA 
erCAGCAGAC 
CTTGGCACAT 



TCTAATCGGC 
TTATGTGCCG 
TATCGCTCTG 
CGGGATGACT 
GTGGAACCGT 
CGATCGTACT 
TGCTCGTGCC 
GCCCGGAAGC 
TCCCGTTGCG 
TCCCTTGTCG 
CAACGGCTTC 
AGGAGGATCG 
TCCATATCGG 
CAGAG^ITCAG 
TGGGAGTAGC 
GGGAGCTTTT 
TATCGTTCGG 
GGCGGATAGC 
GTTCGAAGCC 
CGGTTTCACC 
TGCTTACGAA 
GCTCCGTTTC 
CCCATCCACG 
GATTCTCTCC 
TACGATCCGC 
CCAATCCCTT 
GGCATATATA 
TACGGCACGG 
CGATTCTCCG 
CGTCAAGCTC 
ACTGCATCTG 
GTTACGTTAC 
CCCGGGGAGT 
TCTGGATCTG 
TGl'CGATGCC 
TCGTTTCGAT 
CTTCGATTAT 
AGGGCGTTAC 
TGCAGTAGGC 



60 
120 
180 
240 
300 
360 
420 
4U0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1 740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
23-10 
2349 



(2} INFORMATION FOR SEQ ID NO: 64 

(1) SEQUENCE CHARACTERISTICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



"in* 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(A) LENGTH: 2625 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI-IDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA (genomic) 

(ill) HYPOTHETICAL: NO 

)iv) AMTI-SEHSE: NO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAIIISM: PORY PHYROHOMAS GINGIVALIS 

(ixl FEATURE: 

(A) NAME/KEY: mire feature 

(B) LOCATION l..,2625 

(Xi) SEQUENCE DESCRIPTION: SEQ lU NO: 64 



GTCGAATCTA 
TTGCTGCTGG 
ATCGATTCAG 
TCCCTCCGCG 
TCTGTCTTGA 
TCTCAAGAAG 
CAGGTAATAG 
CGTGTGGCTG 
ACTCCCTCTA 
CGTGGATTCG 
GAAGACGGGA 
CAGATTCAGC 
AACATTATCA 
AATGATGGAT 
ATCACCATTG 
tSCATTCTCTT 
ACCGGATTCG 
TATGACAAAT 
CCTACGGCTT 
AAGATGGATG 
GGT03TCGCG 
CCCTATGAAC 
GCCAATGCTG 
AAGTGGTTCG 
CCCGGATACe 
GGCGGTAGCT 
CAACTGAAAG 
CTCTTCGCAC 
AACa*ACGAAC 
GGCGTATCGC 
TTCGCACAGG 
GGCAATATCT 
CTTACAGGAG 
GGATACTATA 
TATCTCAATG 
ATTCGTCAGA 
GTAAGTTACA 
AAGGGTCl'TC 
CTGTTCAAGG 
TTCAACCCCG 
TCATGGAAGT 
GGTTCACTCA 
AGCGATCCCG 
TACGGTTTCG 



AATTOrrATG 
CATTAGGCTC 
AAACGTCCGA 
GTGTAACCAC 
CTTTCCGTTG 
ACCTCGGTAC 
CCTCTGTG'3T 
ATATTCAGGC 
CCTATACGAC 
ACACTTACAA 
AAGTATATTG 
GCGGACTCGG 
CGAAGACl'AC 
TGCACAAAGA 
CAGGCTCCCA 
ACTTCTTCAA 
GTGCACCACA 
ACGGCATCCG 
ATGCTTACAA 
AAAATACCTC 
CTTATGGAAA 
AAACAAAGGT 
CGGCGAGCAA 
GTCTACTCTC 
ATGGGCGTTA 
ACTACATAGA 
TCGGTGACAT 
AGATGGAGCA 
TATACCGCAA 
CGTGGAAAAG 
GACACAATGT 
ATGCTGCGGG 
AGGTCGGCTA 
CGAAGTGGAT 
GCGTTGATGC 
TCGACCTTCG 
CTTCTTACGA 
ACGTCGGAGA 
GTTTCCATGT 
CAACGCGTAA 
TGCCCGATGT 
GCACCACATT 
ACGACAATAT 
GCCGCACTTG 



TCTTATGAGA 
CTCTCTCGCC 
ACCGCTCATC 
CGATATGGAT 
CGTAGGTTAT 
GATTCfCCTC 
GGCCAAAGAC 
AGCATCGTTG 
AAAAGGAAGC 
CTTCGGTGTA 
GAGCAATTGG 
AGCCTCCAAG 
GGACGCCAAC 
ATCGTTCTCC 
TATGACGGGT 
CGTTTCG7\AG 
ATGGCACAAC 
TCACAATCAA 
TACGTACCAC 
TCTTTATACc 
GAACAGTAAG 
GACTCCCGAT 
TGGCTCGGAA 
TTCATTCAAG 
CTACCGTGGC 
GGATCCCAAG 
TGTAAATCGG 
TTCGTCCGAA 
TCACAACTAT 
CTTCCTTCCG 
ATTCeCCAAT 
GGCTATCATT 
TGGATTCACG 
GGATCGCGTG 
TGTTCACTGT 
CGGTATGTTC 
CGAAGCCGGC 
TGCAGCACAG 
CATAGGTAAG 
TGCACAGCAG 
AGGTCTGTTC 
CTATTTCAAC 
CATCGGTAAG 
GTCTACCGGT 



AAAAGAATTC 
ATAGCGCAAA 
GGTGTATCCG 
GGTGGCTTCC 
GCTACCGTAA 
GATCCCCAGG 
CGTATGACGC 
AATGTCGAAT 
GGAGGTTTCG 
CTCATCAACG 
AGTGGTCTGA 
CTCGGTATCA 
ACCGGAGGTT 
ATTTCTACGG 
Cl'GGGTTATG 
AAGTTCAATG 
CAACGTTCTT 
TCCTTCGGCT 
AAGCCCCAGT 
gCAi4CCTACG 
TGGGTATTGA 
GGACTTATCG 
GCAATTTTTG 
AAGAAACTTA 
GACCACTATG 
ACAAAGCTCG 
GACTACACAG 
TGGATCGATG 
GGCGGTAGCA 
TGGAGTGGCA 
GGCGGTTTCT 
CCCAATGACA 
AATCACAAAA 
ACCTCGAAGA 
GGGGTAGAGG 
TCTCTCGGTG 
AATGAGACAG 
ATGACGGCTG 
TACAACTTCC 
TACGAAGCGG 
GATCTGTCTG 
ATGGACAACG 
AAACACGATG 
ATTCGTGTAA 



TACAACTTTT 
CAGTGGTGAC 
TAAGCACCGG 
CATTCGAAGT 
CTCGCTCTAT 
CCATCGGCTT 
CGGTACCCGT 
TTCCCGAACT 
GTGATGGTCG 
GAGTTCCTGT 
l-GAATCAAGC 
GCTCGGTAGG 
CGGCTTATGT 
GTATGAACGA 
TGAAGGGGCT 
AACGTCATAC 
CCAAATATTC 
ATCTGCGAGG 
TCTCGCTGAA 
CATCTTTGGC 
TCAACTACAA 
ACTACGATGC 
CCCTTGGCTC 
ATAGTTCGCT 
ACAAGATCAC 
CATACCATGC 
GCGAAATCAT 
CATTCGTATC 
AGTCCACCGG 
AGGCAGGTCT 
TCACACGTGC 
AAGCCAATAT 
ACTTCGAGTT 
GAATCGGAAA 
CTGAGGTCAG 
ACTGGACTTG 
GGCAGGATAT 
CTGTATCGGC 
TTGGCAAGAA 
ATGGCAAAGA 
CATCCTACAA 
TAGCCGACAA 
AGGCTTCCGC 
ACTTC 



CCTGACCGGA 
CGGTAAGGTG 
TCAGGGAGCA 
ACCGGCCAAA 
AGGCAGAGGT 
GGATGAGATT 
TTCCAATATC 
GGTTAAATCC 
TACCAATGTG 
CAATGGTATG 
CAGTACCATT 
TGGTACGATG 
CGGTATGGGT 
CGGTTGGGCT 
GAAGGGACGT 
CCTCTCTCTT 
TGTAGCCGAC 
CGAACTGACT 
CCACTTCTGG 
TACOGGTGGA 
CACCGGACAA 
CGTACTGGCT 
CAACTCTCAC 
GACTTTGACA 
C(5ATCl*GCrC 
GGAAGGTCAG 
GTGGCACGGC 
AGGATCTATC 
CTACCTGCCC 
GAGCTACAAG 
ACCACTCTTT 
GGAAAAGGTG 
CAATATCAAC 
CGAGTATGTT 
CTATCGTCCT 
GCAAAACAAT 
AACCTATATC 
AGACATAGAG 
CTATGCAGGA 
AATCGTGGAA 
TTTCAAGCTT 
GCGATATGTG 
TCTCGTATGG 



GO 
120 
IBO 
240 
300 
360 
42Q 
430 
5^0 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
IbOO 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S0O 
2625 



{2 J INEORtlATlOW FOR SEQ ID NO: 6b 

(i) SEQUENCE CHARACTERISTICS: 
70 (A) LENGTH; 1360 base pairs 

(B) TYPE; nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: circular 

75 fii) MOLECULE TYPE: DNA (genomic) 



SUBSTrrUTE SHEET (Rule 26) (RO/AU) 



•iF=i> S"-' ***'i> 'i-^* f 



-ir' ^=^nj( MP«»i -ipTh 

iU3 Q ic:; c3j lui u 



WO 99/29870 



PCT/AU98/010Z3 
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liii) HYPOTHETICAL: WO 
<ivj WIT I -SENSE: NO 



(vi} ORXGH^IAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIUGIVALIS 



10 



(ixl FEATURE: 

(A) HAME/KEY: misc feature 

(B) UXZATIOH 1. . .1380 



(xil SEQl/ENCe DESCRXPTIOM: SEQ ID HO: 65 



15 



20 



25 



30 



35 



AACAGATGTA 
CTCCCy\GCTT 
ATCTTTCTCC 
CGACTTGAGA 
CGAAATACCA 
GTTGCTCAAC 
GACATTGATT 
GATGAATATG 
CTTTTCATTT 
CAATACGCTT 
GAGACTGAAC 
AGAGAAGAGG 
GCTTTGGGAG 
GAAGCTCTGA 
CGTGCTCGAG 
CCTGCCGAAC 
CGTGCTCTCT 
AGATACCGAA 
GTTAATAATG 
GATGGTGTAG 
CACGGTAACT 
CGTGTTAAAA 
ATTATTCACT 



ATCCCTCGCG 
TTd'AATCCA 
TCTCTGCATT 
AGCAACCTAA 
AGAAAGACAA 
GCAAGCAGAT 
CCATGACGGG 
CCCAAGCTCT 
CATCGGCCAA 
CTGCATACAA 
GTGCGACTGT 
AAAAAAA6AA 
CCAAACAAAA 
ACACAAAGAT 
AAGAACGTGA 
CG<3AACGGAA 
CGGGCAGCTT 
TCGTAAGCGA 
GAGGTATCGA 
TGTCCAGTGT 
ATATCACGGT 
CGGGTGAGGG 
TCGAAATCTG 



TCAATACTTC 
TCTATCTTATG 
TATCCTGCCT 
GGAGGCCCTC 
GCAAGACAAA 
GGTACAACTC 
TGTATGTCAT 
ACAGTCTATG 
GAGCTTTGAC 
GCTGGCATCT 
AGAAGACGCC 
ACTCGAAGGA 
AGACTTGGAA 
CGAGAAACAG 
ACGGTTGGCA 
GGCGGAGACC 
TGCACAGAJ\C 
CTTTGGCGTG 
CATCGCTGTA 
ATTCGTGATA 
TTATGCGAAT 
TCTTGGTCGT 
GAAAGAACGC 



CACCl'ATCGC 

aacttttcaa 
gctctcccac 
aaagccatcg 
caaaagcatc 
ttggacaatg 
cagctctctg 
caaaagcgga 
6aaggcatgc 
gtccggctgc 
aaaaagg;aga 
cagcaagccg 
gcgcagctgc 
attgccaagg 
cgcgaagcca 
aaaggcggct 
aaaggtcgcc 
catcagcaca 
gcaacaggat 
cccggttata 
ctgagcaaag 
gcctatacgg 
agcaaacaaa 



AGAATGACGT 
TCCGCCTTTT 
AAAAATCCAA 
AAAAAACCGA 
TCAACCTCCT 
AGGTCAAAGA 
TACiAAGAGAA 
AACGCTCGTT 
GACGGATGCG 
CCGATACACG 
AAGGACATCT 
AGCAACGTCG 
GAAAGCAGAA 
AAATAGAAGC: 
AAGCCAAGGC 
ATGCTATGGA 
TGCCCGGCCC 
GTGAGCTGAA 
CCGATGCTAC 
ATTCGGCCGT 
TGTATGTAAA 
ATCCTTCCAA 
ACCCAAGACT 



TGGAAGATCT 
CCTCTGCATC 
GCAGGTACAG 
TCGCGAACTA 
GAACAACCAG 
GTTGCAATCC 
AGCCCGATCC 
CGATCGCATC 
TTTCTTGGAA 
TAGCAAGTTG 
CTTAGTCATC 
GCAGGTGCAG 
AAAGCAAGCC 
TGCCGAACGT 
TAAGCCGOT'T 
TGCCTCTGAG 
CGTTCGCGGC 
AAAAGTACAA 
CAGCGTATTC 
AATGGTTCGT 
TTCCGGCACT 
CAACCAGAGC 
ATGGTTACGA 



€0 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
B40 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 



40 



(2) INFOW-IATIOM FOR. SEO ID tJO: 66 



45 



ti) SEQUENCE CHARACTERISTICS: 

<A> LENGTH: 10P.6 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



50 



(ii) MOLECULE TYPE: DMA (genomic) 
(iiil HYPOTHETICAL: HO 



Uv) A^^Tr-sEnsE: no 



55 



|vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROT'IOKAS GIHGrVALTS 



60 



UX) FEATURE: 

(A) NAME/KEY: misc_f eaUure 

(B) LOCATION 1. . .1026 

(Xi) SEQUENCE DESCRIPTION: SEC ID NO: 66 



65 



70 



75 



AGTTTTTATC 
ACGAGTGTTT 
TCCATCGAAA 
GAAAGGAATC 
GATCAGTCAG 
ACCTGGCTTC 
CGCAAAGTAT 
GCTAACGAAA 
ATAGCCTCTC 
TTGGACAATG 
GAGAAATTTC 
TACGTAAACT 
ATTAGTTCTG 



AAGAAATACA 
TGCTCTTTTC 
ITTCTATAAG 
AAGGAAGTGC 
GGGCGAATCC 
CTGACGATAT 
ATGTCGTAGC 
GCGATTTGCA 
CTTTCCTGAT 
TGCCCCTTGT 
AGATTGTGCC 
TCGACAACGA 
CTAATGGTGT 



CAGACTTATG 
CTGTTCAAAG 
GGTAGATCAT 
TGCCGAAAGG 
GGCGAAATAC 
GAAGGTGAAG 
CAATGTTGAT 
GACTGTAAAG 
GTCCGGAAAC 
6CGTGCCATT 
GATAATTGTC 
GACCTACGTA 
TTGGCCl'CAG 



AAAAAGTATT 
AACAATCCTA 
TTCACCAAAA 
CTCATTACCA 
TATATTACCG 
TTGGATATGA 
AATGCGGTTA 
AGGACGACTG 
AAGACACACG 
GCCAAGGTGG 
AATGGTAGTT 
GTGAAGCCAA 
ATTACAGATT 



TGTTATATGC 


CTCG'JTGCTA 


60 


ACGAGCCGCT 


GGAAGACAGA 


120 


CGGGTGAGGG 


AGTACGCTAT 


180 


ATcrrrAccT 


CTTGTTGTTC 


240 


GTAACACTTT 


CACCGGAGGG 


300 


CACAATCCGA 


GGCCGGAGAG 


360 


AAACGGCTCT 


TGATGCTGTC 


420 


CAATGCCGTG 


GTCGACCGAT 


480 


ACTTCTTGGC 


CAATCGTCTT 


540 


AGCTGAATAT 


CTCGCTGAGT 


600 


TGAGTGAG'IT 


CAAGTTCAGA 


660 


CGACCAAGCC 


GGACAATCTC 


720 


GGACTGTATG 


GGGTGCTTCC 


7 BO 



SUBSnrUTE sheet (Rule 26) (RO/AU) 



^'i' «3i ini. Q o m n«3f C3i jiT: jOi lUi il^i 
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C^rc^l ;w.Tc.c^. ... 

GAGGTCGCAT TGCCTCGtS? G^^^c ^^^?S5r ^^^'T^^ GGCTACGGTA Voo 
CTTTATC.TX T.CCTTTOCC ..ACAA.A^? ^^5^^^ ^^S;^ ^--^^^ ^S60 



(2) INFOWIATIOM FOR SEQ ID NO: 67 

(i> SEgUEHCE CHARACTERISTICS- 

(A) LEMGTH: 987 base pair- 

(B) r/PE: nucleic acid 
(CJ STRAiJDEDNESS: double 

25 TOPOLOGY: circular 

(il) nOLECULE TYPE: DHA {genomic? 
(iii) HYPOTHETICAL: HO 
(iv) ANTI-SF.HSE: WO 



20 



25 



60 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGAHISH: PORYPHYRCbtOHAS GIHGIVALIi' 
FEATURE: 

(A) NAIIE/KEY: misc featur- 

(B) LOC'\TIOH 1. . . 987 



50 (2 1. IMFORHATIOH FOR SF.Q ID UO:68 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2634 base pairs 
cn tB> TYPE: nucleic acid 

(CJ STRAHDEDHESS: double 
(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: HO 

(iv) AIITI-SF.H.SE: MO 

ge ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROflONAS GIMGXVALIS 
(ix) FEATURE: 

(A) NAME/ KEY: mlsc featur- 
es) LOCATIOt^ 1...2634 



PCT/AU98/01023 



840 



102S 



(-^i> SEQUENCE DESCRIPTION: SEQ ID NO: 67 

^T^^Z T^^^l EZ--^"° -C-TACTT 

CCGGCCITCA TCCTCGCCGC TACTGCTTTG CCTTTTTTTT CACGATTTl'C 120 

TCCGATCGTA CGCTGGCTGT GA^C^^^AG C^A^I^I-? f^S*'^''^ TGCTTCAGGC Iflo 

35 GATAAGTCGG TGCAGGTGGT GGCATT^S? S^??^*^^^ ACTTCATC6A GTCCATTGCG 24 0 

CCTTCGCCTA CCGtS^?^ S?g?2c ^^^^^^^^ GGAATACGAC 300 

CTGGGGTTCG AGCAAAGAAA T^^^g^ S^^GA^A l^.^^"^^ TATAGGAGGA 360 

TTCGAAATGG GCAAAGCCTT GGCCGATG^ ^rl^SS^ ATAACCCTAA GCTCCCTCTT 4 20 

C^TCATTCI-C ATACAGaSS G^G^^ ^^^^^a^ ^l^^^"""^ CTCCTGCACA 4oS 

40 GCAAAGGCAC TCAGTCGTGC TGCATA^GAC ^TTCGAGCAG TGTGGTAGGG 540 

GACAAATGGG ACAAACCGCA C^CCT^C ^^Ic^S? J^SFJ'^^^^ GAACGAGAAA 600 

GTCGATACCA TGTTTGCCAA TGGCAaJgca r^^f^^SZS ^^SACAGCGT GAAGAGACrc 660 

CTCAGCTTTT TCGCCCAAGA GTT^ctg nr^t^^ TCGTCATATA TCACCCATCG 720 

GAGCCTACGG CTGCCCACCT TCG^TO^CTG A^^^Sl^TS^ TCATAGAGGA AGATGGGAAA 780 

45 ATCGTATTTA TCCAACCCG^ I^^^G^CG ^^CA^^ CACGTGCCGA TGGTGTCAGA 84 0 

GGTGCTCGTC CGGTAAGGAT CAATCCTCTG C^f^^S^ AGGACATCGC ACGCGAGATC 900 

ATTGCTCGCG CTTTGGCTCA TOAACG^ CGCAGCTCGT GGGAGOAGGA AATTTTACAT 960 



60 



ly-i) SEQUENCE DESCRIPTION: SEQ ID NO: 6 9 

^^"f?^ S^^^^S "^^-^-^ '^'°="'A.a eo 

cooc^c „oc.„=.o T^^S^ SSc^--- ^^^.ccx. ..o 



SUBSnrUTE sheet (Rule 26) (RO/AU) 



wo 99/29870 
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10 



15 



20 



25 



30 



35 



40 



ACCGATGCCA 
ATCACTATTG 
TATCTTCGTA 
AGCCAGGAGC 
GAAGAGGATG 
CGCCGTCTTG 
GCTTCTAACC 
CAGAACTGTG 
CI'CATCGACA 
CCTGCCAATA 
TCTTCTGCTA 
ACATTCAATG 
AATGCCrCCA 
TACCGCAACC 
CGCTCGCTGG 
TTTCACACGA 
GTGGGTCrrAG 

TTCTcrrrcA 

AAGAGCTATT 
GGCTACCCTA 
TATATGGGCG 
CTTTTGTTCG 
TGGCAGACCG 
AATATCAACA 
CTTGTTGGCG 
ACCACACTGC 
TTCCGCGCAC 
CAGAAAGTAT 
GCCGATATG'f 
ACTCGTTTGC 
CGCTACACGC 
GTCGCATACA 
GAAGCACAGG 
GCCAATGCAA 
.. CGTACCCCCA 
ATAGCCCTTA 
GTGAAGTCTG 
AAGGCTCCCC 
GTGGGTTATG 
AATATCTTCA 
ATCTATGGTC 



ATATCATCGG 
CTATCAAAGG 
ACTTGCGTCC 
GCGTAGTCCG 
CCATCAATCT 
CTCCi'ACTCT 
TGGCTCAAGG 
GTTTCAATCA 
GCCGTCCCAT 
TGATCGAACG 
TTGCCGGAGT 
AATCTCTGAG 
TCGTCAGCGA 
ATTGGGATGC 
GAGCGCATTC 
TCAGTGAATT 
CTGAACAAAC 
ACTATAAACA 
ACGGAGGTAT 
TCCCTCAAGA 
GTATCCAGTA 
GAGCCGAATA 
GCGAGGATGC 
ACTACAGCCT 
CTCGCTTGGA 
GTTTCAACG'i' 
CCCAGGTATT 
TCAACGATCC 
ATCATCGTTT 
TG*3ATGTATT 
CTATCAACGG 
AGTCCTTCCA 
AGTGGGGTCT 
ATGGACAACA 
GCGTATACGG 
CGGCAGCATA 
CCGAACTGGA 
GTATTGATGA 
ACrrCCACGT 
ACTCTTTCCA 
CTACGCAGCC 



TCACGTCAAA 
CACTACCTTT 
GGGTGAGATC 
CGTAGAAAAG 
GGACGAAGTC 
GGTAAATGTA 
CTTGTCATTC 
AGTTCGTATC 
CATGAGTGCC 
TGTGGAGGTA 
GGTGAATATC 
CTTTACCGGT 
TGACAACCGT 
TAACAATGAC 
TTATTTGCGC 
CCGCCCl-GGT 
TGACCATAGC 
CCACTTCCAG 
CGGAGAGATT 
TCAATAC6GC 
CAGCTACGAC 
TACGCGTGAT 
CAATGGGAAT 
ATTCGGTCAG 
CA,^GCATAGC 
GAATCCGGAC 
CGATGAAGAC 
GAACCTCAAG 
CGGTAACGTC 
CACCAACGAG 
TAGCGGAGCC 
GCTCCAAGCC 
GAATACGGTG 
GGAATACAAG 
TTATTTTACT 
TACCGGTCAG 
TATTATGCAG 
GCTGAAGAAG 
ATTCCAGGCT 
GAAGGACTTC 
GCGTACAGGC 



GACAGCAAGA 
GGTACATCTA 
ACTTTGATTA 
GACAAGACTA 
GTGATTTCGG 
TTGAACGAAA 
CAGCCGGGAG 
AATGGACTGG 
CTTGCCGGTG 
GTACGTGGTG 
ATCACCAAGG 
TTCAGCAAGC 
GCCGGTGCCA 
GGTTATTCCG 
TTGAGCGACT 
GGCGATCGTA 
GTATTTAGCG 
GCTTATACTT 
GACGTCAATG 
AATAATTATG 
TTGGACAAAT 
GAACTCAATG 
ACCATTCCCC 
AACGAATGGA 
GAAGTCAAGG 
ATCAACCTGC 
TTGCACGTAG 
CCTGAAATTT 
CAGACCAACT 
GAGCAGCCTG 
AAAGTATTCG 
GGTCTTACCC 
AAAGACACCA 

aacgaatcx:a 

TTGGCCrrACA 
ATGTATGTAC 
AACAATCCTG 
ACACCTGCAT 
ACTGAGGTTC 
GATCGTGGAG 
TACATGGGCT 



CGGGTGAACA 
CAGATGCAAC 
TGCGTGGCAT 
TCGAGGTGAA 
CCAACCGCGA 
AAGTCTTCTC 
TTCGTGTAGA 
ATGGTCGTTA 
TTTACGGTCT 
GAGGATCGGC 
AACCTTCl'CA 
TGGATAACAA 
TGGTATTCGG 
AATTGGGTAA 
ACAGCAAATT 
TCGATTTCCC 
GAAACTTGAA 
CCGGACAGAT 
GCCACCCCGG 
GCGTGACCAA 
TCCTCCTCAT 
A%33TGATGCC 
TCTATCCCGA 
AAAATGACAG 
ATATGATTCT 
GCGCTACATA 
CGGTTGTAGG 
CTCATGCATT 
TCCTTGTGGA 
ATCAGCACGA 
GTCTCAATCr 
TGGCCAGCAA 
ACGGAGCTTT 
TGACGGATAC 
ATCCTGCTCA 
CCCACGCTAT 
AGATTACCGA 
TCTTCGATTT 
AACTCTATGT 
CTGCACGTGA 
TGGTAGTGAA 



CCTTGTCGGT 
CGGGCACTAC 
GGGCTATAAG 
TTTCGAAGCA 
ACTGACCCTT 
GCAAGTCAAT 
GAACAACTGT 
TGCACAGATC 
GGAGCAGATC 
CTTGTACGGT 
CAATTCTTTC 
CACGAACTTC 
CCAGGCTCGT 
AATAGATGCC 
GACGGGAGAG 
TCCTCACGTA 
ATACGATCTC 
CGTAAATCGC 
TGGTACGGAA 
AGGCAAGACA 
GCC7TCGCAA 
CATCCTTTCA 
ATTGGATCAG 
ATGGAGCATC 
GAGTCCTCGT 
TGCAAAAG'^G 
CGGTGAGGCA 
CAGTTTGAGT 
AGGCTTCTAT 
TGGCATCAAA 
GGAAGGTAAG 
CAAATACGAC 
TGTTACXTGAG 
GCAGATCACC 
CTCATGGAAC 
CGAATATGGT 
CGAAACCGGA 
GGGCTTGAAA 
AGGTATGAAC 
CAGCGGATAT 
GTTC 



240 
300 
360 
420 
480 
540 
600 
660 
V2 0 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1600 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
240O 
2460 
2520 
2580 
2534 



45 



50 



(2) IHFORllATia^ FOR SEQ ID NO; 69 

(i) SEOUEUCE CHARACTERISTICS: 

(A) LEIIGTK: 618 base pairs 

(B) TYfE: nucleic acid 
iO STRAMDEDTIESS: double 
(D) TOPOLOGY: circular 



55 



(lit MOLECULE TVeE: ONA ( genomic t 
(iii> HYPOTHETICAL: MO 
11 V) AI ITT -SENSE: NO 



60 



65 



fvl) ORIGIWAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ixl FEATURE: 

(Al WAI-IE/KEY: mi sc_f«atuce 

(B) LOCATIOJI 1 . . .618 

(Xi) SEQUEHCE DESCRieriOIJ: SEQ ID HO: 69 



AAACAGATAG TTATGACAGT AAAGCGCGCA GTGCGAATAG CACTTCTCAC GCTGATAGGC 60 

ATTCTTTTTT CCTCACCTTC TCTTGTTCGG GCGCAAAGTC TTTTCAGCAC CGAACATGTC 120 

TTGCAACTAT ACAACAAGAT ACTCTATGGA GAGTCGGCGG CGGATACCGT CGCAGAGAAA 180 

70 ACGGCAGGTG AGTCGGCAIT TCCTTTTATA GACAAACTCA TCAATCTCGG CCGCACTTTC 240 

CTCGGCAAAC CATATCGCTA TCGCGGTCCT TCCCCATGGC CGATGGACTU CTCGGGCTAT 300 

GTGTCITACC TCTACTCCAA ATTCGACATC AAACTCCCAC GTGGTGCGGC AGCACAGAGC 360 

CAATATACGA ATCCTATCGA GCGCGAGGAT GTTCGTCCGG GCGACCTCCT TTTTTTCAAA 420 

GGCCGCAATG CACGCAGCAA CCGTATCGGG CATGTAGCTT TGGTCGTATC TGTCGATGAA 4 80 

75 GATGATATTA CCATGATGCA CAGCCGCAAT TCGCGAGGGA TCGTGATCGA AAAACTCAAT 5-10 



SUBSTTTUTE SHEET (Rule 26) (RCVAU) 



n"-^ ii^t «n """M .rfT' ij--^ ^p^. :jf=]| h^-^j 

^ Oi iQi O m %S nC3i O U iU«n 
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CGCAGTGCAT ACTTCTCCCG TCGCTTGGTG AGCTATGGCA GGGTACCCCG AOCCAAGAGA 600 

GTGATCCCAC GAAAAAGT 6i8 

5 (2) INFORMATION FOR SEQ ID NO;7Q 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1401 base paxes 
iB) TYPE: nucleic acid 
10 (C» STRAJJDEDNESS: double 

(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

15 (ill) hTi'POTMETICAL: NO 

(ivj Aiai-SEUSE: MO 

(Vi) ORTGINAL SOURCE: 
20 (A) ORGANISM: PORYPHYRaiONAS GIHGIVALIS 

(ix) FEATURE: 

[A) NAl IS/KEY: mi sc_f ea ture 
IB) LOCATIW 1 . . .1401 



25 



55 



65 



70 



(xi) SEQUEHCt; DESCRIPTION; SEQ ID H0:70 



AA.z\GGTACGT GG7V\TAGAAA AAACCGAAGA GAAGAAATGA AACGGACAAT CCTCCTGACG 50 

GCACTGACCG TCCTATCTTC GCTCTCCTTG CTTCGTGCAC AAAATGAATC CGAAGCATCA 120 

30 ACCAATCCGA TGTCAGGCCT CTCCCTGCAA GACTGTATCC GGATAGCCAA GGAGCGCAAC 180 

CTGAATCTGC GCAGACAGGA GATCGAACAA GAAAACCGAA TCATTAGTCT CGATGCAGCA 240 

CGACACAGTT TCCTGCCCTC GGTCAATGCA GGCATCGGAC ACAACTATAG CTTCGGACGT 300 

TCGAAAGACA AAACGGGAGT AACCGTAGAT CUCrcCTCGA TGAATACCAA TCTCAGCATC 360 

GGAGCTTCGG TGGAAGTATT CAGCGGCACA CGTCGTCTGC ACGACCTCAA GCAGCAAAAG 4 20 

35 TACAACGTGG AGGATGGTAT AGCCCGACTT CAAAAAGCGC GTGAAGACCT CAGCCTGCAA 4 60 

ATCGCGGCTC TCTATATCAA TTTGCTCTTC CGTCAGGT^ TGACTCGTAC GGCAGAAACA 540 

CAGTTGGCAC TGATTCGCGA GCAACGCAAT CGCACGGCCG AAATGGTTCG CGTAGGTAAA 60O 

TGGGCAGAGG GTAAGCTCCT CGACATAAAT GCCCAGATGG CCAAGQACGA ACAACTTCTC 660 

GTACAATATC GTTCGGAGGA GGAGCTGGCT CGTCTGGACT TGGGGCAAGC CCTCGAACTG 7 20 

40 GAGCACCCCG AAAGCATTGC AGTCAAGGCT CCCGACACAG ACGTTCTCGT AGCAGAAAGG 780 

TTGGGATCTC TCCTTGCTCC CGAAGAGATC TATCGCACGG CTCTCGGCTT GAAACCGGCA 840 

CTGCATTCGA GCGAGCTGCA AATAGCTTCG GCACGCGAAG GTCTGGCCTC GGCTCGTGCG 900 

GCATAOLTCC CGACGCTCAG CCTCTCTGCC GGATACAGCA ACGGTTACTT CCGCGACCTC 960 

GGCAAGGA6T ATGCCGCCAT CAACCCCTCC TTCTCCGAAC AGTQGAAGAA CAACGGCAGC 1020 

45 TAGAGTATCG GACTCTCTTT GAATATCCCC ATCTTCTCTG CCATGCAAAC GCAAGATCGC 1080 

GTTCGGAGGA GTCGCCTGCA AATACGCTCA AGCGAGCTTC GACTCGTCGA AGAGAAAAAA 11 40 

GCCCTCTATA AAGAGATCAG GGAAGCATAC AGCAATGCCG TGGCAGCCGA TAAGGCCATC 1200 

GCAGCAGCCG AAAACAGCAA GGCCGCTACG CTCAAGGCAT ACGAATACGC TCGCGACAGC 1260 

TTCGAGGCAG GGCGCTTGTC TGCCTACGAA TATGCCGAGG CAAAAACAAA ATACGCCCTC 1320 

50 AGCCAAGTGG AAGAACTTCG TGCCAAGTAT GACTTCATAT AGAAAGCCAA AGTTTTGGAT 1380 

TTCTATCAGG GCAAAGACTT C 1401 



(21 INPORHATION FOR SEQ ID HO: 71 



(1) SEQUENCE CHARACl' ERISTICS: 

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRAfJDEDHESS : double 
60 (D) TOPOLOGY: circular 



fii) MOLECULE TYPE: DNA (genomic) 

(iil) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: WO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa^OUAS GIHGIVALIS 



(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1...1353 

75 ir.i) SEQUENCE DESCRIPTION: SEQ ID NO: 71 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



,i«it a^^Hi iF--=" ^=^i> jf"i) jf^l; 

o 33 ic:; 43 ioi m n„ji is^i ic- Oi lui u 



wo 99/29870 



PCT/AU98/01023 
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10 



15 



20 



25 



AAAACATCAT 
GCGGCTTTGG 
GACTATGCAC 
AAGGCCGATT 
GCTTCGTGGA 
AGCTTCAACA 
GTCTATCGAC 
CAGCAGGAGC 
CAAATGCAAG 
GCTCGAATGG 
ATGGCCGGTG 
CGGCTCAAAG 
GCTGACAGTC 
GCTGCCCATC 
CGTTTGCGAG 
ACGGGATTCT 
AACCGTCGGG 
CTTGTGAGCC 
GAAGCGGAGC 
CTGGCTTCCT 
GTCTTGCAGC 
CGGCTCCTGG 
AAACTCA-]*AG 



ATCGGATiTAA 
CTGCGAATCC 
GCCGGCACAG 
ACCTTCAGGC 
ATTTCGGACG 
ATTCGTACAG 
TGCGGATGGC 
TGGCAGCTCT 
AGCTGGCCAT 
AAGAGCTCGG 
ACCGTTTGGC 
AAAAAATGAA 
TCTCCGCCGA 
ATCATCCCGT 
CCGCGCGAGG 
C7\CGCTTTTT 
GGGAATACGT 
ATCTGCGTCA 
GCAGGCTCTA 
ACCGCCAGGC 
GTTATGAGGA 
ATGCCCGTGT 
CCTATTACGG 



TATGCGTTTC 
CCTTACGGGC 
TTCGGCCGTG 
CGTCGGCAAT 
CGGATTGGAT 
CATACATGCC 
GCATGCACGC 
CGGCACCACG 
GCAGAAGTAC 
GATGAAGAGT 
CCTGACTCAA 
CTTCCCCATC 
CATGGCCGAA 
CCTCCTCCGT 
TGCATTCCTC 
GAATGGATCG 
CAGTCTGAAT 
GGCGCGTGCC 
CAGCGAGATC 
GAAGGAGCAT 
GGGGCrCAAT 
GCAGCGACTG 
CTGCCTTTCG 



CAACATTATC 
CAATCGAATA 
GCGCTGTCCG 
TTTCTGCCCC 
GCCGAGACGA 
ACGATGACCC 
CGGGAGGCnT 
GAGGCCTACT 
GAGGAGAGCA 
CGTCCCGATG 
GCGGACAATC 
GATGACGAAC 
TCGGACAGCT 
GCCAAACTCG 
CCGAGTGTGT 
GACTATACGC 
CTGAGTATCC 
GAACGCAGGG 
CCCCAAGCCA 
ACCGACGCCA 
ACGGCCATC6 
AGAGCGGCCA 
GAC 



TCATCTGTAC 
TGACCCTCGA 
CTGCGGAACT 
GTGTATCGGC 
ATACCTACAC 
TTTTCGACGG 
CGCGCCTCTC 
ACGACCTCGT 
GCCGCCTCCA 
TCCTCGAGAT 
AGTGCATCAT 
TCGTCGTAGA 
CGGCCGGCGT 
ACGAGCAGGC 
CGGTATCCGG 
CCTTCAGCGA 
CCATCTTTTC 
CGGCAATCGT 
TGGCCGACCG 
TGCAAACCGC 
ACCTGACCAC 
TGACCTACCG 



GGCTGCCGTA 
AGAGTGCATA 
GGAGCAGTCC 
CGGAACCGGT 
CGACATCAAC 
TTTGCAGAGT 
CGTTCGCGAG 
CTATGCGCGC 
CCGGCAGACG 
GCAGTCGCGA 
CGCTCTGATC 
CGATATGCCG 
CTTCGCCCGT 
TGCCACCGAC 
AGGATGGAAC 
GCAGTTTCGG 
GGGATTCAGC 
CCGACGGGCC 
GGATGCCGCT 
T'l'ACGAAGCC 
TCAGGCCAAT 
GCTCAAATGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
700 
640 
900 
950 
1020 
1080 
11^0 
1200 
1260 
1320 
1353 



(2) IHFORI'lATION FOR SEQ ID HO; 72 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LEHGTH: 2Q96 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMDEDNESS : double 

(D) TOPOLOGr: circular 

35 (iJ) MOLECULE TYPE: DNA (genomic J 

(iii) HYPOTHETICAL: NO 



40 



(iv) ANTI-SENSE: NO 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAfl GINGIVALIS 



45 



(i,x) FEATURE: 

(A) NAME/KEY: mlsc_f eature 

[B] LOCATION 1...2886 



(xil SEQUENCE DESCRIPTION: SKQ ID NO: 72 



50 



55 



60 



65 



70 



75 



GCCATTTTTG 
AAGAGCATGA 
ATGGCAACTG 
TCTTCAAGAT 
GATTTCAATC 
CCCGTCCGGA 
ATCTCTTCCA 
GCCATTACGC 
ACCCCCGACC 
GCTTTTGCCA 
GGAGGGACTT 
GGACGCCCCG 
GAGATCAACA 
CAAATGATCA 
ACGAAAGGA6 
AATTTCGGTG 
AGAGGATGCA 
ATGAAGGeiT 
GAAGTCCCTC 
TCGAGCGGTG 
AAGACTCCGG 
TTCTATTTCA 
AATCAGAATA 
GAGTTCTACG 
CGCATGGACA 
TATCGTGACG 



TCGTATCATT 
ACAAATTTTA 
CACTGACCGC 
CCGCCGGAAC 
CCJGAAGACCT 
TAGGACAAGT 
TCGGAGACGT 
TTTATTACGA 
ATGAAATTGT 
CAGAGCCGGT 
TGCCTGACAT 
TAACGGATAA 
TCAATTGTCC 
TGGTAAAAGG 

AcrrrACTCC 

TAACGCAATC 
GCAATGGTAC 
TCCTCCCGAT 
TGCGCTATCG 
CCGGTATTCA 
CTCTGAATAC 
AATACGATCA 
AGCACGTGGT 
GCAGACTGAA 
TCTATCTGGA 
GTTATAAGCC 



GCAAATTGAA 
CAAATCACTT 
TTCTGCACAG 
GCATTCATTC 
GATCGCACAG 
AATACCGGTG 
AGATGTATAT 
TGCATTCAAT 
GTTGGGAGCA 
ACCGGGGAGT 
CAAGATCTCC 
CCATTACGGG 
TGAAGGTGCA 
ACAGTATATC 
GCTGATCATT 
CGAGTTGGAT 
ATTGGCCATC 
CAAAGGTAAA 
TGTCTATTAC 
TCATCCGGCC 
ATGGATCTCC 
AGGTGGTACG 
CGGCACACTG 
CAGTCATTGG 
TCCCCAAAAC 
TTTGCCCTCT 



AAAATAACAG 
TTGCAGTCAG 
ATTTCGTTCG 
GACGATGCAA 
AiGCCGTTGGC 
GATGTGGACT 
CGCCTGGAAT 
ATTCCGGAGG 
TATACGAACG 
GAGCTTATTA 
GGTGCGGGTT 
ATCGGTGAGG 
GACTGGCAGG 
TCAATGTGCT 
TCTGCCGGAC 
AAGTGGATCT 
TTCCGTGOCA 
TCCGATGGTC 
AATGGATGGG 
GGAGATGCCA 
TCCAGTGGTT 
GAAGGAGGAT 
ACCGGAGGTG 
AACGAGTATG 
AATGGCCAGA 
GTGCCCCGGC 



AGAATAAGTA 
GACTCGCTGC 
GAGGGGAACC 
TGACTATCCG 
AATCGCAAAG 
TTGCATCCAA 
TCAAGTTGGA 
GC'3GACGCCT 
CCACTCATCG 
TGGATTATGA 
ATATATTCGA 
ACGATTCCGA 
CAGAGAAGAA 
CAGGCAACCT 
ACTGTGCTTC 
TCACTTTCCA 
ACAGTATCAT 
TCTTGCTGCA 
ACAGTACGCC 
TGAAGATTTC 
CCGGAGGGAC 
CGTCCGGTTC 
CCGGCAATTG 
CATCCGATGG 
CGACCATCCT 
TATTGTTGCA 



TAATTCAGAC 
CrrCGl'GTCG 
CTTGAGTTTC 
CCTTACTCCG 
AGATGGCCGG 
GGCTTCGCAC 
AGGAGCCAAA 
CTATATCTAT 
CCGCAACGGA 
AGTGTCTCGC 
CAAAGTCGGC 
TTOGGATTGC 
CGGTGTGGTG 
GCTCAATAAT 
CATAACAACC 
CTATGAAAAA 
CGGAGCTTCC 
ACTCAACGA7 
CGATATTCCC 
CATCCTAAAG 
TGACGATCAC 
TTCTCTCTTC 
TGGCGGGACG 
CAATACGAGC 
CAACGGAACG 
GTCTACAGGC 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
€00 
660 
720 
78Q 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 



SUBSTTTUTC SHEET (Rule 26) (RO/AU) 



O £31 S Jl. isS S to o o f o lui o 



wo 99/29870 



PCT/AtJ98/01023 



49 / 490 



10 



15 



20 



25 



30 



GATCAGGTCG 
GTCGAATACC 
TCGGATGCCA 
CGCTTCATTT 
TCTGOCGACC 
CCCGGAGGAG 
TTCGGAGAAA 
GCCCCACAAA 
ACATATCCCG 
ACTTTCCACC 
TCCCiAITGGC 
AATGACCATA 
ATCCGTTATG 
AACAACGQAG 
AAATA7C1-GG 
TATGCTAAGG 
AAC?GGAACAT 
ACAGAGAGCG 
GTTGCTCAfiA 
TTCAGCATTA 
CAGGTTCGTT 
GCCGGTACTT 
AAGCAG 



AATTGAATTG 
ACATATTCCG 
TCQACGAAAG 
ATCCCTCGCC 
TTGCCATAGG 
GAGTATCATT 
GCCCCAATCC 
CCCCGAATCC 
AAAAGGCTGC 
TCTTCCTCAA 
AGGCCGGAAC 
TGCTTTTTGC 
TAAGAAATCC 
TCTCTTTCGA 
TGGTAAATAC 
GAACGAATGT 
TTATCGGCAC 
ATGAATACGA 
TTGAGAATAA 
AGAACGCTCA 
CTTGGAACAA 
ATATGCTCGT 



GACGGCTGTT 
AAA'l-GGAAAG 
TATTATCGGT 
GTTGGATGGA 
AGACATTCAG 
AAGCTGGAAA 
TGTGTTCAAA 
TCCCGTTGGC 
TATCGCTGCC 
GAGCAACACA 
ATGGTTGAGG 
CGGTATCAGA 
GGATAACCTT 
AGGCTACGGA 
CGATGCTCCG 
GGCTCCATTC 
ACAGGATCCA 
AATAAAACTG 
CAATGCTGTC 
TATGGTTCAC 
CCTCCGCAAT 
TATGCAGACG 



CCTGCCGATC 
GAAATAGCTA 
AGCGGTATCA 
GTGGAATCTT 
ACCAAGCTGA 
GTTCCTTTCT 
ACCTTTGAAG 
GTAGTCATl-G 
GTTTATGTAA 
AACAGAAGAT 
ATCAATTTGG 
ATGCCTAATA 
TTCTCCATTA 
ATACCCTCGC 
AAGATCGATA 
CXICGAATTGG 
TCCGTCACAA 
GTATATAAGG 
GTTGCATATC 
GCTGCCGCCC 
GGCGTGACAT 
GCAAACGGCC 



AATATCCATC 
CGACAAAGGA 
TTCGATACGA 
ATAAGGATAC 
AGCCGGACGT 
TAAGCCAGTT 
TGCCCTATGT 
CAGACAAGTT 
TGCCATCCGC 
TGCAGAAGGT 
ATAAGCCGTT 
AGTACAAGCT 
CCGGTAAGAA 
TCTTGGGCTA 
TGTCGCTTGT 
TCG(3CATATA 
CTTATTCGGT 
GATCGGGCAT 
CGTCTGTTGT 
TCTACTCATT 
TCAGTGTTCA 
CTGTGAGCCA 



ATCTTATCAG 
GTTGTCCTAT 
AGTAAGCGCA 
GGACAAGACT 
AACACCTCTC 
GGTTTCCCGA 
TTCTGCCGCA 
TATGGCCGGT 
TCCGGACTCT 
GACAACTCCC 
CCCGGTGAAT 
CAATCGTGCT 
GATTTCATAT 
TATGGCTATC 
ACAGGAGCXTT 
TGTCTATAAG 
TTCAGACGGA 
TTCGAATGGC 
AACAGATCGT 
GGATGGCAAG 
AGGACTTACG 
AAAGATCGTG 



(21 It^FORIlATIOM FOR SEQ 113 NO:73 

(i) SEQUENCE CHARACTERISTinS: 

(A) LENGTH: 2106 base pairs 

(B) TYPE: nucleic acid 
iC) STRAIN DEDNESS: double 
[D) TOPOLOGY: circular 



1620 
1680 
1740 
1000 
1860 
1920 
1980 
204 0 
2100 
2160 
2220 
2260 
234Q 
2400 
2460 
2520 
2580 
26-10 
2700 
2760 
2820 
2880 
2886 



35 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii> HYPOTHETICAL: NO 



40 



(Iv) AHTI-SEIISE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 



45 



(ix» FEATURE: 

(A) NAME/KEY: mlsc_f eature 

(B) LOCATI0r>l 1...2106 



(xi) SEQUENCF. DESCRIPTION: SEQ ID NO: 73 



50 



55 



60 



65 



70 



75 



TGGAAAAAAA 
ACAGAAAGCT 
TTATCATT6A 
GATGGCGAGA 
ACAGAACCCG 
TATTTACAGG 
GCCTATATCA 
AGCATCTATG 
CAATrrGTGG 
ATGCTTGCTC 
GATTATAAAC 
CAAGGATATG 
TTCCrTACCA 
CTTGATATTG 
TGGACACTCT 
ACAGGACAGC 
GACAATCAGG 
TCTTTACAAG 
CTCCATCAOG 
ATCTTCATOA 
AGCATTAGCA 
CATCTTGTAG 
ATAATCACTT 
AATGGGGTAG 
GTCAATACAC 
AACAATAAAG 
GGGACACAGA 



GTAATCCTCA 
ATTTAAGCCT 
TGTTACTCTC 
ATACGCTTGT 
GTCAGGGCAT 
GACAACTTCG 
TTCCGGTCAA 
TAGTGGCCAA 
TAGAGAAATC 
ATGGCAATAA 
TCAAACGAGT 
AAGTGGTCGG 
CAGAAGCTCA 
CAGAGTCGGC 
CCACACCGGA 
CGGACAACAC 
TCAAGAGCAA 
AGCCGGAAGC 
ATGCATTCAA 
ATAACGTGAA 
TACAGTCAGT 
CAAGTAGTAG 
CCAAGATACC 
CCGGTTTGAA 
TTGGCACAGC 
CGATTTACCA 
CAATCATCGG 



ATACCGGGGC 
AAGGAGGAAT 
TCTCTGGACG 
TTTACGCGTA 
ATACAATGAG 
TTGGCAGGTG 
AGAGGAAATG 
TCTCGATTTC 
TATTGAAGTC 
GCAGATCAAT 
GGCAGCAAAG 
AAATATACAG 
AGAGATCCCA 
ACCTGCCAAT 
OAAGCGACCG 
AGCCAAACCG 
TGTCCTCTAT 
TGTTTCTGTA 
TCTGCCTGCC 
CACATACTCG 
TACCTTTAGC 
CGACCAGTAT 
GGTTAATAAC 
AGAGACTGTC 
ATCGGCATGG 
TGTCGTAGTA 
TTATCCCCCC 



AGGACAGCAA 
CAAACAATGA 
GGCTGTACAC 
GAGACCGGTA 
AATAAAGTAG 
AAGTCTACAG 
CGACCACTAT 
AATGCTCCGG 
TCTTCTACGA 
ATGGCTACGA 
ATTCGCATGA 
GCAAAGTTTC 
GCTGCTGCAT 
TCTATCCATT 
GAATTCTTCA 
TACTACTACA 
AATCTGAATG 
AACGGCACAC 
ACCAATTACT 
QTGAAATATC 
TACGTCTCTT 
CCTACGATTA 
GTACCAAAGA 
ACAGTACTCC 
AGACCAGACG 
CTGCTTCCAC 
ACTGAAACAA 



ATTACCGAGA 
AATATCTTAT 
ACGAGGAGCT 
AAGCCCCAAA 
GCTCCATTTC 
ACTATCAAAT 
TCAATGGCAA 
CCACAGAAGC 
CAGCCCCTGC 
CAGAAGGGAA 
TAAAACCCAC 
GCAATTCGGT 
CCTATAAGAC 
TCTATTCTTA 
TCATGGTCAA 
GAGTGCCCCT 
TGAAAATCGA 
TCGCAATAGA 
TGATAGTGGA 
AAACTTCGCA 
CTGATGGCAC 
CAAGCGATAA 
AGATCGTTTT 
AATATCCTGC 
GATCTTTGGC 
CCGAGAATTT 
TTTCTTTTCA 



rCATTATTVAT 
CAGACTCTTC 
CTCTATTTGC 
TGCTCGTGCC 
TGTGCTCTTC 
CCATGAAGGG 
CAACAACTTC 
TGCGCTTTCT 
CGATTTCGTA 
ACTGTTGGGG 
CATCAATGTG 
AACGAAGGGG 
ATCGGAATAT 
CTATAACAAA 
ATTCAAAAAC 
CGAATCTCAG 
AATCTTGGGT 
AGAATGGATT 
ACAGCACGAA 
GAAACCAATC 
TCAGGACAAT 
TACAAGCATC 
TGAGGTAACT 
ACAATTTATT 
TCCGGGGCTT 
ATTTGAAGAT 
TAAGAAAGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
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ID 
15 
20 
25 
30 
35 
40 
45 
50 
55 
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75 



AACAATACCT ATCCGATACT ATGOTCTGAC ACAAATACGA CAAAACAGGA CCTTGAGACA 1680 

TCAAGAATGA TTTCACCTTC CTTTGAGTTA GCCTCCCAAC TTG(3GGCTAC TCTCCCGATG 174 0 

CCCTATCTCG AGTATTGGCC AG3GACATCA TATCTCCTTG ACTATTCGGG AAACTATAAT IBOO 

AATAAGAGAT ACCCCTTGTT TAATTGCGCT TTTTACTGGG AGAAAAGAAA AGT'J'AATAAC IB 60 

GAAGAAATTA AATTCGATGA CTGGCGTTTG CCGACAGAAG CTGAGATCAA ATTGATAGAT 1920 

AAGCTGCAAC ATAATGAGCA GAGTGCTGTC CAAGCTATCA TGACAGGGAA TTATTATTGG 1990 

GATAGTTACT CTGCAAATGG GTCTTATAAA ATGCAAGGAG GAGGGGGCCA AGGAAATTCC 20 JO 

TCCAAAGCCT ATGTTCGTTG CGTGCGGGAT GT'IAAAAAGC CGATTCGTGA CAAGAAGTCA 2100 

CCTAAG 2105 



t2» IHFORHATION FOR SEQ IV UO:'f4 

SEQUEHCE CHARACTERISTICS: 
tA) LEHGTH: 3936 bass pairs 
(B) TYPEi nucleic acid 
1C> STRANDEDHESS: double 
(DJ TOPOLOGY: circular 

lii) HOLECULIv TYPE: DHA {genomic) 

(lii) HYl?OTHETICAL: HO 

(iv) AIJTI-SEHSE: 110 

(vl) ORIGINAL SOURCE: 

{AJ ORGAtllSH: PORY PHYROI lOMAS GIUGXVALIS 

(ix) FEATURE: 

{A) MAI-IE/KEY: misc_feature 
ID) LOCATIOIJ 1. . .3936 

lr.i> SEQUEl-lCE DESCRIPTIOH: SEQ ID NO; 7 4 

AAGAGGGTAG ATAGCTATGA ATGCAGACAA AAAGCTTGCA AATGTGCAAT TTGTGTGATA 60 

CAAAAATTTA CTAATGTAAA ACTAAATGAT ATGCGAAAAA TTTTGAGCTT TTTGATGATG 120 

TGCTCTCTGC ATTTAGGTCT ACAATCTCAG ACTTGGCATG GA6ATCCGGA CTCAGTGGCA 180 

GCCCTACCTT CTATCGGTAT TCAAGAGTCA AGTTGTACCC GAATCACGTT CGAGGTTGTT 240 

TTCCCCGGA'l- TTTATAGTGT GGAAAAACGA GAAGGCAACC AAGTCTTTCA GCGCATTTCC 300 

ATGCCG-SGTT GTGGCTCGTT TGGGAATCTG GGCGAAGCTG AATTGCCTGT TTTGAAAAAG 360 

ATGATA6CCG TTCCGGAATT TTCAACAGCT AACGTTGCTG TAAAAATCAA AGAGACGGAG 420 

ACATTCGACA ATTATAATAT CTATCCTAAl" CCTACCTATG TCGTAGAGGA GTTGCCTGAG 4 80 

GGGGGGACTT ATCrGGTAGA GGCTTTCGCC ATAAACAATG ACTATTATAG CCAAAATGTA 54 0 

AGCCTCCCTT CTACTCACTA TGTCTATTCT CAAGACGGGT ATTTTCGCTC ACAAAGATTT €00 

ATCGAAGTTA CCCTGTATCC TTTTCGATAC AACCCTGTCC GACAAGAAAT TCTATTTGCA 660 

AAAAAAATCG AGGTTACAAT AACTTTCGAT AATCCTCAGC CACCTTTACA AAAAAACACC 720 

GGCATATTTA ACAAAGTAGC CTCCTCTGCA TTTATTAATT ATUAAGCTGA TGGCAAATCG 7 80 

GCGATAGAAA ATGATATGGT GTTCAGTCGT GGTACAACAA CGTACATAAG CGGAAATGTT 84 0 

GCCAGCAACC TCCCTCACAA CTGTGACTAC TTGGTTATTT ACGATGATAT GTTCAACGTA 900 

AATCAACAAC CACACGACGA AATCAAACGG CTGTGCGAAC ATAGAGCCTT CTACAACGGC 960 

TTTGATCTAG CTGCTGTAAG TATAAAGGAC GTATTGAATA GCTTCCCATC AAATGCCACC 1020 

TCATACATCA ACGAAACTAA ACTGAAAAAT TTCATTCGCT CAGTTTACAA CCAAAGCAAT 1080 

GCGAAGAGGA CTTTAGATGG CAAACTGGGA TACGTGCTAC TGATCGGAAA ACCAITGAGC 114 0 

AAATATTTGG CTGACACTGA TAATACAAAA GTCCCAACCT CTTTTATTCA TAATGTCTCC 12 00 

TTAATTCCAA GTCATCCAAC TTTTGGTTCC ATATGCGCCT CCGACTATTT TTTTAGTTCT 1260 

GTTTCGCCCC TTGATACTGT CGGCGATTTG TTTATCGGTC GATTTAGCGT CACCAATGCT 1320 

CATGAATTGC ACAATCTGAT TGAAAAGACT ATCAACAAAG AAATCTCATA TAATCCTATT 1380 

GCACACAAAA ATATTCTTTA CGCAGAAGGG AAAGGCTGCG ATGCTCCAAT CTTACGTTTA 14 4 0 

TTCTTAAAAG AAATCGCCTC TGGTTACACA GTCAACTCTA TCTTAAAATC TAATCAGCTC 1500 

TCTGCAATAG ACfCGATATT TGACTGCTTG AATAATGGTT CCCATCATTT TTATTTTAAC 1560 

ACTCATGGAA TGCCGACTGT TTG-SGGGATA GGGCAGGGAC TCGACGTCAA TACTCTAACA 1620 

GCCCGATTGA ACAATACATC TTCGCAGGGA TTATGTACGA GTCTATCATG TAGTTCGGCT 1680 

GTAGCAGATT CAACTATTAG ATCGCl'TGGA GAAGTCCTGA CCACATACGC ACCTAACAAG 17 4 0 

GGATTCTCGG CTTTCTTAGG AGGAAGCAGA GCCACCCAAT ATGCCGTTTA TTTAGAAG«5C 1000 

CCCTGTCCTC CGTCAGAATT TTATGAATAT TTACCTrATT CTTTATATCA CAATCTCTCG 1860 

ACTGTTGTTG GCGAAATGTT Gd'ATCATCC ATTATCAATA crrAATTCTGT TGATACGTAT 1920 

TCGAAATTCA ACTTCAATTT GCTTGGCGAC- CCTGCACTAA ACATTATGGC TCATGGCATG 1980 

GAGGTTAGTA ATTGTATTAC ACTACCAAAC AACACCATTA TAAGCAGTCC GATAACAATA 204 0 

AAAAAT&^TG GCTGCCTAAA AATACCGGAA AAAGGAGTTT TqCATTTTAC TAATAATGGC 2100 

TCCATACAAG TCATGTCCCj AGGAACTCTG GAAATAGGCA ATCAGGCTAA AATATCCGGA 2160 

GAGACCGGTG CTAACCCCAC CTTTATTACC GTTTACGGCG ATGGTCTTGC GATTAACAAG 2220 

CAGGTAGAGA TAGACAATAT AGACCGACIT AACTTGTTTT CTACGCATTC GGTCATGCCC 2280 

AAATTTCATT TTGACAGTGT GAAATTCAAC AGTGCCCCGC TGTATACAAC GAACTGTATT 234 0 

GTGGAGATAA GCAATTGCGA ATTTACCAAT CGAAGTGACA TTATTTCAAA GAATTGTGAC 2400 

CTAAGCGTTG AAAACAGTAT GTTTAGCAGT TCGGGGATAA CGGTATTCAA GCCTATGGCT 24 60 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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ACAAGCTCCA TCACCGGATT ATCTACAAAA 
ACAGGAAACT TCGCCTACCA TATCACAAAC 
GCCATCAACT TAGACAATAT TCCTGAGTAT 
GATGAGGCTC TTGTACTAAA TAATAGTGGC 
5 CC5GAATGTGA TAAAAAACTG TAGGATTGGG 
AACCGAAATA AGATCAGTAA CAATCATATA 
TATTTCGATA ATGCTCCTGT AATCAATGAA 
ACTTGGCAGC TCTATTCATC AAACGGTACA 
CAGG-SGGGAG ATAC7VGATAC ATGGATTTAC 

10 GTTTCAAATA ATCACTGGGG CAACAATGAT 
CCAGACTTGT TCATTTGGAT ACCTTTTTGG 
AGCTCTGCTG AAGCAGTAGA ATTCCAAACA 
CTTTCGGCAA AAGTOGCTCT CAAGATGATG 
ATAGCTGCTT TGAAGGAAI'T GTTCAGGATA 

15 TTGAAAGATT ATTTCAGATC CAATCCAACC 
GCTGATTTCC TGTCTGCGCG ATGCGATATT 
TGGTACGAAA ATCGCTTGAA TAGTGAAATC 
GACCTTGGTG ACATTTATTG GAATATGCAG 
TTGAACATAC TTTCCTGTGA ACAAAGGAAA 

20 TATTTGTTGT CAACTCTTCC CGAATCAACA 
AAATCAAGCC TTGftTAAATC CAAGATAATC 
GTAACAATAA TCl'ACTATAC CGATAACCCT 
AATG<3ACCCT CGGCTGATAT AACCGGGTTG 
ATACAGTTCA ATACATCCAA CTaTGATCCC 

25 CAGAAAATTA TAGATACGGA AAAATTACGA 



GCAAAGAITA CCGACAATAC TTTTTTTGCG 2520 

ACGCCAGGCT TAACAGCAAC CTCCAATGCT 2590 

TACATTTCCG GTAATAAAAT AGTCAA'ITGC 2640 

AACAGAACGA ACAGACTCCA CAATATCACA 2700 

AfiCACGCT'rr ATAATTCCTA TGGTATTTAC 2760 

GGAGTACGTC TCCTCAACAA CAGTTGTTTT 2820 

GAAGATAAGC AGACGTTTAT TTCTAATAGG 2880 

TTCCCTCTCA ACTTCCATTA CAACAGCl'TG 2940 

AACGACACGT ATACGAATCG CTATATTCAC 3000 

TTGTTTGATC CGAATCAGGT TTTCAATACG 3060 

GATGGATTGC CAAATGGGAG ATCGGGCAAT 3120 

GCATTGGACT GTATTGGCAA TAGCGATTAT 3180 

GTT6AAACC1' ACCCGGAATC CGACTTTGCA 3240 

GAGAAAATGT CAGGCAACGA tTACGAAGGC 3300 

ATCATCTCTT CCCAGAACTT GTTCCCGACA 3360 

GTGTGTGAAA ACTATC3\GTC TGCCATCGAT 3420 

TCCl'ATCAGG ACAGTGTTTT TGCAGTCATT 34 80 

TTAGACTCAC TCAOAGGGAC TGGTATAGAT 354 0 

TCGCTCGAAA GCCATCAAAA TGTAAAAAAT 3600 

GGTACrCTCC TGCCTCCATT AGAATGCAAC 3660 

TCTAirTTCGC CCAATCCGGC GAAAGCTGTT 3720 

TCCTGTTCrG TAATAAAAAT ATATGGAATA 3780 

CCCAAACATC TATCCGAAGG TTATTACAGC 3040 

GGTTTCTACC TGGTAACGCT AAATGTTGAT 3900 

ATCAAA 3 936 



{2) INFOWIATION rOR SfelQ ID HO: 7 5 

30 li> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 281 4 base pairs 
fB) TYPE: nucleic acid 

(C) STRPdiDBDtAESS : double 

(D) TOPOLOGY: circular 

35 

(in HQLECULE TYPE: DNA (genoioicl 

till) HYPOTHETICAL: MO 

40 (Iv) ANTI -SENSE: KO 

(rl) ORIGIHAL SOURCE: 

(A) ORGAJJISM: PORYPHYRatONAS GIMGIVALIS 

45 (ix) FEATURE: 

(A) HAME/KEY: mlsc_f eature 
IB) LOCATION 1,,.2814 



50 



(xi) SEQUEl^CE DESCRIPTIOTJ: SEQ ID HO: 75 



TCAGAAAATT ATAGATACGG AAAAATTACG AATCAAATAA TGGCTATCAT GATGAAAAGT 60 
ATTGTTTTTA GAGCATTTCT AACGATTTTG CTCTCGTGGG CAQCGATCAC GAATCCGACT 120 
SCTCAAGAGA TCTCAGGCAT GAATGCATCC T6TCTGGCTG CTCCQGCTCA ACCGGATACT 180 
ATCTTATATG AAAGTTTTGA GAATGGACCT GTTCCCAATG GCTGGCTTGA GATAGATGCT 240 

55 GATGCTGATG QTGCCACTTG GGGAAGCCCA TCAGGCTCTT TCTCTGTACC TTACGGACAC 300 
AATGCCCm* QCACCTACTC CCATATACGT TCCGGTATCT CAACAGCGGG CAACTATCTG 360 
ATTACACCCA ATATAGAAGG AGCCAAACGG GTCAAGTACT GGGTATGCAA TCAGTATAGT 420 
ACCAATCCGG AACATTACGC AGTAATGGTA TC6ACAACGG GGACTGCCAT TGAAGACITT 4 80 

G-n-TTGTTGT TTGATGATTC CATAACAGGG AAACCGACTC CTCTTGTATG GCGTAGACGA 54 0 

60 ATCGTGGACT TACCGGAAGG GACCAAATAT ATTGCATGGC GACATTACAA AGTCACCGAC 600 
TCACACACAG AATTCTT6AA ATTGGATGAT GTCACTGTGT ATAGGTCGAT CGAAGGGCCC 660 
GAACCTGCTA CCGACTTCAC AGTAATCAAT ATTGGTCAGA ATGTGGGACG ATTGACTTGG 7 20 

AACTATCCGG AGGATTATCA ACCGGAAGGA AAGGGGAATG AAGAGTTGCA GCTTAGCGGC 7 00 

TACAACATCl' ATGCGAACGG TACACTACTG GCACAAATAA AAGATGTCTC CATACTGGAG 8 JO 

65 TATGTGGACA GCACTTACTC TTTGCGAGAC AATCCCTT3C AAGl^SGAGTA CTGCGTTACA 900 
GCCGTTTACG ATGAAAGCAT AGAATCTTCG ACCGTATGTG CCACGCTGCA TTACGCCACG 960 

GATGCCATCC TTTATGAAAA TTTTGAGAAT GGACCTGTTC CCAATGGTTG GCTTGTGATA 1020 

GACGCTGATG GAGATGGATT TAGCTGGGGA CACTATTTGA ATGCATACGA CGCTTTTCCC 1080 

GGCCATAATG GAGGCCATTG CTCCTTGTCG GCTTCTTATG TTCCGGGTAT AGGCCCGGTG 1140 

70 ACTCCCCACA ACTATCTGAT TACCCCCAAG GTTGAAGGAG CCAAACGTGT CAAGTACTGG 1200 

GTAAGCACGC ACGATGCCAA TTGGGCAGCG GAACATTACG CGCTGATGGC TTCGACAACG 1260 

GGGACTGCTG TCGGAGATTT CGTCATATTG TTCGAAGAAA CCATGACAGC GAAGCCGACC 1320 

GGCGCATGGT ATGAAAGAAC CATCAACTTA CCTGAAGGGA CTAAATACAT OGCATGGCGG 1300 

CATTACAACT GTACCGATAT ATATTTCTTG AAGTTGGACG ATATCACTGT ATTCGGGACT 14^0 

75 CCTGCATCAG AGCCCGAACC TGTTACCGAT TTCGTTGTCT CGCTTATTGA AAACAACAAG 1500 
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GGACGATTAA 
CCATTGCAGC 
GACCCGACTG 
GTGGAATAl-1- 
AAGCTGATTT 
ATTCCTGAAG 
CCTTGGACTA 
ATGATTGGCG 
CTTCTCAAGT 
ATGGTTTCTA 
ACCGCTTAAGG 
TATATTGOJl' 
ACGGTTTATC 
ATTGAGAATA 
ACTGATGATA 
CTTGrrCACA 
GACGGTCAGG 
CAATCG'^TTT 
ACAAGCTTGA 
CGGAGCAAGT 
ACTCIATTCAG 
AAAGTAGTCG 



AGTGGAATTA 
TTGCCGGCTA 
TTTTGGAGTA 
GTGTCACTGC 
ATGATTCTCA 
GCTGGTTGTT 
TGTATGGACA 
TTTTAACTCC 
ATTGGGTAAG 
CTACGGGAAC 
CTAACGGTGC 
GGCGGCATTA 
GTTCTACTGA 
ACAAGGGTCG 
AAAAACCATT 
TACAAGACCC 
TGGAAATGGA 
GCGATAAGCT 
AAATATATCC 
CGACAATCGA 
AGAAAACGGA 
GTGGAAATAA 



TCCTAACGGC 
CAATATCTAT 
TATCGATGAG 
CGTTTATAAC 
ATCGGACATT 
GATTGATGCT 
TGACAGTGAG 
GGATAACTAT 
TGCGCAAGAT 
TGCTGTTGAA 
ATGGTATGAG 
TGATTGCACC 
GACTGTTCCC 
CCTGAAATGG 
GCAGCTTACC 
GACTCTTTTG 
ATATTGTGTC 
GAACTATACT 
TAATCCGGCA 
GTTGTATAAT 
AATCGATGTT 
AACAACAACC 



TACGAACCCG 
GCAAACGGCT 
ACTTATTCTT 
GACAATATCG 
ATCTTATATG 
GATGGCGACA 
AAGTGTATTG 
TTGGTTACAC 
GCTGTTTATT 
GATTTTGTCC 
CGAACTATTA 
GATATGTTTT 
6AGCCTGTTA 
AATTATCCTA 
GGCTACAACA 
GAGTATATCG 
ACTGCCGTTT 
ATCACATCCT 
TCGTATGTGG 
GCGCTGGGAA 
TCACGTCrrCA 
GAAAAGGTAG 



ATAAGACTCA 
CGCTCCTTGT 
CACGAGACGA 
AGTCCCAATC 
AAGGCTTTGA 
ATGTTAATTG 
CATCCCCTTC 
CCAGACTCGA 
CCCCTGAGCA 
TCTTGTTCGA 
CATTGCCTGC 
TCTTGCTCTT 
CTGATTTCGT 
ACGGCTACGA 
TCTATGCAAA 
ATGAGACTTA 
ATAACGACAA 
TGGATAATAT 
TAAGGATAGA 
TTTGCATATT 
ATGACGGAGT 
AGATAAAGAG 



TGATAAAGAC 
TCACATACAA 
TCAGGTGGAA 
GGTTTGCGAT 
GGCCGGAAGT 
GGACTATTAT 
GTACTTACCG 
AGGAGCCAAG 
TTATGCTGTG 
AGAGACAA7G 
AGGAfiCAAAA 
GGATGACATT 
TGTCTCGCTT 
ACCCGATAAG 
TGGCTCGCTC 
TTCTTCACGA 
TATCGAGTCC 
TCAATCTGAT 
GGGATTGAGT 
AAGGGAAGAG 
CTACTTGATT 
GCCG 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
20-10 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2814 



25 (2) INFORMATION FOR SEQ ID WO: 7 6 

(1) SRQUEHCE CHARACTERISTICS: 

(A) LENGTH: 1818 base pairs 
_ (B) TYPE: nucleic acid 

*iU <C) 3TRAIJDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

35 (iii) HYPOTHETICAL; NO 



40 



45 



50 



55 



60 



65 



70 



75 



(iv) ANTI-SEWSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAiaSM: PORYPIIYROC>IOHAS GINGIVALIS 

(ix) FEATURE: 

(A) MAME/KEY: mi sc_ feature 

(B) LOCATION 1 1818 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76 



ATAATCTTCT 
CAATl'ATATA 
CGAAATACAG 
CAGACTGTGG 
CGATTAAAAC 
TCCTCTTCAG 
CGGGGGTACC 
CGTCTGATAG 
TCGGCTTTCA 
GTGGACTACG 
CATTATTTCA 
TCGACACCTG 
GCAAAAAATG 
CTGGGTACCG 
AGTAATGAGT 
TTTGCCAAAA 
TATTATGTGG 
AACATACAGG 
TGGCCTAAAC 
TTCGGCGGTG 
ATGCCCAATA 
GGGAATATAG 
GTGCCTTTCT 
GTGAGTTl-CT 
TACTCTTATC 
GATGGAGGAC 
CATCCCATTG 
TATTCTTTCG 



GTACGATTCA 

CGGCCGTCAT 

AAACCAAACG 

AGATGGAGCA 

CTTTCCGTCC 

GCAGGAACAA 

TGAATATCGG 

ATGCAGAGCA 

ATACCGGT6A 

AGCAGCGCAG 

ATAACTACGG 

TTACTCCTCA 

ATGTGATCGA 

ATCCGATGAA 

TGTCCGATGA 

ACAGCGAAAT 

AGGGCGCGCC 

CCGGAGTAGG 

TGGATGCTTC 

TCATTCGAAA 

CGATTGTACT 

CCGATGTGGT 

ATACTCCGAC 

TGCCGATATA 

GCGATATGCT 

TTGTCGCCTC 

CCCCATTGGA 

GCTCGGCTGG 



TCACTCTGAG 
AATGGCTCTC 
CCCCGACACG 
TGCGGATCCG 
GGAATATAAC 
TCTTCCGAAT 
TATCGGCCAT 
GGAGAGACTG 
CTTCGACGGC 
GCCTTCCTTT 
ACGGGGAGCT 
GATGGACAAC 
TGCCAGGATC 
GGCTCTCACA 
TATTAAGCTC 
GATTCAAACG 
CACAATCGGA 
GATTTCTTCC 
GCTTAGTATC 
TGGTCTCGCC 
CCCTTCGCGC 
ACGTATGGAG 
TCTACCCTTA 
TGCCGACGGC 
CCGCTTTCTG 
CATGCAGCCC 
TGTCAGACTC 
CTCGGAAGCC 



TTGGAAATAA 
TCTGTATCAT 
CTGCGCAGGG 
CTTCCGGCTG 
AAGCGTACAT 
ATCCTGCCGA 
ACGCTAAACC 

AATCT'rrrcc 

GACAGAAAGG 
GTGCTTGCTA 
ACCACCAATG 
GGGACCCACA 
GACTATCGTT 
GAACATACGC 
GGTGTCGAAG 
GGCGTTCTGT 
TTTGTCGGAG 
CATTTCGGAG 
TTCCCTTCAT 
GATGTTATGC 
AATGCTTTGA 
GTTTATGGCG 
TATAATCCAT 
ACCCGCTGGC 
GTAGACGCAT 
GATCTTATAT 
CGGTATACAC 
TTGGGTATCG 



TGAACAGCAT 
CCGTTTGCGG 
AGCTTACTAT 
CATACAAGGC 
TCGGATTTGT 
CGGAAGGTCA 
AGCGAATGGA 
TCTCCTATCG 
ATAGACGAAT 
CCGGCTTGTA 
TGGGCAGCAT 
ACGTCCGTGT 
TCTTCCGTTC 
CTGAACTGAA 
TTCGTACGGG 
CCGAAACCGA 
ACTCGGACAA 
CCAAAGGGAG 
GGCGTGTGTA 
AAGAGGAGAT 
CCCCCCAATT 
ACTTCTCCAA 
CCGACTTGTA 
6CGCAGGTGG 
CCTATGGCAA 
TGAAGGCAGA 
AGCTGAACGG 
GTAATGTACA 



CATGAAATAT 
TCAAACCCCA 
CGTTAATGAC 
CATCGAACCT 
CCCTGAAGTT 
TATGAAGCAC 
TGCCGGCTAT 
TGGGATGAAA 
GATGGCAGGA 
TTATTCGAAC 
CCCTCAGCTA 
ATACTTGGGT 
TAITCCCTAT 
CGTGACGATG 
AGGATTGTTT 
CCGCAACCTG 
TATGCAATGG 
GTTGTTTTTC 
TGCGAAAGCC 
GCCCTACCTG 
AGGGGTGAAG 
GCTGACAG'3T 
TCAGTATAAT 
TAAGCTGGAA 
GTGGAATTT'3 
AGTAGGTGTT 
ACGGTATCGG 
TCTTCTTAGT 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1S20 
1 680 
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GCGGATGTTT CATACAAGCT GAAAAAGAAC TTGAGCCTTT ATCTCAAAAT CGATAATATG 
CTGGCGGAAA CGACAGAGCl' TATCGGTTAT TATCCTATGC AGCCGTTCCA TTGTTTCGCC 
GGTTTTAGCT GGACTTTC 



(2) IMFOR^tATIO^^ for SEQ id NO: 77 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1071 base pairs 
(D) TYPE: nucleic acid 

(C) STRATIDEDMESS: double 

(D) TOPOLOGY: circular 



(li) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GlhfGIVAMS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...1071 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 77 

AACTCCCATG CAACASTCTC CGTTATATGT TCAATGATGG AAAAATGTAT CTTTGCTCAC 
TATCCACATA ACCTTGTGTT CATGATTCGC AAGCATTTCG GTATCATTTT GGGATTTCTT 
TCTCTTGTGT TTTCGGCAGG TGCTCAACAA GAGAAGCAGG TGTTTCATTT TCTGAACCTT 
CCGGCTACTG CACAGGCTTl' GGCTGCCGGA GGCAAAGCTA TCACCATCGT AGACGACAAT 
CCCGGACTGG CTTTTGAGAA TCCGGCTCTG CTCGGATATG AATCCGGTGG CCGCGCCTTT 
CTTTCCTATT TATATTATAT GAGTGGTTCG CATATGGGCA ATGCCTGTTA TGCCTCGTCC 
GTCGQAGAGC GTGGCATGTG GGGTGTTGGC ATGCGTTTCC TGAACTACGG GTCTATGCAA 
GGATACGATC AGAATGCGAT TGCCACGGGC TCTTTTAGTG CTTCGGATAT AGCTGTACAA 
GGATTTTACA GCCATGAACT GAGCAACCAC TTCCGCGGTG GAGTCAGCCT AAAAGCATTG 
TATTCTTCTA TCGAGACGTA TAGTTCCTTT GGCCTTGGTG TGGATGTCGG TATCAGTTAT 
TACGACGATG ACAAAGGATA TTCCGCTTCC GCTCTGTTCA AGAACGTAGG GGCGCAACTG 
AAAGGCTATA ATGAAGAACG GGAACCGCTC GATTGGGATT TCCAGCTCGG CTTTTCCCGC 
AGTTl'TATCA ATGCTCCGTT TCGCTTGCAC ATCACGTTGT TCAATCTGAA TCCGCACTAT 
TTCAAGCGTC TTGTACCACG CGATCTGTCC AAGATGCAAA AGTTCCTCCG ACACTTCTCG 
ATAGGAGCAG AA'ITTACTCC TTCCGAGAGG TTTTGGGTCG GGCTGGGATA TACGCCACAn 
ATTGCACAGG ATTTCGAGGT GGAAGGCGGC AACAAATGGG GAGGTCTTTC GGCCGGCGTC 
GGTTTCACTT CAGGTGTAGT ACGTGTAGGC GTATCTGCTG CCACCTATCA TCCTGCAGCT 
CTTl'CGTTCA TGTGTTCGGT AGGTATCCGT TTGGACGATA AGAGCATCTT C 



(2} INFORMATION FOR SEQ ID HO: 7 8 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1011 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNE^SS : double 
(D) TOPOLOGY: circular 



(11) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(Iv) AHTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa^ONAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
(13) LOCATION 1, . . 1011 

(J^i) SEQUENCE DESCRIPTION: SEQ ID HO: 7 e 



CCTCAGCCCG TCGGCCTTAA AGAAATAACC 

GCTCCGATTT CATCCGAGTT GCTCGAGCAG 

ACAAACAAAG CCGGCAATGA GATCTATGTT 

AAAGAAGTAG GCCGACTGCG AGAAGAAGCC 



ATTAAACCCA TGTGCCTCGA ACCCATA^TT 
GAGCTGACTG CCGATCCTTT TCTGCGGATG 
TTTACGGCCG AAGAAGCTCC CCATTGCATG 
TTTCGGCATT ATGGCGGAGG TACTGGCAAG 
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GCGATCGATA 
GATCCGCAAA 
TTCGATACCC; 
TTTTTGCACG 
TACCAATCGA 
ATCGGACCAC 
TACAAAGACT 
TCCGATCCGG 
GACGAAGCCT 
GTGCGCAAGC 
GAGATGCGTG 
ATATTCATTG 
ATCCTCAGCC 



TAGACGAGTT 
ACAAGGCTAT 
ATGGCAAGCC 
ATTATCTCCC 
CACGGATGGG 
TC;7VCTGTAGT 
ATGATCGGCG 
AAGGCTTGGT 
TGTTCTCCTC 
TGUGTATCAA 
TTTTCGGCAC 
CTGTGGGTAA 
GGAACGAAAA 



CGACACCATG 
ACTCGGAGGC 
TTTGCTGGCA 
CTACACAGTC 
CACAAAGGCC 
CAATCCAGAG 
AGCTCGCAAT 
CAAGCCTATT 
ATCCGACTTT 
TATCXCTCCT 
TGCAGTGAAT 
GATCCTGGAA 
AAAAGGTCTC 



CCCGGGAGCT 
TACCGCTTTA 
ACGGCAGAGA 
GAATTGGGAC 
ATTTTTGTGC 
GCACTCrATT 
CTGATCCTGT 
CATCCCCTAC 
GACACCAATT 
CTCGTGAGTG 
GAGTCTTTCG 
GAGAAAAAAC 
GACAGTAGCA 



ACAAACAGCT 
TCTATGGGCG 
TGTTTCGCTT 
GTTCGTTCGT 
TGGACAATCr 
TCTATGGCAA 
ATTTTCTTCG 
CGATAGAGAT 
ACAAGACTCT 
CATATATAGC 
GAGAGGTGGA 
AACGGCACAT 
ATGGCCGATC 



GATCGTATGG 
GGACGTTGCl' 
CAGTGATGCT 
GTCGCTCCAG 
TTGGGACGGT 
GGTGACCATG 
CAAGCACTTC 
CAGTGCGGAG 
CAATATAGAA 
TTTGTCTCCG 
GGAAACCGGC 
AGAGAGCTTC 
A 



(2) n-fFORMATTOTJ FOR S£Q ID WO: 79 

(i) SBQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1698 base pairf 
(Q) TYPE: nucleic acid 

(C) SrnAllDEDNESS: double 

(D) TOPOLOO*: circular 

(ii) MOLECULE TYPE: DMA (genomic) 
(iiil HYPOTHFTICAL: UO 
tlv) AJJTl-SEMSE: WO 

(vi) ORIGINAL SOURCE; 

(A) ORGANISM; PORYPHYROHOHAS GIUGIVALIS 

(ix) FEATURE: 

<A) WAME/KEY: misc feature 

(B) LOCATION 1...1698 



(y.ij SEQUENCE DESCRIPTION: SEQ ID MO; 7 9 



TACCACGGAG 
AAAACCATTG 
GTGCACTTGT 
GTCCTGTCTC 
GGG-SCCGATC 
TCGGCCTATG 
CTCTCGCTCG 
GCTTCCTACT 
GCei'ACTATC 
CGGCTTGCCG 
TACAGGGGCG 
TTGGAGCTTT 
GCTGCGTATC 
GACAAATTCT 
TTCGGTATCT 
CGTAGGGGCG 
TCGTCCATCA 
CTGTCGGACT 
ATATACGAAG 
ATTCGCCGCT 
CGCACGGATA 
GAGACGTATC 
ATACCCTATG 
CGAATGGTGC 
CAGCTGGCCT 
CTGTACGTCT 
GGCGACCTGA 
CATATCCTGT 
AATATCTCCT 



CGAGACTGGT 
TAAGATACAG 
CTGTCGAGGC 
CCACCGAAAG 
TTTTCCCGTT 
GCGATATTGC 
AATCGGGTGG 
CACGTGGCAT 
CCTATTTGGT 
GCTACTATTC 
AAGTTGCTTA 
CTTGTGCTAC 
TCTATCATAG 
'i'CGTCAGCTA 
CCAGAATGAA 
ATGCCATCGG 
ATCTCTTTGC 
TCGATTTTTC 
ACTACAAGCC 
GGTATATGCT 
GAGGTTGTGC 
GCAAGCATGG 
ACCATGCCGG 
TGACGCATTC 
ATTTGCCCTA 
CGATTCCGAT 
TGAAAAGAAA 
CCGATCCGCA 
ACCTCTTC 



CTATACCCTA 
CCGCCTTCCG 
GCAGATGCTC 
GCAATACAGG 
GAGCGA'l'AAG 
GGGCGACTAT 
TCGCATCAGT 
GCACAAACGC 
GTCCGATTCG 
TTTTCGCGCC 
TCGGCTGACC 
CTCTTTGACG 
ACAACACCTC 
CGGTTTCGGT 
CTACGTCAAC 
TCTCGACTAC 
TTTGCTTTAC 
ATTTTCAGCC 
GGATGATAAT 
CAATGAGTTT 
CCTGAGAGTG 
ACATCATACC 
ATCCAAATTG 
GTATAAGATT 
TGCCTATCGT 
GCAGAATACC 
GGACGGTATA 
AGCCGAACGA 



TTTCGGAATA 
GTCGCTCTCl' 
AATACTCCTT 
GAGATTTGTG 
CTGCGCGATT 
CTTCCGTACA 
GTCCGTAACT 
ATCGGCTGGA 
ACCGGCGGAG 
GGCCGCTTGC 
GATCCGCG*1-A 
CTGCCTCGAG 
ACACAGTACA 
CAGGTGGATG 
GGATGGAAGC 
AGCGGCTACT 
AATCGCCTGC 
GACTATGCTC 
TATCATAtCT 
TCTGCCCAAG 
AGTCCCGGTA 
ATGAGCGGTVfV 
GATTTTGGAC 
CGTACCATCC 
AATAGAGAAG 
CACCGCCTGA 
GCCTATGGCA 
ACGTCCGGCC 



GAAACGACAT 
TCTTTTGCCT 
TCGAGCTGTC 
TGCAAACGAA 
CGGCCTATGT 
ACGGCAATAA 
ATGGCACATT 
ATGCTCTGCG 
ACTATCAn**!' 
CCCTCGGTAT 
C6ACCAA7AC 
AGAACAGGCT 
ACTGGCGTCC 
TCAGCAACAG 
TTAGCTCCCG 
TCCTCGATAC 
GACTCTATGG 
TGCGCCAAGG 
ACGACCTCCG 
CCCAAGCCTC 
GTGATTTCTA 
TGCTACGTCC 
TTTCGCTTTC 
AGAAAGAGCA 
GCGTGGAGGT 
TGACAGAGCT 
AAACGCCCGG 
ATACCATCGG 



TCACCCTATG 
TTTGGGAGCT 
GGATCAGATC 
AGAAAAAAGG 
TCGTTTCGGC 
CTACTCCTCG 
GCAGGGCAGT 
CAACGCCGAA 
CGAAGACTAT 
AGGCTTCrCA 
GACCGGTGCA 
ATCGCTTTC6 
CGGGCAGCAG 
CCCTATCTGG 
TCTGGATACC 
CGAaGAGAGG 
TAGCTGGCAT 
GATAGAGCGG 
TATCriGGCC 
CTACCGTATT 
CGGCTATGAT 
TTTTGCCGGT 
CGCTGCTTAT 
GCTCGACTAT 
GCGTTCCTCT 
GCGGTTGTAT 
TGTCATCTCA 
GGCTATCTGC 



300 
360 
420 
480 
540 
600 
660 
720 
790 
840 
900 
960 
1011 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
7flO 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1698 



70 (2) ItirORHATIOlJ FOR SEQ ID NO: 80 

(i) SEQUENCE CHARACTERISTICS: 

tAt LENGTH: 2457 base pairs 
(B) TYPt;: nucleic acid 
75 (C) STKAh^DEDNESS: double 
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ID) TOPOLOGY; circular 
(ii) MOLECULE TYPE; DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) WITI-SEMSE: HO 

(vij ORIGIUW. SOURCE: 

(A) ORGAHISH: f7ORYPHyR0H0HAS GIHGIVALIS 

{ ix ) rEATURK : 

tA) HAME/KEY: misc_featur« 

(B) LOCATION 1 . . .2457 

(xl) SEQUEHCE DESCRIPTION: SEQ ID NO:80 

AGGACGAACG TTTTCTTATC riTGTCCCAT AAAATTGGGA GAAGGGGTGC TTCCTGTAGC 60 

AATCGGAATG CGTGGATGGC TGAGAACAAA CCCTCATCAC CTGAACCGGA TAATACCGGC 120 

GTAGGAAACT CTCCGTCTGA CTATCTTCAC GGCGAAGCAA TCATTCCCCC TCTCTCTTCT laO 

TTGTCCAACT TCAATGATAA GAGATTTATG AAAATVXCTTC ACATCATTGC CGCCTTAGCC 240 

GTCCTGCCTT TCTGCCTGAC GGCACAAGCA CCCGTCTCCA ACAGCGAGAT AGATAGTCTT 30O 

AGCAATGTGC AGCTCCAGAC CGTACAGGTC GTAGCTACTC GCGCCACGGC GAAAACCCCT 360 

GTC6CTTACA CCAACGTTCG CAAGGCCGAA CTTTCCAAGT CCAATTATGG TCGTGACATC 4^10 

CCCTATCTGC TGATGCTGAC TCGCTCCGTG GTAGCCACCA GCGATGCCGG TACGGGTATC 480 

GGATATTCCG GCTTTCGCGT GCGTGGCACC GATGCCAATC GCATCAACAT AACTACCAAT 540 

GGAGTACCCC TCAACGACTC CGAATCTCAG TCCGTCTTTT GGGTGAATAT GCCCCACTTC 600 

GCCTCTTCCA TCGAA6ACCT TCAGGTGCAC CGAGGTGTGG Gl'ACTTCCAC CAATGGTGCC €60 

GGAGCTTTTG GGGCAAGTGT CAATATGCGT ACGGATAATT TGGGACTGGC TC;ci"rATGGC 720 

CGTGTCGATT TGAGCGGAGG TTCGTTCGGC ACATTCCGCC GATCGGTCAA ACTCGGTAGC 7 80 

CGACGCATCG GTCGCCATTG GGCAGTGGAT GCCCGCCTGT CCAAAATCGG TTCGGACGGC 84 0 

TACGTGGATA GAGGAAGCGT GGATCTGAAA TCCTATTTCG CACAGGTGGG Cl'ATTTCGGT 900 

AGCAACACGG CTCTCAGGTT CATCACTTTC GGAGGAAAAG AAGTTACGGG TATCGCATGG 960 

AACGGTCTTT CCAAGGAGGA TGAAGCCAAA TATGGCCGCC GATACAACAG TGCCGGTCTT 1020 

ATGTACGTGG ACGCGCAAGG AGTACCGCAC TACTACCACA ATACCGACAA TTACGAGCAG iO30 

t:CTCACTACC ATGCCATCAT GACGCACAGC TTCTCTCCTT CCGTTATCCT CAACCTCACG 114 0 

GCACACTACA CGGCCGGATA 7GGCTATACG GACGAATATC GTACCGGACG TAAACTAAAG 1200 

GAATATGCAC TGCAGCCCTA TGTGGAAAAC AGTGTGACCG TGAAGAAAAC GGATCTCATC 1260 

CGTCAGAAGT ATCTGGACAA TGACTTCGGA GGACTCATCG GTTCGCTTAA CTGGCACACC 1320 

GGTGCATGGG ATTTGGAGTT CGGGGCCTCC GGCAATATCT ATAAAGGAGA CCACTTCGGC 1380 

CGTATCACTT ACATCAAAAA GTACAATCAG CCCTTAGCTC CCGACTTCGA ATATTATCGG 14 40 

AACAGGGCAG ACAAAAGAGA AGGTGCA6CC TTTGCCAAAG CCAACTCGCA GATCACTCCG 1500 

GAACTGAACA TGTATGCCGA CCTCCAGTAT CGTACCATCG GCTACACGAT AAACGGCATC 1560 

ACGGACGAAT ATGATGAGGT ACAGGGAAGT ATGCAGCACA TCGATTTGGA CAAGACCTTC 1620 

CGCITCCTCA ATCCGAAGGC CGGTCTTACC TATAGTTTCG ACGATGCTCA TACTGCCTAT 1680 

GCTTCTGTTG CGGTAGCACA CCGCGAGCCT AACAGAACCA ATTACACCGA AGCCGGAATA 174 0 

GGACAGTATC CTACGCCTGA GCGACTGATC GACTATGAGC TGGGCTACCG CTATGCTTCG 1800 

CCCCTCTTGT CGGCCGGAGT AGGTCTCTAT TATATGCAAT ACAAGGACCA ACTCGTGCTG 1860 

GATGGCCGTT TGAGCGATGT GGGACAGATG CTCACAAGCA ACGTCCCCGA CAGCTACCGT 1920 

ATGGGACTGG AGCTGACTCT CGGTTGGCAG ATCCTTCCTC GTTTGCTGCG TTGGGATGCT 1980 

TCTTTCACTA TGAGTC6CAA CAAAATCGAC CGCTACGTAC AATATACATC CGTATATGAT 2040 

GCGGACTACA ACTGGCTCGA ACTCAAGGAG GAGACCCTCG AAAGCACGGA TATAGCCTAC 2100 

TCGCCCAATG TCATTGCCGG CAGCATGCTT ACCCTCTCTC ATGCCGGTTT CGAAATGGCT 2160 

TGGACGAGCC GeiTCGTCAG CAAGCAATAT CTGGACAATA CAGAGCGCAG CGATCGCATG 2220 

CTTTCCTCCT ATTGGGTGAA CGACCTCCGC CTCGGCTATG TGCTGCCGGT TCACTTCGTT 2280 

AAGAGAGTGG CACTGGGCGT ACAGCTCAAT AATCTCTTCA ACCTCATGTA TGCGTCCAAT 234 0 

GCCTACATCT ACGATGCCGG TTACGTACAG GCATCCGGAG AACTAAGTGC ATATGCCGAT 24 00 

CTGCGTTATT ATCCTCAGGC CGGATTTAAT GCACTGGGTA GTCTGACAAT CGATTTC 2457 



(2) IMFORIIATION FOR SEQ ID NO: 81 

U) SEQUEMCE CHARACTERISTICS: 

(AJ LENGTH: 1596 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI^DEDHESS : double 
{D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) A/fTI-SEWSE: NO 

(Vl) ORIGINAL SOURCE: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(A) ORGP&nSM: PORYPHYROf-tONAS GIMGIVALIS 
( Ix) FEATURE; 

(A) HAT^E/KEY: misc feature 

(B) LOCATION 1 . . .1596 

(xi> SEQUEHCE DESCRIPTION: SEQ ID NO: 81 



TACAGGCGAT 
ACTCGGTTAC 
AGAAGGTTTC 
CGGTTTCCGA 
CGTCCTTGGC 
CATTTCATCA 
CAAACAGGCT 
CACGGATCGC 
CCGTTTGCCT 
ATCAACGACC 
CTGTCGGACC 
GCTATCGCTC 
TCTGTCGGGA 
GCCGGCTTTC 
CTGAATCTGA 
TTCTTCCAAT 
AATCTGATCG 
GGAGGTCTCT 
GAGGAGGTAC 
ATCTTCCAGC 
CTGAATGTCG 
TATAACCTCG 
TGGAGCTGGC 
CCGCACCAGA 
CGCCTACTGG 
Gl'CGCTCGCC 
ACCGGCGACA 



CGGGAAGGGT 
AGCCGAGAAG 
TATCGCTGTT 
TGGTGCAGGG 
GCGCCATCGG 
TGAACGAGGA 
TTGGCTGGGA 
TCTATTTCAA 
TCTTTGGCAG 
TCTGTGCCAC 
TGCTCATCGA 
TGATCAATCC 
GTCGCAGCGG 
GCTTTTTGGC 
GATTCGACTA 
TCAAAGCCGG 
GAATCGTAAG 
TTCAGCACTT 
TCGTCACCCC 
ACCACGGAAA 
TCCCGATGGG 
GCAGCGGATT 
TCTTGGGAGT 
AAAATACCGA 
TGACGAGTTC 
GTTTCATCCG 
TACAGCTGCG 



TTGTCCTCGT 
AGCGGTTCGA 
ACTGCTGTAC 
AATCGAGTTG 
TAAAACGATA 
CTTTGCAGAC 
CAATGACAAG 
TGCAGCGAGG 
TCTCATGTGG 
CACCATAGGC 
CAATCGTACC 
GATGCGCTTT 
ACAGATATTT 
AGACAAGCGG 
CGGCGATCCA 
ATTGAGTTTC 
CGGATGCCAA 
CGACTACTAC 
ATACCGTATC 
ATTTCGACGA 
AGCCAGTCTG 
GAGCGGCAAG 
CGAAAGCTAT 
TGTCAGCTCT 
CGAGTTCGCA 
CAAAACAGCC 
TGTCGGATTT 



ATCCTCCGGA 
CTACGAGCCG 
ATACTCTCTT 
GATACCGATT 
GGCGTCAATC 
ATCAGTTGGC 
TTTGTCACCA 
TCGAACGGTT 
GAGCTGCTTA 
GGTATAGCTT 
ACAGGGTGGG 
CTCAACCGTC 
CAGTCTGTCC 
CATGCCCGAA 
TTCCGAAGCG 
TCCGAATCGC 
CTGCTCGCAC 
AATTCGGAAA 
TCGCAAGTGG 
CGTCCTCTGG 
TCGGATCACT 
CTATACCTTG 
CGGCTCTACA 
TTTATGGTGC 
TTTCATCCTG 
TATCAATTCT 
CACTTC 



ACAAACGTTC 
GCACCAGAAC 
CCATCAGCCT 
CGCTTTTCTC 
TGGCCGTATG 
AGACTATCAA 
ACCTCTTCGC 
TGAGCTTCAG 
TCGAAAACGA 
TGGGGGAGAT 
AACGTATGGG 
TGACAGCAGG 
CCATAAACAT 
CCGGTGCCAC 
AGACTTTCTC 
AACCTCTGCT 
ACGAACGAAC 
AACGAATAAG 
CAGCTCTGGG 
AGCTATATGC 
ACAACGTGGA 
GTGCTACGTA 
CATGGATCGG 
AGGGGGACGA 
GCCCCTGGCA 
ACCCTAACGT 



CTACGCAGAT 
AAAGATGAAA 
TTCTGCTCAG 
TCTGCCCAAG 
GGGCTTCGA'J' 
GAGCAATTTC 
ACATCCTTAT 
GCACTCTGCT 
GCCACCGAGT 
GGGGCACAGG 
GCGCGAGCTG 
AGAGGTGACT 
AGTCGTCGAT 
GGCTCTGACC 
TCCATACGAT 
GAGCCAGATC 
GGTTTTGGTG 
CAAAAATTCG 
AGGCGGTCTT 
CGAGACCTAC 
CAATCGGGAC 
CAATGATCTG 
GTATGAAGAG 
AAGCAAGGCG 
TGTAGCCATC 
ATCATTCGAT 



(2) IHFORMATIW rOR SEQ ID NO: 82 



(i) SEQUEI-JCE CHARACTERISTICS* 

(A) LENGTH: 900 base pairs 
(R) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DtiA (genomic) 
(ill) HYPOTHETICAL: NO 
(Iv) Af^TI-SENSE: NO 

(vi) ORIGIHAL SOURCE: 

(A» ORGANISM: PORYPHYROMONAS GINGIVALIS 
(Ix) FEATURE: 

(A) NAME/KEY: mlac feature 

(B) LOCATION 1. . , 900 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 82 



AAAAGAAGAA 
TTCACCCTTG 
ACTATAAAAG 
GGTGGAGAAA 
GATAAATGGA 
AACAAGTGGA 
GGCAAACATT 
GAGAAGCCCT 
TATCTCACTC 
TCTCCTACAA 
TTCGGGGTGA 
GCCAATATAA 
TATACGCACG 
AACAAGTTCC 
AACGATGGCC 



A^AAACAGAT 
CGAACGCACA 
GTGTGACCGG 
ACACGGTGGC 
GTTGGGACAA 
ACAAAAGTGT 
GGTACGGAAG 
CGGATCACTT 
TCGGTATTGG 
CGGGCAAGCT 
AAGTTGGTGA 
ATCTGATGGA 
ACTTTGGCAA 
TCACGGCTAC 
CGAAAATCCA 



GAAACGACTG 
AGAAGCAAAC 
ACTAAATGCC 
AGGTAACCTC 
CGGTTTGCGT 
AGACAAGATC 
TGCGCT'ITTC 
GACAGGAGTC 
TGCGGACTAT 
GACTGTAGTA 
AAAGACAATG 
(5AATGTCAAT 
CATTGACATC 
GATAGCCACC 
GTTCAAAGAA 



ATTGTTTTTC 
ACTGCATCTG 
TCTCAGACTT 
TATTTGAACA 
ACAGACTTCG 
GAACTCTTCA 
ACTTTCCTCT 
AAGCATATCT 
AAGCCCAATG 
GCAGACGACT 
TTCGAACTTG 
TTGATAACCA 
AATTGGGAGG 
AATCTTATCT 
GTTGTGGGCG 



TGGCAATGGG 
ACACTCCCAA 
CTCTGACCAA 
TAGATGCCAA 
GTCTGACCTA 
CGAAGGCCGG 
CACAGTATGC 
CTAATTTCTT 
AiGAAGTTCTC 
ACCTCTCAAG 
GTGCTTTGGT 
AGGCTTCATT 
CTATGCTGGC 
ACGACGATGA 
TGGGTGTTGC 



TGGCTTGCTG 
AAAGGACTGG 
Cl'GGGCTGCC 
CTACCTGAAA 
CACAACAGCC 
CTATGAGATC 
CAAAGGATAT 
CGCrCCTGCA 
TCTCTACCTC 
TTTGGGAGCC 
AGTGGGTTCG 
CTTCTCGGCT 
CATGAAGATC 
TGTGAAGATC 
GTACACTTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1596 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



W^i O fi^ tS' :!!iL ^ ^i™!- -H** -^i, ««h 



wo 99/29870 

57 /490 

(2) INFOWIATION FOR SEQ ID NO: 83 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 663 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOGY: circular 



10 

(ii) MOLECULE TYPE: DMA (genomic) 

Uil) HYPOTHETICAL: HO 

^5 {Iv) AIJTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A» ORGAIUSM: PORY PHYROMONAS GIMGIVALIS 
20 (ix) tliATURE: 

(A) NAME/KEY: misc featur« 

(B) LOCATION 1. . .663 



25 



50 



55 



60 



(xi) SEQUEIfCE DESCRIPTION: SEQ ID NOr83 

'^^^T. ^S^- 

CCTGCTCl"rA GACT^nflTfV' Taftr-T«Twn^„ **«ww\itj*^ 



GATGCTCCGG ATAAGACTTC TTTGCGTAAT CATGACTTCT TTGTGGGTCT CGGTGTTCGC 

40 (2 J IHFX5RMATION FOR SEQ ID NO: 84 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 744 base pairs 

TYPE: nucleic acid 
tC) STRAtJDEDNESS: doubl» 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL; NO 

(Iv) AHTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .744 



PCT/AU98/01023 



60 
120 
180 



CCTGCTCrrA GACTQGAT,^ T^A^^^ ^T^S 7^^^ 1^5^^ 
^^^T rii^^TS TGCAAATACC GGTGAATGCC 



300 
360 
420 
480 
540 
600 
660 
663 



(xi) SEQUErJCE DESCRIPTION: SEQ ID HO:a4 

l^^J^, ;j;--ACT rCCJTCOCTC eo 

65 AATCAGAGCA TGAAftAHftcia fr^^^^^^X^ ^li^«:GTAG ATGCCAACTT CGTAGGCAC^ 120 

75 sss sr°°" "'~«»"' 

744 
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(21 INFORMATION FOR SEQ ID NO:fl5 

5 (1) SFQUF.NCE CHARACTERISTICS- 

IS) TYPE: nucleic acid 
IC) S-TRANDEDNESS: double 
10 TOPOLOGY: circular 

(ii) HOLECL-LE TYPE: DMA (genomic) 

(111) HYPOTHETICAL: NO 

AHTI -SENSE: NO 



20 



30 



35 



60 



<vl) ORIGINAL SOURCE: 

(A) ORGAHISM: PORYPHYRa^OMAS GINGIVALIS 
(ix) FEATURE: 

JAJ NAHE/KEY: mi^c feature 

(B) rJXIATIOtJ 1. . . 633 



25 (XI J SEQUENCE description: SEQ ID NO: Ob 



(2) INFORMATION FOR SEQ 10 NO:86 

(1» SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 2859 base palr^ 

(B) TYPE: nucleic acid 
45 STRA^JDEDWESS: double 

tD) TOPOLOGY: circular 

aij MOLECULE TYPE: DNA (genomic) 
(ilij HYPOTHETICAL: NO 
(iv) Ai-m-SENSE: NO 
<vi) ORIGINAL SOURCE: 
55 ORGA>/ISH: PORYPHYROHONAS tilMGIVALIS 

(ix) FEATURE: 

(A) UmZ/KEY: mlsc feature 
(8) LOCATlOr/ 1. . .285 9 

lr.L) SEQUENCE DESCRIPTION: SEQ ID WO: 86 



PCT/AU98/01023 



g^^^ ^Tcl^?. ^--^-^ ,^-CCTTG TATTATTATA TGGAATTGCA eO 

TCCGATACGG AGAACATCCC GGGAGGAT^^ A^aS^SI ^GCAGGTCGG AGCGGGATAC lio 

GGGAGCTTTC TGGAAGTGGG GCTCTCcI^G T^nJ^^l TCTATTTGGG AAAGCGTATG ito 

GCAGACTCCT TTGCATCGAA OGaS^AC ^A^^^ CACGTCAAAC AGCCAACAAT 240 

AATGAGAAGT ggtcattctt cgat^^g^ ^tIJ^UJ^ AGGTAAATAT GTCTTCTCCG 3no 

GTCAATCCTC TCCATCTGTT TTGgS:^^ AGC^^^ GCTATATGAT CGTCGTCGGA 360 

GCCGGCCTGT CCAATAAGCA SaT^^? ATTTGTTTCT GGCAGTACAA 420 

AGTATCTACA CCAATTCGAA TACCTAC^C GAGACAAGGG AGCCAAAGTC 480 

CAAATTCATA AAAACGTGGG GGcS^cC S?^^^''^ CTACGAATAT S40 

CTTACGGCCA TGGCCACGCT CTCCActEaT ^ ^^^^ ACGACCACGG CAATAAGATG 600 



£cSJ??^ ^^J^r o^.rr.cc c<^;v.Tccr. so 

65 GTCGGCATGC TGCGCT^^S^ ^S^^^^^ S^??*^"^ AGCAGTCGGA 120 

OCAAAACCTA TTCCTGACAG TGCTTTTCGA ^tI?^^^ CCAAAGACAC GSTGCCTCTC 180 

ATGCGGCAAG ATAGCGTGTA TGaSI?^ nl^^n ?T'==°»"C CACCGGATCG 240 

GCTGCCGATT CCATCCTTTT GCTC^CA^ USS^S^ «AGATATAGT GGAGTACGAA 30O 

GTGAGCTATC AAAAGAGTCG CTTGG^GCA ATCITTTCCSG CAAGAGCTAT 360 

ACGGTTTATA CTCGCTATGT CCTOTA^aS ^^^Ul ^'^^''O^TAC CGACAGCAGT 420 

AGCAGTCGTT cSS^SS SwcS AC«^"" «^GTTTTC 4 80 

GGGATTATCA GCGGAGTCAT CACGCAGCAG ^S-IIS^SI *'='^CTTCCG CACGGAGAAG 5J0 

AAGAAGATGC CCGACAATAT CATg???^g SirJ^^SJ ATCTGACTGC CGGTAAGACC ioo 

CACGATCATC CTCACTTCTA MTCAATm ^^^^ CTTATACGAC CTGCGACAAT 660 

ATCGTCACAG GTCCGGTCAA TCTGCT?^ GcS^^S^ '^°°TGCATCC GGAGAAAGAC 720. 

75 .^.GCA.. „c=™ 5--XGC CG^AT AGCTC«CCT ,00 
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GAUGACAATC GCTATGGATT TTATTTGAGG 
TATATCGArr TGGCATTGCG TGGGGAGATC 
CAATCGAAAT ATAAGAAGAG GTATAAGTAC 
TCGAAGTCCG GCGACAAATA CGTGCCCGGA 
5 CGATGGACAC ACAGTCAGGA TCCGAAGGCC 
TVATTTTGCCA CC'GGGAGCTA TTTCCAGAAT 
CGTACTGCTA CGACACGAAG TTCGGCCGTG 
TTTTCGATTA CGGGTAGCAT GGATATCAGC 
ACCTTGCCGA ATCTTTCGAT TAATATGTCC 

10 GTAGGACCGG AGCGATGGTA CGAGAAGTTG 
AGTATCTTGA CAAAAGAGAA AGATTTGCTC 
GGTATGCX3TC ATTCCGTACC GATCAGTTTG 
ACl'ATGGGGG TTAACTACAA TGAGTGGTGG 
GACGATAAGA AAACATTCCT GCCTTCGGAC 

15 TACAGTCTGT CGGCAGGCl'T ATCTACCACA 
TTTTCCTTCG GAGGCAATCT CATTATGATC 
TCCTATATGC CGGACTTCAC GAAACGCCtSA 
GATCAGAACG GCAAGCTGCA TACGCTGCTC 
GCTCCCTCCA TGGGCAATGC AGGATCTGTC 

20 AAGATCAAAT CCAAATCGGA TTCGACAGGG 
ACATGGTCTA CATCCTATAA TATGTTl'GCC 
TCGCTGGCAC TTCGCCTCTC CAAGAGCTTT 
TATTTGACGA AGTATTATGA GGGAGAAGA7 
CTGCGCATTT TTAACGGCAA GGGATTGGCA 

25 TATACGCTCA ACAAAGAGTC GCTCAGCGGA 
CGGAGAGATG AAAAGAAAAA CACAGGGGCT 
ATACTTGAGG GAGGAAGACC GCAAAATGAA 
CAGGGCGGAG CAGTGGATCA GGATGGTTAC 
TTCGACTATA GTTGGAATAT TGCTACCGAC 

30 CACTACTACC GGGTAACGCA GAATCTGAGC 
TGGAGCTTCG GATTCAATGC GAACTACAAT 
TGCAACGTCA CTCGCGACAT GCACTGCTGG 
GCATACAAGT CCTATAATTT CGTCATATCG 
TATCAGCAGA GCAATCGTCC CATCACGAAT 

35 



AATGGTGGAT ATTATTTTGC CTTCAGCGAC 900 

TTTTCCAAAG GGTCATGGGG CATTTCAGCC 960 

AACGGCTCGT TCGAAGCCAA TTATCTGGTA 1020 

GACTACAGCA AGACCACCAG TCTGAATATC 1080 

AATCCTTTGC AAACGTTGTC GGCCAATGTC 1140 

TCGCTGAATA CCACCTATGA TGTCAATGCC 1200 

AGCTATTCGC GCAAGTTTCC GGGTACTCCT 1260 

CAGAACATGC GCGATACGAC GGTGAGCCTT 1320 

ACGCGTTATC CTTTCAAGCG GAAGACCCGT 13B0 

AGTGTGGGCr ATTCCGGTCA GCTTCGCAAT 144 0 

CAGAGCAATC TCGTGCGCGA TTGGAAGAAT 1500 

ACTGTCCCTT TGTTGGATTA TATCAATCTG 1560 

TACACGAAAG GCATACGCWA GTCGTGGAAT 1620 

ACGACCTATA AATTCCGCAG ACTGTACGAT 1600 

TTGTACGGTA TGTTCAAGCC TTGGAAACCT 174 0 

CGTCATCGCT TCACGCCCAC TGTCAGTTTC 1800 

TATGGCTTTT GGGAGCTTCT TGAGCATACG 1860 

TACrercCTT ATTTCiGAGCA GATATTCGGT 1920 

AATTTCTCTT TTGACAACAA CTTAGAGGCC 1980 

ATCAAGAAGA TCAGCCTGAT AGATCAGTTC 204 0 

GATTCGATCC GATGGAGCAA TATCTCGGCT 2100 

ACCTTGCGCT TGTCCGGTCT GTTCGATCCC 2160 

GGGAAGATCA TTCCCTATAA GAGCAACGAC 2220 

CGCCTGATCA GTACGGGTAC TTCTTTCAGC 2280 

TTGATAGCTC TTTTCAGTGG CAAAAAGGAG 234 0 

ACTCCTCATG AAGGAGACGA TGCTGCCGAT 2400 

AGTGGGGGGT CGCTCCTCGA GCGCAACCGT 24 60 

TTCGCATATT CGATCCCATG GAGCCTGTCC 2520 

TACAATAGGT ACAATGTCAA TAAGATGGAG 2580 

TTTCGCGGCA ATATCCAGCC TACACCGAAC 264 0 

TTCGACrTGA AGAAAATAAC ATCGCTTACC 2700 

GCT ATCTCGG . CCAG-ITTCAT CCCTATAGGA 27 60 

GTGAAGAGTT CACTCTTGCA GGATCTGAAG 2820 

ACTTGGTAT 2859 



(2) INFORMATIOl-J TOR SEQ ID NO: BV 



(i) SEQUEUCZ CHARACTERISTICS: 
40 (A) LENGTH: 3753 base pairs 

(B) TYPF: nucleic acid 
(01 STRANDEDNESS: double 
(D) TOPOLOGY: circular 

45 (ii) MOLECULE TYPE: Dt>IA (genoiRic) 



(iii) HYPOTHETICAL: NO 



(iv) AIITI -SENSE: NO 

50 

(vi) ORIGINAL SOURCE: 

(A) ORGAfJISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 
55 (A) NAME/KEY: misc_f eature 

(B) LOCATION 1...3753 



txi) SEQUENCE DESCRIPTIOiJ : SI 

60 AAATGGAAAT TGGCATGCGC TTTTGATTGC 
AACGAGGTAA TAATAATGAT GAAACGATAT 
TGCACGGTAT TTACCTTTCA AATAAAAGCT 
AAGCCTTGGA TTCAGAAACA TTCAATGGAT 
TTAATTCAAG CTGAAATTGT ATACCAATCT 

65 CCTGTGAACG AAATAAGGCC TGCAAATCGT 
GAAAATCTAC GGCCATCTCC CCCCGTAGTT 
GT-reCCAATC CAATGGATCC TGAAAATCCC 
ACTAAAGCGG TAACAGTACC TGTCGATGTG 
GGAGGGCAAA ACATTGCCAG ATT7WKCTCT 

70 AAAATGTTGC CTAAGGGGAC GGCTACAGAA 
CATGAGCCTC ATCGCTTATC TGATTTl'ACC 
CGGGCTTTGA CTCCTATTTG GGGAACACAT 
ATTATGGCCA C^TCTACTGC TGTGGATAAG 
ACGGAGCAGT ATCCTGTTAA AAATGTAACT 

75 GCGAATGATC CCATTGATTT GGTTATACAA 



:Q id NO: 87 

GCGTGTTGTT TCGATCCATT TGTAGTAACT 60 

ACAATAATTC TTGCAGTTTT TCTTTTATTC 120 

CGCCCTTATG AAAGATTTGC AGATGTAGAG 180 

TCTAAATTGG TGCCTGCAAA TAAGGGTAAC 240 

GTTTCTGAAC ATAGTGACTT AGTTATTTCA 300 

TTCCCrrCGC ATAGGAAGTC TTTTTTTGCA 360 

CCCGTTGCCG TCGACAAGTA TGCGGTACCG 4 20 

AATGCCTGGG ATGTGACGCT AAAAATCACT 4 80 

GTGATGGTTA TCGACCAGTC TTCGTCAATG 540 

GCCATTGCAT CGGGACAGCG TTTTGTGAAA 600 

GGGGTGCGTA TCGCTCTTGT GAGTTATGAC 660 

AAAGACACTG CTTTTCTCTG TCAAAAAATC 720 

ACCCAGGGGG GGCTTAAAAT GGCGAGAAAC 7 80 

CATATCATAT TGATGTCTGA CGGGTTAGCG 84 0 

ACTGCAGACT TCATTGGCAA AACTGGAAAT 900 

GGAGCAA'iTA ATTTCCCTAC AAATTATGTT 960 
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10 
15 
20 
25 
30 
35 

* 40 
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75 



TCCAACAATC CATCI'ACACC TCTTACCCCA AATTATCCAA CTCATTCTTC TAAAGTTGGA 1020 

CGGAGAAATC TGCCGGAATC CAAATTCGAT TATAGTAATC TGAGTGCAAG GATTACTTT'l' 1080 

GATGGTGTTG CTGGCGCATT GGTCTATGAA CCGAGGTTTC CTCATCCCTA TTATTATTAT 114 0 

TTCCCTTGTA ACGCTGCTAT CAATGAGGCT CAGTTTGCGA AAAACTCTGu TTATACAATC 1200 

CATACTATTG GCTATGACCT GGG AGATTTT OCCTTGGCCA ACAATTC6TT GAAACTAACC 1260 

GCTACAGACG AGAATCACTT CTTTACGGGG ACACCGGCCA ATTTACCTGC AGCGTTTGAT 1320 

AATATTGCCC AAACTATTAA TATAGGTATA CAGAGGGGGG AGGTGACGGA CTTTGTAGCr 1380 

CCT6GTTTCA TCGTTAAAAA TCTGACGCAA TCGGGAGATG TTACTCATTT GCTAAATGTT 14 40 

TCAAATGGAA CGGTGCACTA TGAT6TCTCT ACTAAAAAAC TGACATGGAC TACTGGTACT 1500 

ATCCTGAGCT CATCAGAAGC TACCATAACT TATCGTATTT ATCCCGATTT GGATTATATA 1560 

CAGAACAATG ATATTCCGGT AAATACTACT TCTGCTATCG GCCCGGATCT TGGTGGATTC 1620 

GATACCAATA CCGAGGCAAA ATTGACCTAT ACCAATTCCA ATGGCGAACC GAATCAGCAG 1680 

TTAATTTTCC CACGTCCGAC GGTTAAGTTA GGTTATGGTG TTATTAAGCG GCACTATGTA 1740 

TTGGTAAATA AAGACS3GTCA ACCCATACAG GCAAATGGAA CAGTTCTCAG Tl'CCCTAAGC 1800 

GAGGCTCATG TTCTACAGTC ACAAGATTTC T'lTTTGCCCT CAGGTGGAGG TCATATTGTT 1860 

CCCAAATGGA TAAAGTTGGA CAAAACGACC GAAGCATTAC AGTACTATTC CGTACCGCCG 1920 

ACTAACACGG TCATCACTAC TGCCGATGGT AAACGTTATC CTTTTGTCGA AGTCCCAGGC 1980 

TCCACGCCGA ATCCGGGCCA AATCGGTATC AGTTGGAAAA AACCGGCAGG AAACGCTTAC 204 0 

■rrCGCTTACA AGCTCCTCAA TTATTGGATG GGAGGAACAA CAGACCAACA GAGTGAATGG 2100 

GATGTGACGT CCAATTGGAC AGGAGCCCAA GTACCGCTCA CAGGAGAAGA TGTAGAGTTT 2160 

GCAACGACAG AAAATTTCGG TTCTCCGGCG GTAGCCGATT TGCATGTCCC GACAACCAAC 2220 

CCCAAAATTA TCGGTAACCT TATCAATAAT TCCGACAAGG ATTTACTTGT TACCACAAGC 2280 

AGTCAATTGA CGATCAACGG CGTGGTTGAG GATAACAATC CGAATGTCGG TACGATCGTC 234 0 

GTGAAGTCGT CGAAAGACAA TCCTACGGCG ACATTGCl'TT TTGCCAATCC GGGCTAT7VAT 24 00 

CAAAATGTAG GGGGGACCUT CGAGTTTTAC AATCAGGCAT ATGATTGTGC CGATTGTGGT 24 60 

ATC5TATCGCA GCAGCTGGCA GTATTTCGGT ATCCCTGTCA ATGAATCAGG TTTTCCAATT 2520 

AATGATGTGG GCGGAAACGA GACCGTCAAC CAATGGGTTG AGCCTTTCAA TGGCGATAAG 2580 

TGGCGGCCAG CACCTTATGC ACCTGATACA GAGCTTCAAA AATTCAAGGG CTACCAGATC 264 0 

ACGAATGACG TGCAGGCACA GCCTAC6GCA GTTTACAGCT TCAAGGGTAT GATTTGTGTG 27 00 

TGCGATGCCT TCCTGAATCl' GACACGCACG TCCGGTGTCA ACTACTCGGG CGCCAACTTG 2760 

ATC6GCAACT CATACACTGG AGCCATCGAC ATCAAGCAGG GTATTGTCTT CCCGCCGGAA 2820 

GTCGAGCAGA CGGTGTATCT GTTCAACACG GGW^CACGCG ACCAGTGGCG TAAGCTTAAT 2800 

GGAAGCACGG TITCAGGCTA TCGAGCCGGT CAGTACCTCT CTGTACCTAA GAATACAGCG 294 0 

GGTCAGGACA ATCTTCCGGA TCGTATTCCA TCGATGCATT CCTTCTrGGT GAAGATGCAG 3000 

AACOGAGCGT CTTGTACGTT GCWIATCTTG TACGATAAGC TGCTCAAGAA CACGACTGTA 30 60 

AACAACGGTA ATGGTACGCA GATCACATGG CGATCCGGCA ACTCCGGATC GGCGAATATQ 3120 

CCGTCACTTG 'fGATGGATGT TCTTGGTAAC GAGTCGGCCG ACCGTTTGTG GATCTTTACC 3180 

GATGGGGGTC TTTCTTTCGG ATTCGACAAC GGCTGGGATG GTCGCAAGCT GACTGAAAA^ 3240 

GGTTTGTCAC AACTTTATGC GATGTCTGAC ATCGGTAATG ATAAATTCCA GGTTGCAGGG 3300 

GTTCCGGAGT TGAATAACCT GCTGATCGGC TTCGATGCGG ATAAGGATGG TCAATACACG 3360 

TTGGAGTTTG CTCTTTCGGA TCATTTTGCG AAAGGGGCTG TTrACCTGCA CGATCTTCAG 34 20 

TCAGGAGCCA AACACCGTAT TACGAATTCT ACGTCGTATT CATTCGATGC CAAGCGGGGA 34 80 

GATTCCGGGG CTCGTTTCCG CTTGTCATAT GGATGTGATG AGAACGTAGA TGAITCGCAT 3540 

GTCGTGAGTA CAAATGGCCG TGAAATTATA ATTCTGAATC AAGATGCTCT TGACTGCACT 3600 

GTAACCTTAT TCAC?\ATAGA AGGTAAGCTT Cl'TCGCCGCT TGAAAGTATT AGCTGGTCAT 3660 

AGAGAAGTCA TGAAAGTGCA GACCGGAGGG GCCTATATTG TtiCATCTTCA AAATGCTTTC 3720 

ACTAATGATG TCCATAAGGT GCTTGTTGAG TAT 3753 



(2) IHFORHATIOW FOR SEQ ID HO: 88 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1278 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI/DEW>IESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DHA i genomic) 

(iii) HYPOTHETICAL: NO 

(ivj Mm-SZUSEi NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GTMG.TVALIS 

(ix) t-EATURE: 

(Al NAJ'IE/KEY: niisc fcature 
iB) LOCATIOH 1...127 8 

(si) SEQUENCE DESCRIPTION: SEQ lU NO: 88 

ACTATGGAAG TCAAGAAAAA CACAGTGGTG CTACGCCTTC TGATTTGGTT CGTGGCCATT 60 
CrrCTCTTCC ACTCCTCACG GCTGTGGGGA CAGGAAGGGG AGGGGAGTGC CCGATACAGA 120 
TTCAAAGGAT TCGTGGATAC CTACCATGCC GTACGCAGCT CTTCTCCTTT TGATTTCATG 180 



SlJBSTrrUTE SHEET (Rule 26) (RO/AU) 



■tP^i tl™ii tP^^' 



iv^i ^« w='S> 
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AGCTCGCtSl'A 
GTATCGGTCA 
GAAGCCTTCT 
ATTTGGGGGG 
GAGTTTCTGG 
GTCTTCAACG 
CTGCCTGTGG 
ATGAATATCG 
TACGGTGCGC 
ACATGGAACA 
AGCCCTCGCT 
TrrGTTTTCA 
GCCGAGCAAG 
GGTGAATGGA 
ATCTGCCAAA 
AGTACACTCC 
CGCTTTGCAG 
TTCAGTAGTA 
AAAGCCCGCT 



CGAGAGTGAG 
ATGCGACCTA 
TCGAGCATCA 
CTGCCGACGG 
CACAGGATTA 
AATCGATGAA 
ATCCTCGCAA 
TCTGGAAAGA 
GATGGAGCAC 
AGATGCCCGT 
ATTATCGTAt 
GGGGAGAGGC 
AGGGTTTCCA 
TGATCTCAGG 
GACAACATTC 
AACTTTCGGA 
CrGACTATGC 
AGGGCAGCGG 
ACAGCTTC 



AGGTGAGCTG 
CAATGCTCTA 
GGAAGAGCAT 
TGTGCGCATC 
CGATGATATT 
AGTGGAAGTC 
TCCTTGQAAT 
AGAAGCCGGC 
TACGCTCTCC 
GATCGAAGTA 
GGGATTTGTC 
TGCGTTCAAT 
AACAATCAAT 
ACAATTCTCA 
TACCXTTGATT 
CTTCACCTAC 
CTTGAACGAT 
TATATTCGAT 



GAGAGGTCGT 
CTGAAAGACG 
TGGGGGTTGC 
ACGGATCTGA 
CGTATGCCGG 
GTGGTACTGC 
ATCTTCTCCC 
AAACCGGCCT 
GGTATCGACT 
CAGGGCATTG 
GGCGGCGACC 
ATCGACAAAC 
TGGTTGGCCG 
ATGGAAAGCA 
ACTCTCAATG 
TACGACCTTA 
CAGATACATC 
CGCTACAAAG 



TCGGTAATTC 
AGACCGGCrr 
GCCTCGGACG 
TCTCCCCGAT 
TCAATGCATT 
CTGTATTCGA 
TTTCGCCCAT 
TCAAGGTTGC 
TCGCTTTGGC 
TGCCGACG6A 
TCTCCGTACC 
ACTTCACCTA 
GAGCCGATTG 
TATTCAGGTA 
TTTCCAAGAA 
03GGCAAAGG 
TGATGGCCGG 
ACAATTCCGA 



GAAAGTAGCr 
ACGTTTACGT 
ACAGATTGTC 
GGATATGACC 
GCGTTTCTCT 
GGGGTACCGT 
TGCTCAGGGG 
CAATATCGAG 
TGCATTGCAT 
AATCATCGTT 
CGTCGGACAG 
TAAGAGTCAT 
GTATGCTCCC 
TAGGGATTTC 
ATTCTTCGGC 
ATGGTTCAGT 
ATATGACTGG 
ACTCTGGTTC 



(21 INFORHATTOH FOR SEQ ID WO: 8 9 

ti) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1392 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOG-f: circular 

(11) MOLECULE TYPE: DHA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 



(vi) 



ORIGINAL SOURCE: 
(A) ORGANISM: PORYPHYRa^ONAS GINGIVALIS 



(ix) FEATURE; 

(A) NAME/KEY: mi sc_ feature 
(8) LOCATION 1. . .1392 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO:BS 



TACGGCAAAA 
TTTATTTTGG 
GGTGCAGCAG 
ATGTCCTCCT 
GAACGGAATC 
TTAGCTACfA 
TATGCCATCG 
ATACTCCTGA 
AAGAATCCGC 
TTGTCACCGT 
AAGAAGAAAT 
GAGGGAGAGC 
GCCTGCGAAA 
CGTAAGATGC 
TCAGAAGACA 
GGCGATGAAT 
CGCATAGATG 
GATGAGTTCG 
GGCGAGArrA 
AGTTATCTTT 
GAAAATGCTT 
ATCAA6CAGG 
GTGACCCAAA 
TGGGAGGTCG 



GAAGAAAGCT 
ACTTACACCT 
CCATAOTGGC 
GTGAGGTGGC 
ACTCTTCCGA 
TTCTGATCGG 
AGCAGACATT 
CCACTGTTCT 
TGCAATACTC 
TTTCAAAATT 
ACGATATGTC 
CGGAGGAGAA 
TCATGGTTCC 
TTGACTTCGT 
ATATCAAAGG 
TCGACTGGCA 
ATTTGCTCGA 
GTGGCACTTG 
CGGACGAGTA 
TCGAAGGAAA 
TCGGTGAATT 
AACTCCCCCA 
TGGACAAGCG 
AA 



CGGAACCTCC 
TATTACTGAT 
TTTCATTATC 
TTATTTTTCA 
CATCGCGCIT 
GAATAATGTG 
CGTTTTCTCT 
TTTGCTGTTC 
GCGCTTTTCT 
GCTGGTCAAA 
CGTGGATGAG 
AGAAATGATT 
GCGTATCGAT 
TGTTTCGTCG 
GGTGATTTAC 
TCCTCTGATT 
GGAGTTCAGA 
CGGACTGATC 
CGATGAGGAA 
AACGTCTCTC 
GGGGGACGAG 
TGTGGGCGAT 
CCGAATCATC 



GTCCGACCTT 
TTTTTTGAAG 
GACCTGCTTC 
CTAAAGCCGA 
TCCAATTTAT 
ATTAATGTAG 
TCTCCGATCA 
GGAGAGATTC 
GCTGCAGCTA 
AGTACCGGCA 
CTCTCQAAAG 
AACGAAATCA 
ATTGTGGATG 
GGTTATTCCA 
ATCAAAGATC 
CGTAAAGCAT 
GCCAATAAGG 
ACAATGGAGG 
GAACTCCCCT 
TCCGATGTTC 
GTAGATAOSC 
ACAGCAGTGT 
GAAATCAAGA 



CCGTTCTAAC 
GGCTTCGGGT 
TTCTTTGCTG 
TCGATCTGCA 
TAGACAATTC 
CCATC6TTAT 
TTGGATTTCT 
TGCCGAAAGT 
TGTCCGTTAT 
TCGTTACCAG 
CGGTAGCCCT 
TCAAATTC1*A 
TGGATCTGAG 
GACTTCCCGT 
TAATCCCACA 
ATTTTGTCCC 
TGCATGTCrC 
ACATATTGGA 
TTAAGGTTTT 
GACACTATCr 
TAAGTGGGCT 
ACGAGCCATT 
TTTTCCCTTT 



CCAA»kTCAGA 
CAATCCTATC 
TTCGGCTTTT 
GAACATCCGC 
GAATCAGCTA 
CCTTTCCAAT 
GATCCAGACG 
GTATGCGCGG 
CTATAAGATA 
AGGTATCAGC 
CACCACTACG 
TAATAAGACA 
CTGGCCATTT 
TTCAGAGGGG 
CATGGATAAA 
CGAAAACAAG 
CATCGTTGTG 
AGAGATCGTC 
GGGGGATGGC 
TGACCTTCCG 
CTTCTTGGAA 
CCGCTTTCAA 
CGAGCGCACT 



240 
3C0 
360 
420 
480 
540 
600 
660 
720 
7B0 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1278 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1392 



(2J INFORI^TION fOR SEQ ID NO: 90 

{i\ SEQUENCE CHARACTERISTICS: 

ih) LENGTH: 7 96 base pairs 
(Bl TYPE: nucleic acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(C) STRANDEDIJESS: double 

(D) TOPOLOGY; circular 

<il) MOLECULE TYPE: WJA (genomic) 

ail) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

lA) ORGANISM: PORYPHYROMOIJAS GINGIVALIS 

(ixJ FEATURE: 

(A) NAME/KEY: mlsc feature 
{Bl LOCATION 1. . .7 98 



PCT/AU98/01023 



60 
120 
180 
240 
300 



420 
480 
54 0 
600 
660 
720 
780 
798 



txi) SEQUENCE DESCRIPTION: SEQ ID NO* 90 

mmmmmm 

G^?C^A?A TATTGTACCT GTGCGGAAAG ATTACAACAC CGTTGATGTT 

35 (2) INFORMATION FOR SEQ ID NO: 91 

(i» SEQUENCE CHARACTERISTICS: 

tA) LENGTH: 2721 base pairs 

(B) TYPE: nucleic acid 

(C) STRA14DEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: MO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROflONAS GINGIVALIS 
(ix) FEATURE; 

(A) NAME/KEY: mlsc feature 

( B ) LOCATION 1 . . . 2721 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO- 91 
CACATATCCG GTAC^^ ^^^^ J^^fJc «^'==ATGTA 

^i^.^ i^^^ -t^j 

GGTATGAAAA CCCATACCAT gS^^A^ ^n^J^^^^"" ACCTGACAGC TTCGTTCGTA 

■■■■■■ . 

AAAGTCGGGC TGACTACCAA CT^^SJ^ rtl^^^^^ TTCTGACGGC AAAGTCCGGC 
i^^ni-iAOCAA CTATAACTAC TACGGTGGCA AAAACAAGGG CTCTCGCTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
€60 
720 
780 
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1080 
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TTTACCGAAC 
TTTGCCGGAC 
ACGGTGGGCG 
AGCTTTGCCG 
GCCGGATCAT 
CTCACCGTTT 
TGGAAGCGCG 
TCCGATGCGG 
GCACATTCTT 
TATGAGATAC 
CCGTCGAACG 
TATCX3TAAGG 
GCACTCTTTC 
CAGCrrCACGC 
CGAATCCAAC 
CAAGTACAGT 
TATAATCAAT 
AACGCCATCC 
TATGGCAATA 
GCCGTATGGG 
GCACTCGGCA 
CTGCCGAAGG 
CAGACGAAGT 
AAAAAATTGA 
AGCCGGACCA 
GTTTCCCTCA 
CGTACGATCG 
CAAGGAAATC 



GTACTACATC 
ACTTCGGCAA 
GCAATGTACG 
GCAGCAACCT 
ACGAGCTCAA 
CCTACCGCTT 
ATCCGCTCAA 
GCATGGACGA 
TGGAAGCAGG 
GACCATCCGA 
GAAAGTTCCG 
ATCAGTATTC 
CCGAAAACGC 
TCGGCTATAC 
GTCCTCCAAT 
ATGGTAATCC 
ACGGAGCCAA 
AGAATTACAC 
TCGGACGAGA 
TCAGGATTAT 
TTGATGTCAA 
ATTGGACTGT 
ATGATGGCAA 
GAGTCTCGCT 
TCGGCAATGG 
GCCTCACCTA 
TCAACGACGA 
CTACCGGCAA 



CTXTGCTCCAA 
TGCCCTCCTC 
CCTTTGGGAG 
CATGTCCTAC 
TGCCGACTAT 
CACTCACAAT 
CACAGCTAAT 
ACATACGGCA 
GCTGAAGTAC 
AGATGCTCCG 
CCACGATCAA 
TTTGCAAACC 
AGCAGCAGAT 
CCCCTCGCCC 
CGGCCAACTG 
CGACCTAAAG 
GGTCATGCTT 
CTTCTCCGAC 
GCATTCTTTC 
GCTCAACGGA 
TTCATGGTCX: 
GAATCTCTTC 
TGTATTCAAC 
GAGCGCAAAC 
ATTTACTATT 
CAGCTTCCGT 
CCTCAAGCAA 
T 



ACGATAGAAG 
TCATTCGAGA 
ATGACCACCG 
ATAGACAGAA 
CAGCACAGCA 
CCTAATAATA 
ACGATCCAGT 
CAAGTGGACT 
ATCTATCGTC 
TGGCAGCCCG 
TACATCGGAG 
GGCCTCCGAG 
TTCTCCCACA 
ATGAAGCAGC 
AATCCCTACC 
TCGGAGAAGC 
ACAGCATCGC 
CCGGCCAATC 
AGCTTGAATA 
AATATCGATC 
GGCATGGTAT 
GGAGGTTATT 
AATATCGGTA 
AACATTCATG 
TATTCGGAAA 
AAGATGAATA 
ACCTCATCCC 



AAGGCAAAGG 
TAGATTCGCT 
ACCGGAACAG 
AACTCAAAAC 
CTCGCCTGCC 
GCGAGACX^TT 
ACGCCGGCCA 
ATACACGTCC 
ATGCCACGAG 
GCTCTCTATA 
CAGCCTATGC 
TGGAAAGCAG 
ACTCGTTCGA 
TTAAGCTGGC 
GGCTACAGAC 
GTCACCACGT 
TCGACTACGA 
CCAATCTGTT 

cctatgcx:at 
gcacattcca 

ACTCAGGCCT 
ATCATGGGGG 
TAGCCAAACA 
CGAAGTATTC 
ATGCCGGTAT 
CACAAGTGCG 
AAGGACAGCA 



GCAAGAAACC 
CAATCTCTTT 
CGTAGAAAAA 
ACAGATGGAT 
GGGCGAATTG 
CATTGACCAA 
GCACTCCAAA 
CTTAGGACAA 
CGATCCTCTC 
TGCACAGAAT 
CGGCTACAAC 
CAGGCTGAAA 
CTGGGTGCCA 
CTATAACTl'C 
CAACGATTAT 
CGGTCTCTCC 
CTTCTGCAAC 
CCACCAGACC 
GTACACGCCG 
AAAGAGCGAA 
GATGTTCACC 
AAGAAGCTAC 
GCTTTTCGAC 
GACATGGAAG 
ACAACGGAGT 
CAAGGTAGAG 
GG6TGGCGGA 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2721 



(2) INFORMATION t'OR SEQ ID NO: 92 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 baac pairs 

(B) TYPE: nucleic acid 

(C) STRW>rDEDNESS: double 

(D) TOPOLOGY: circular 



40 



til) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



45 



(Iv) ANTI-SENSt: NO 



(vi) ORIGINAL SOURCE: 

(AJ ORGAIIISM: POR Y PHYROMONAS GIIIGIVALIS 



50 



(ix) FEATURE: 

(A) NAItE/KEY: misc_featura 

(B) LOCATION 1. . .1350 

(xil SEQUENCE DESCRIPTlOtJ: SEQ ID HO: 92 
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TGGTGCCAAT 
CAACAAGGGC 
TGCCTTGGGC 
TTGATTGTGA 
CGACTGGAGA 
TTCTTTCGGA 
CGGGTTTCCG 
GGAGTATCTA 
GATTTACCTT 
CAGGCCATGC 
ACGGCTCACG 
GAAGCC6CTG 
TGGATGAGCA 
ACTACTTCGA 
ATTGCCACGC 
GTCAATGGAG 
ATTTGGGGAT 
AGAGGGTGGG 
GACTATTGGT 
ACGTATGACA 
TATTCGTGGC 
CCTTCGGATC 



CCGACCCAAT 
GATCAGACCA 
GGTATGAAGG 
TGCTGCCTGC 
CGGACAGCAT 
ACAATGAATA 
CTTTTGCCTC 
CCCTGAACTA 
ATTGGACGGIA 
TGAAGCCGAC 
GACTGATCGA 
TGCAATTTCG 
TGATTTTCAA 
AATTGTTATC 
ATCGCGGCGG 
CTGTCGGACT 
CTGCTTATGG 
GCGGTTATCT 
ACGGCAGGCA 
AACAGCCTCT 
GGATGGCACG 
GTTTTGCGAT 



ACCTCATTGC 
AGACCGTTTA 
ATTTGGTGAA 
TGTGCTTAGC 
TCGTCCGAAC 
CAATGCACGT 
TTACTCGCTG 
CTGGGGGGCA 
CTATAACGAC 
GGCCACGACT 
ACCGATCTAC 
GGGTGATTGG 
AAATGACAAT 
GGGTGAAGGC 
GGAATACAAC 
TAAGCTTTCG 
TGTGGCAGCC 
TTCTCTCGGA 
TTACGTTTCT 
TACGAACGGT 
AAGTGTTTCG 
GAGCCACGCC 



CATGCAGTAT 
CCTCCCGTTC 
AGGATGAGAC 
GGGCAGCATT 
QAACTCTCGG 
TCGGTCAAAG 
CCGGCAGCAC 
AGTCGCTATC 
TATGTACGCT 
GerCTCATGC 
AATCCrGAGT 
ACACGTTTCC 
CATCAGGAGT 
AAATGGCGAC 
TGGGCGCAGC 
TATCGCCCTC 
TTGTCAAGCG 
ATGGACTTGG 
CCCTTTGCTG 
TGGGGCGATT 
TTGGCGGCTG 
TTGGAACTGA 



ATCGAAACGC 
GAGGCTACCG 
TCTCTGCCAT 
ATTATTCCAT 
CATCGATCCG 
GTTATACGTT 
ATGGTGTGAA 
CGGCCGGTAT 
TGCGTATCCT 
TGGGCAATAT 
TGGATTTGAC 
GAATGGATGT 
CGTTTGTCTT 
TCGAACTGCC 
AGGATACCGT 
GTACCGACAA 
GAGGATACTT 
AGCACTTCGC 
CACCTTTCGC 
ATATTCGTCT 
TTGCTCGGGT 
CGATGCGTAT 



TCAAGGACAT 
GTATGCTCGG 
TCTTATCGCT 
GGCGGGAGAG 
AAGTGCGCTT 
GCCGGGTGCA 
GCTTTCGCTC 
CGCTTATTCC 
GCCTTATGTA 
AGCCGGTGCT 
GGCTGATCCT 
TTGGGTCAAT 
TGGCTTGTCC 
CTTGCAGGCT 
GCATACATGG 
ACCCATGCAG 
CCCTTACGAA 
TTTTCGTACC 
CAATTCCCTG 
CTATGCCGAC 
ATGGTTCCAG 
CGATCCCAAA 



60 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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TTCCCAATAG CTrTTCTGAJ\ AGGCAATCAT 



1350 



5 
10 
15 
20 
25 
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(21 IMFORHATIOM TOR S£Q ID WO: 33 

ii) SEQUEJJCE CHARACTERISTICS: 

<A) LEHGTH: 1341 base pairs 
(B> TYPE: nucleic acid 

(C) STRAfJDF.nNESS: double 

(D) TOPOLOGY: circular 

(ii) HOLECULE TYPE: DI4A (genomic) 

(iii) h:^pdthetical: no 

iiv) AJITr-SBNSE: HO 

(vil ORIGINAL SOURCE: 

(A) ORGAiilSH: PORY PHYROt-lOHAS GIMGIVALIS 

(ix) FEATURE: 

(A) HAME/KEY: misc^f eature 
{B} LOCATIOM 1. . .1341 

(xi) SEQUENCE- DEaCRIPTIOlI: SEQ ID f/0:93 

CCGTTTTCAC CGCCATGGGT ATGTCCTGGA GCGGCTTCCT CAGTACGCAC ACGGCCATGC 60 

TCGACAGTCT CAAGTACCGC CGCCTCATCA CACAG<iCGAT CGGAGCACAT GCGATCGGCG 120 

GACTCGTCGC CGGCATATTC GCCCATTGGC TCTTTGTCCT CATCTCACTC ATTTGATTCG 180 

ATACCCGACC AACGTATGAA CAAATCGCl'A TTATCATTGG CATCCCTCAT CCTGTGCGGT 240 

ATGCCGGCCA TCGCCCAACA GACAGGACCG GCCGAACGCA GCGGCGAGCC TTCTCrCGCC 300 

GAACCTGTAT TCGGTCTGGA GCAGAAGCAG AAAAAGej'GA . AGGTGTACTT AGGCATACAG 360 

TCGTTCTACG ACCAGCCGCT TGTCGATGAC GAATCCCATA TCGGACACTT CAAGGTACAG 420 

GAGCTGCGGA TGTCTGCTCA TGGCGAACIG AACCGCCACC TCAGCTTCGA CTGGCGACAA 4 BO 

CGTCTCAACC GTGCCGCCGA CGGCACTTCG TTTGCCGACA ATCTCTCCAA TGCCATCGAC 540 

ATCGCAGGTG TGGACTGGCA CCCGAACGAC AAGGTGTCTT TCTTCTTCGG ACGTCAGTAC 600 

GCGCGTTTCG GAGGGATAGA ATACGACATG AACCCCGTAG AGATCTACCA GTACAGCGAC 660 

CTTGTGGATT ACATGACCTG CTATACTTCG GGCGTGAACT TCGCATGGAA CTTCCACCCC 720 

GAACAGCAGC TGCAGCTACA GGTACTCAAT GCTTACAACA ACCGCTTCGC CGACCGCTAC 7 80 

CACGTGACAC CCGATGTCGC TACCGCCACG AGCTACCCGC TCCTCTACTC GGCACACTGG 840 

AACGGTACCC TCCTCGGAGG AGCACTGCAT ATGCGTTACG CCGTGTCGAT GGCTCATCAG 900 

GCCCAAGAGC GTAATATGTG GTACTTCACT GCGGGCAACC TGTTCAATCC GGGCAAACGG 960 

ATCAACGGAT ACCTCGACCT CACCTACTCG ATCGAGGGAT TGGACGACAA AGGCATTATG 1020 

ACTGCTCCCT ACGGCAAGGG CAAGACCCTC ACGGACGTCA AGTACTATGC TCTGGTATCG 1080 

AAGTGGAACT TCCGCATTTT CGATCAGGTC AATCTCTTCC TCAAAGGCAT GTACGAGAAC 1140 

GGCTATGCGC CTGCCCAATA CGGCGAGAGC AGCCACACGC GCCACTCCTA CGGCTATATG 1200 

GGAG'jGGTGG AATATTACCC TACGGAGACC AACTTCCGTC TGTTCCrrCJVC CTACATAGGA 1260 

CGGCATTACC GGTACAGTGC GACCGAGACG GAAAGCACCA ATGCTCTTCG CGCCGGTCTG 1320 

ATCTATCAGA TACCTTTCTT A 1341 



(21 IHTORMATION FOR SEQ ID NO: 94 

(1) SEQtfENCE CHARACTERISTICS: 

(A) LEHGTH: 681 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI4DEDNESS : double 
rO) TOPOLOGY: circular 

(ii) I10LECULE TYPE: DHA (genottiic) 

(iii) HYPOTHETICAL: HO 

(iv) Al>»TI-SEiJSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGA/VISH: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) HAl'lE/KEY; misc_featurft 
!B) LOCATIO»J 1 . - . 681 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94 

TATAAGCAAG CTATAATGAA ACATTTGTTT AAGTCGACAT TAGTACTTCT TTGTGCTCTT 60 



SUBSnrUTE sheet (Rule 26) (RO/AU) 



in? ^^t^ tF^-ii ^p=i, ii=ii, 

iUli >O^Bk; o o u, sJ fa a2 o IJ O 



^^^^^'^^^^ PCr/AU9«/01023 
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i^i-^^^ fo'^^i^ .cc^x„«.T 

SS?^^ ^-I^- ^--i ---^ 

AATAGAATAG CCATTCCTAT ACGCCCTATC AAAAATTTTA AT^^^ CTCTA^^aI 
GTCGCAATGG CTT6SATGAG TCGTCATGAG CAAATTTACA ATT^^S? SmaS^^ 

^^^r. t^^^t S^-S-?-? 



120 
180 
240 
300 
360 

681 



(2) INFOW^tATIOM FOR SEQ ID MO: 95 

(i) SEQUEUCF. CHARACTERISTICS - 

(A) LENGTH: 1218 base pairs 
ib) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(H) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(Hi) M*x POTHETiCAL: UO 

(iv) AI(TI-SBNSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOtrAS GINGTVALTS 
(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATlOtJ 1 . . .1218 

(Xi) SEQUENCE DESCRIPTION: SF.Q ID NO: 95 

a5^??G^ ^1^^ CAAAACGAAC GAAATATGAT TATCAAGAAA 60 

s^s^ ssSE? "^^"^^ -o 

TTTGCTGTCG A^^^^CT^ ^^CGC^ ^^C^^ ACT^CTC^ I™?^ 

CTGCGTTACA Trr*i^r-Pf az-jv /-vtit,*™- * i ti n-L»<3rt<jA ACTTCTCCAT GItCTGTTGCA 54 0 

GCCTWG^ C^S?AGC CACAACACCG GAGAGAATCA GGCCGGAAAT 600 

AGCTTG?^^; C^^??T -^C^IIIg ^SJS^^I^ TGCTACTGGG TAATGCGGAG 660 

GGTGTCACGA GTTtSScAT Wr^S^r ??r^IS5^? S^'^TCTC CTATGACSGA 720 

WAGiATTCC ^^TO?? It^J^^ TCTTTCGGTG ATGCGCCGGG AGGACTCAAG 960 
TrrGTTOlTG C^?J??c M^CTgSc r^^S^SJ!^' ATAGCTATGA CGATAAATTT 1020 
GCCGGTGCCG GCT^CA^S? S^JS^aS^C ^SaS^p '^JT^'^ GTACTTCACG 1080 

^l^^ "oS^cg^I ?a^^^-^ 

1218 



(2) INFORMATION FOR SEQ ID NO: 96 

(il SPQUENCE CHARACI'ERISTICS: 

(A) LENGTH: 1356 base pairs 
(BJ TYPE: nucleic acid 
iC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

111) MOLECLrLE TYPE: DNA (genomic) 

(111) HYPOTHETICAL: HO 



(iv) AJiTl-SEMSE: NO 



(vl) ORIGINAL SOURCE: 

(A) ORGAJIISH: PORYPHYRa^QNAS GTNGIVALIS 
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( i.x ) FEATURE : 

(A) NAME/KEY: mi 5c_f eature 

(B) LOCATION 1...1356 

(xi) SEQUEHCE DESCRIPTION: SEQ ID NO: 96 



10 



15 



20 



25 



30 



CTAATAATCC5 
ATGATGTCAG 
TTTATCAGTA 
GGAACAATCT 
CAATTGCCTA 
TATTTTGATG 
TGTACGACTT 
AGGTGTATAT 
TCGACAGGGG 
TTCGA3TTCA 
ACGAATAGTG 
CTTACTTATC 
ATTGCATCCA 
CGATTTATGA 
GGCGGAGATA 
GCTTGTGAAG 
GACGCATCTG 
GACACTTTCT 
CTCAGGAGTA 
GATGTCTATA 
TTGOITTTCG 
TCGGACATAG 
TCTTCCGGAC 



AAAAGGAAAT 
GTGTATTGGG 
ATGAGCCTAA 
ATGTAACCGT 
TGTACTATCG 
ACAAGATAGT 
CTTCTCGAGA 
TCAAAACTTC 
TGACTTATAT 
ACACTATCTA 
GCACCATCTA 
AAAACGGTGA 
TGTGCTCTAA 
GTGGAACCCT 
TACAGAATGC 
TCTTCCGGAC 
TCTTTGATTT 
TCfATAGCCC 
AAGAGTCTTi- 
CTTTGGCCGG 
ACCTGGCTGG 
ATACTAACGG 
AGGTATTCAG 



GAAAACAACA 
CGGAAACGCA 
CGTCAAGTAT 
GACAAAAAGG 
ACCATTAGGT 
TGCGGATATT 
TCAAGAGTAT 
TGTGTCTACA 
AGGTACCAGG 
TGAAGACTCT 
TCTGGCATTG 
GTTCGTCGAT 
TAAAGAAGGT 
TGCGATCACA 
GATCGTTCAA 
TCCTAACGGA 
TGAGTGGTAT 
TTGGGACAAA 
TATAACCTCT 
GAAGATAAGG 
CAGGATGGTA 
ACTAAAGCGA 
TCATAAGGTG 



GTTCAACAAA 
CAGAGCTTTT 
ATAATTCCCA 
ATTCAGCAGG 
GATAATGAAC 
CAGACAGATG 
CAACTTTATA 
TATGAGCATG 
CATCACATCT 
ACACCTATGA 
ATGCATGAAA 
ATCTCGGAAA 
GATATAATAG 
AAAGCAGATG 
AATATATGCA 
GTAGATGGTC 
GAAGATGAAA 
CTTTCT'ITATG 
TTCATTTCTC 
ATCGAAAGTG 
CTTCGGCAAA 
AGCGGTATTT 
CAGGTA 



TTATTCTGTG 
GGGAAGAAAT 
ATATGGGGAT 
GAGCAAATTA 
AGTGGTGGAA 
CATATGGCAG 
TAAACGAGCA 
GTATGGCTGT 
TCGCATCAGG 
GCTGTCGCTT 
CCACAATGTC 
GTGAATTGAG 
CTCTTGTTAC 
AAGGCAAATG 
TGATGGACGA 
GGACAAGGGT 
TATACGGAGG 
CGAAATTTG-j 
CGACAGTTGT 
AAACTCCGGT 
CCATTGATAA 
ACGTAGTCTC 



CCTGGCTTTA 
AGCTCCTCCT 
TGATTCAAAG 
TACrrCTGAG 
ACATGATCCG 
AGTTTATGTA 
GAACGAATGG 
TTTTCGCTCT 
TGTAAATGAT 
TGCAGAGGCT 
TACGACTATC 
TAACTCGATT 
TTCATATACA 
GCAACTTGTT 
CAACAAGATT 
TTGTGTTTCT 
CCTGATATTT 
TGGGATTATG 
ACAAGGAGTG 
CrCTGAGGTG 
TAAAATCTAT 
GGTGCGGCTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
120 
ICO 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1356 



35 



40 



45 



(2) INFORItATION FOR SEQ ID NO: 9? 

lit SEQUEMCE CHARACTERXSTICS: 

(AJ LENGTH: 993 base pairs 

(B) TYPE; nucleic acid 

(C) STRAHDEDHESS: double 

(D) TOPOLOGY: circular 

(M) MOLECULE TYPE: DNA (genomic) 
{ill) HYPOTHETICAL: NO 
(Iv) Afrri-SEhfSE: NO 



50 



55 



60 



65 



70 



(Vi) ORIGINAL SOURCE: 

(A) ORGAWISI-4: PORVPHYRa-lOriAS GIUQT/ALIS 

(Ix) FEATURE: 

(A) NAf-)E/KEY: misc_feature 
(8) IX>CATION 1.-.993 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97 



GGCCTGTACG 
GCAAGACTCA 
ATATTCGTGG 
GTTCAGCCAG 
CCCGTCCTGC 
CTTTCCGATG 
GAACTCAACT 
GCTGCCGGTG 
ATCGGTCTTG 
GCCGACTTCT 
TTCGGCTTTA 
TTGAGTTTTA 
ACGTTCTCGC 
TTTGGCAGCT 
CATGGArrCG 
TTTGCCGTCG 
TTTGCTTATC 



GTATGTCTGT 
TGATAATCCG 
TAGGTCTTTT 
TGCCGAATAG 
CCGCTTCTTT 
CTTTCCTCAA 
TCGCGGCAGA 
TCCrn^CATG 
GCCATGATTT 
TGCTGAGTCA 
TACCTATGAC 
CCACCGGTAT 
TTAAAGGAAG 
ATCCTCTCTA 
GCCCTCAGTA 
GTATGGAGCG 
CTGTATTCTA 



TGTACCTATA 
GTC3TCTTATC 
CTCTGCGATG 
CAGCATGGTG 
TCAGAATACC 
TGGATTGAAG 
GCGTCGGGAT 
GCGACCGACC 
ATTGACCGGT 
AAATCTTACG 
GGCTGTCAAT 
CGATTATCGA 
TGCTCGATAC 
CAGCAGTACG 
CGGTGGATCG 
CGAATACAAT 
TGGCGATAAG 



ATAATTTATC 
CGTCGTCCGA 
GCGCAAGAGA 
GAGCAGACCC 
CGTACACTGA 
CCTCATCGCT 
TTCGTTTCTC 
GATAGGATGC 
GTGCGCAAGG 
GCACATGTCC 
GGCCAACTGC 
CAGGTACAGT 
GAAGTGATGG 
CGCTCAGGAC 
CTTGAGCTGA 
ATCTGGACTC 
AAG 



TTTGCGGTAT 
GAACCGTCCT 
AAAAGGATAG 
CTCTTCTCTC 
AAAGGTTTAG 
CATCTTTGCA 
CCCTCTTGCA 
ATTTTTATAC 
ACTTCGGATG 
AAGGCGGTTG 
GTTGGCAAGC 
GGAATGCTTT 
ACAATGTCTT 
TCAATATGGC 
AAGTCTCCGA 
G7CGGTGGGA 



ATCAAATTAT 
GTl'CGGGTTG 
TCTCTCTACG 
CATTGATCAC 
AGACAAACAT 
ATTGAATGAG 
T^CTCGCCAC 
ATCGGGCAAT 
GAATGCTGGT 
GCAGCAGAAT 
CACCGAGAGA 
CGATAATAGA 
TGTCAATGGA 
TGTl'CCGATG 
GCGATTCGGC 
AACGCATTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
"720 
780 
840 
900 
960 
993 



75 



(21 INFORHATION FOR SEQ ID NO: 98 
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20 
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50 



55 



60 



65 
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fi) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 98 7 base pairs 
<B) TYPR: nucleic acid 
5 (C) STRAfJDEDNF.SS: double 

(Dl TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

10 (ill) HYPOTHETICAL: WO 

(iv) AHTI-SENSE: NO 

(vi) ORIGIWAL SOURCE: 
^5 (A) ORGANISII: PORYPHYRWOMAS GIHGIVALIS 

tix) FEATURE: 

(A) MAf>IE/KEY: ml.'5c_f eatur- 

(B) LOCATION 1 ... 987 



PCT/AU98/01023 



(Ml) SEQUENCE DESCRIPTION: SEQ ID HO: 98 



C^^,ACGAACT CTTGGGTATC CAGCGATTGC AATTCGACGA CGATGAAAAC GAATAGACGA 
TACCCATTTG TTTTGCCGCl- TCTGCTACTC ACCGGATTGT TGGC^TGGGG G^^GA^T^ 
25 TCCCACGGTA GCAATACAGC CTTTGCAACT GATTCTTCGA GTAGAGACTT g5^2^^ 
CAGTCC.3CCT ACCGCATTCA TTCTGCCTAT ATGGTCGGTG GTGgSgSg CA^^Sc 

GAc^yi-A-i-T tgtcacccct tcgttatgga ggatggacac tc^a^g?? ^^G^^ 
acgttccctc tcaaagcctc cgattcccgt tggatgatcc gtaccg^gca ?^gctgS^t 
tttgccctga tggacaatcc ggccaataat gctcatttct attccctgct gtataac^? 

crcTTTACcG ccttggcgct aagcatSg^ gA^S-% ^$gS? 
ctgcgcttgg cattcggccc gggcttggaa atcgggcttg gaggaattta tag?a^c 

^^^^'"'^ ^"TCCTGCGAC ATTGAAGCTC TACACCAATG CCATCGCCCA A^ctSSa 

ggatactacg tcccctccga aacttttccc ctgtattttc ggttgctctc ccagatSJ? 

CTCTTCGGTA TAGCCTATGG AAATGGTTTT GGTGAGAGCT ATTAOGAGAA mmOT? 720 
35 AATAACGGCA TTGCAGGCTC CCTGCATTTC ACTTATCCGG GCAAGTtS? ?SS?^G 780 

acactcataa cggcggatat tcccattcgg aacttctgta cgcttcctg? c^??J^5^c a!n 

???^5cgS ?c^™ TTGGATACTC G^^^^^ tS^ 900 

_ SSg^^ ^????A^ TTCCGTGGGC GCAAAGCCAT GAATACCGGT 960 

40 987 



(2) INFORMATION FOR SEQ ID NO: 99 

(i) SEQUEHCE CHARACTERISTICS: 

<AJ LENGTH: 957 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) J-IOLECULE TYPE: Dtih (genomic) 

{iii) HYPOTHETICAL: HO 

(iv) ATiTI -SENSE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

( ix ) FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATIOtJ 1. . . 957 



60 
120 
180 
24 0 
300 
360 
420 
480 
540 
600 
660 



<xi) SEQUF^^CE DESCRIPTION: SEQ ID NO: 99 

f^™?*" CGTCCGTATA GCCATATCCG GCCGTGTAGT GTGCCGTGAG 

^^^^^^ AGAAGCTGTG CGTCATGATG GCATGGTAGT GACGCTGCTC 120 

^^S^?^ Tat^^^ r^t^l^n"" T^^^^C GCGTCCACGT ACATAAGACC iso 

I^JSSSSSSS S^^'^TTTGGC TTCATCCTCC TTGGAAAGAC CGTTCCATGC 240 

7n ^liSSi?^^ acttcttttc ctccgaaagt gatgaacctg agagccgtgt tgctaccga; 
70 atagcccacc tgtgcgaaat aggatttcag atccacgctt cctctatcta cct^^c^ 
cgaaccgatt ttggacaggc gggcatccac tgcccaatgg cgacSatcc ^C^^C 

GAGTTTGACC GATCGGCGGA ATGTGCCGAA CGAACCTCCG CTCAMTCGA ScG^TA 
Sg^ r5™r^? CCGTACGCAT ATTGACA^^^ ^C^G SScc 
75 gSa??2 AC^^^^a ^^^t^'^'i^'' CTGAAGGTCT TCGATGGAAG AGGCGAACTC 600 
Gi^^TATTC ACCCAAAAGA CGGACTGAGA ri'CGGAGTCG TTGAGGGGTA CTCCATTGGT 660 



60 
120 



300 
360 
420 
480 
540 
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CGtIcc^^a ^J?^^'"^^^ CATCGGTGCC ACGCACGCGA AAC.CCGGAAT ATCCGATACC 

S iiiS ~» = 
.,cr»,c^ ssss sss?s ssss iss;?" 

(2) IHrORJLATION FOR SEQ ID NO; 100 

(1) SEQLFEUCE CHARACTERISTICS: 

{AJ LENGTH: 1842 base pairs 
fE) TYPE: nucleic acid 

(C) STRAtlDED!<iESS : double 

(D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DTJA (genomic) 
(ill) H Y POl' HET I CAL : HO 
tlv) AIJTI-SEM5IE: NO 

(vi) ORIGI/JAL SOLfRCE: 

(A) ORGAinstt: PORYPHYROi^OHAS GIHGTVALIS 
Ux) fc-LATURE:- 

(A) NAUE/KEY: misc featiirf* 

(BJ LOCATIOII i...l842 

Ui) SEQUENCE DESCRIPTIOM: SEQ ID MO- 100 

sii sEs ™ =^ = 



TTGTCTCACC GCAAGrr-rrA ««1CAGQAAG ATAAGAAGGG CCAGCTCGTC 

AA^SS l?^r?^' «==^<^«» GGCTCTCGWV 

GA^W??G S^^^^ ATCGAAGCAG AGCTCGAACA ACAGAAGAAA 

TCTTACG^G ^^^^^S^^^ S^^^^G^^^f 51?'^''°'^° GTATCGTCAA GAATATTACT 
TCATGGGSTC GTGT^S 5^^^^ S^^I'^'''' "'^TCCATAT CACTGACCTT 
45 GTTATCCTCG ACTT^SATrA ATCGTACAGC TGGATCAGAA GATCyWTGTC 

CCTCATCCT? G^A?^S ^A^^S^^^ CGTATCGCTC TCGGACTCAA ACAGCTGATG 

sjs; ™1 ^^^^ ssss sss 
50 ss; iEi sss ssss 
ssss EEi Si ES" =S" sssss 

CACCCCAGCG AGTTTAC^ ^?S?^SSTTT CTTGGACGAA GAAGATCAAA 



PCT/AU98/01023 



7?0 
780 
840 
90O 
9S7 



(2) IHFORHATIOJ^ fX)R SEQ XD NO: 101 

U) SEQUENCE CHARACTERISTICS- 

(AJ LENGTH: 729 base pairs 
(B) TYPE: nucleic acid 
yrj <C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DI^A t genomic) 

(iii) HYPOTHETICAL: NO 



60 
120 
180 



GCCGTGAGTC GTCAGGAGCA rnn^^^^TZ ^^^^^^TTTG. AGGCCGGTGG CGTCCATGCT 
AAG<^CTGG ?JS^?AG Ji^TGACAAAA CGCTCAATAC CATCAAGGAA 

35 GGGTAcS^? ?^J^4g? gS^S^ A^SS^cC G^^^? j^TCAATGTA 240 
GTGGGAGACG AAGTGGAAGT TTATAT-GAG aS^^JI^S GCTACAATCC CGAACTCAAA 300 
TTGTCTCACC GCAAGgSJch T^^n^S^S ATAAGAAGGG CCAGCTCGTC 360 



CACCCCAGCG AGTTTArrJ^k n^^™^^^' TCCGACCTTT CTTGGACGAA GAAGATCAAA 
- ^^^CC ^Sct^S ^G^T^c "^^r^"" TTCAGGTAAT CGAGATCGAC 
55 TTCGAGACGG TATTCACTGT iArfSS^^SS AAACAGTTGG AAGAGAATCC TTGGGATGTA 14 4 0 

AAGGGTGCTG ^^^^^CTCT G^SSJS CACGAAGGAA CGGTAATCGA AGTGATGGAC 
GTGAA^G ri^^f^^ 5T?5i^??ZZ I^?CCACTCC GAAGCACATG 



(3TGAAGGAAG k^GG^^ik GGCTgVaCTC gA^AaJ^ ^I^^^^ GAAGCACATG 1560 

TTCAATAAGG ATGCCAArrr Z^ntt^ GAAGAGAAGT TACCTTTCAA GGTTATTGAG 1620 

AAA^^?? AGCG?^GC ^5^™ o^^J'^'^^ GTGTATTCGA AGATGAGCAG 1600 

GAAGCTGCTG CcSaA^GC S^^CctSa r^^^l^^'^ ^^^^^AAGC GGCTCAGAA.^ 174C 

CTCGGCGAGC TGG^^^^ ^TTC^^ AGAAAGCCAC TCTCGGAGAC 1800 
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tiv) AIITI -SENSE: NO 

tvi) ORIGiHAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALTS 
( ix } FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATION I. . .729 



PCT/AU98A)1023 



10 Ui) SEQUENCE DESCRIPTIOt^ : SEQ ID HO: 101 

^cSS^ l^^^ G^I^^St CCTATCTCCC 

GGACGTCCCA AACTGACTAC CA^^^ IJ^SI? SSISJ^ CGAACTAGAT 120 

15 TTCTTCTTCA TCGACATGGA TTACACcrrP n^^J^^ CCGACAAATG GGGAACCACC leo 

TCGCGCGAAC TGAAGT^riG G^^S^CCC JSI?'*^^'^ TTGGGAGATT 2,o 

CrCTCCACAA GCTTTACTTT C^CAoS5 r^JS^T'^ZZS ATTTGGAGTA CAACGGAGGC 300 

AACCCCTCCT TTACACGTGG A^T^^^l A^^^??? aS^S^? CACCTACAAC 360 

GACTTCCACA CCTATCAGAT CACCGGCACT TGGTaSt^^ aSS^SS?^^ GGGTGCGCAC 4 20 

20 ACCTTCAACG GCTTCCTCGA TCTTTG^CT ????2^r J^Z?^^^ CGGTCTGCTT 4 60 
GTGCTCAAAG AAGGGGATAA GTTCGTArrc Sc^^r^^S SS^rS?^'^ CGGGGGCCCT 

aatcgcatca aaggcatcga caaggatJtc ^?erc^A t^^^'^'^ gatcaacctc 

ACCAGGAACT TCGCTCGCAT GGACAAATTC ^^G^^n J^l^^^'' GATGGAAATC 
ACTTTCAAC fCEJTeCATCC CTACTCTTGC GGTCAAATGG 



(2) INtORMATION FOR SEQ ID NO:102 

on SEQUENCE CHARACTERISTICS - 

<A} LENGTH: 705 base pairs 
(B) TYPE: nucleic acid 
tC) STRANDEDNESS : double 
(D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) Al^fTI-SENSE: HO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYRa-tOT^AS GIHGTVALIS 
liJC) FEATURE: 

(A) NAME/KEY: misc feature 

tB) LOCATIOfJ 1. . .705 



(2) IHFORI>tATlOH FOR SEQ ID MO: 103 

U) SEQUEJICE CHARACTERISTICS- 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 
7n STRANDEDNESS: doubl* 

<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



60 



5-JO 
600 
660 
720 
729 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102 

50 ATGAAAACAA TTAGTAAGAA CCATGCGGCA rrr-a-r-^^m, 

GTGTGTAACG GCCGGATAGC TGCTC^^ ^^^SaJ^ ^"""^^ TTTGTTTGCA 
GCTGCTCAGT ATTTTGGCGA TGCAAAC^I cr^S^^^SIS SJ^TAGGAGG AGGTTTTGGT 
GAGTTGGTCG GACGTTACAA TTAT^Jw? C^n^J^ GTTCATCCGG AGTAGGTTTG 

CGTACATTGA GAGGCGATAC CGAT^gJJc S^S^S^ TCAGTACCftT GTTGGATTGG 24 0 

55 GATTTTAAGG TCGGCTTGAC TCAGCT^J^C ^^^JS^ 1^2?"°^^^' TGCTCAAGCG 300 

AGCGATGGCT ATAAGTATCT TGGtS^^ C^^??2?Sr TCTCCCTTAT 360 

TCTTTTGGGTT TTGCTTCGGG TGCTAAAGCT T^^^^S^S ^^TCTAGC GGCCGGGTTG 4 20 

ATGGGAGTGA AGTATAAGCa- l^^lrac Sf^^^'^^ TACTGCGGGA 480 

GGGTTACTTA CCGATGCTTT GCATGCGCTG AOT^JS^A ^Z^ISS'''^^ TTCTTTCACG 54 0 

60 TATAAGATCA ATGACTCCTG GGTCAAAAAC A^J^ SISI^"^ CGAGGATCCA 600 
ATTACGTATG ACTTCGGCCT GCgS^^ J^^^a S^"^^ AGTGCTTAGG 



60 
120 
180 



660 
705 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



tPHi tV^ii -iP"" -iP^l 

%ji 



-tp«ii -="b iP^i #^ 

m u. J c:; u iLJ 



WO 99/29S70 



PCT/AU98/01023 



70 / 490 



10 



(iv) Wri-SZHSZ: NO 

(vl) ORIGINAL SOURCE; 

(A) ORGAHISM: PORYPHYROMOt^AS GIMGIVALIS 

tlx I FEATURE: 

(A) NAilE/KEY: misc_f eaturs 

(B) LOCATION 1. . .1308 

Ixi) SEQUEWCE DESCRIPTION: SEg ID NO: 103 



15 



20 



25 



30 



35 



AATATAATGT 
ATTGGGTTGC 
CCTTACACJ'C 
ATGCCAGGAC 
TCATACTCGG 
ACGTGGTATG 
GCCATGCTTT 
GCATCCGGGT 
CGTAAATACT 
TTCAAAAACT 
AGGCAGGTAA 
TTGAAGGCTG 
TCGCTCGTTA 
ATAAAGAATC 
GGAATGGACi' 
GATAAACTTC 
AAATCCGATT 
GATATAAATG 
AATTCTTACC 
GCTGTATTCG 
GAATATGACC 
TTCGGCCTCA 



ATAAAGACTA 
TATTTGCAGC 
GCTTCGGCCT 
TCGGCGTCGG 
CTGTGGATTC 
CCGAGAACGG 
Tl'CCTATTTC 
ACCAGTTCGG 
TGGGGACAGG 
TCTCAATAGG 
TCTTCl'CCAC 
CCAAGTTCGA 
TCGGTGCCGT 
AGGTTCAGAA 
ATTATACCCT 
TCTTAGGAGC 
GCAAATTCCA 
CCGTTGGGAT 
'J'GAAAGTGCC 
GTATAGGAAT 
GATTGATCCC 
CGTTCAACGA 



TAAGGGTTTG 
AGACGGTATA 
TGGCCGTCTC 
TCTGCGTCAG 
6ATGACGTTT 
GAAAAAGGAC 
CAAATCCATT 
ATCCGTTGAT 
CAATCTGAAC 
AGCCAATGCT 
GGAGGCTCCT 
CTTOGGTATG 
CTATTCTCCG 
CGGTGTAGTA 
GCCTCATACA 
AGACGTCCAA 
GGACAGAATA 
GTCGCCTAAA 
GACTTVAAGGC 
CCCGCTCAAT 
GAAGGAGGGT 
GTCATGGTTT 



TATGCGTCGC 
CAGGCTCAGA 
GGAGAACGGA 
GGCACATACG 
ATCTTCGATT 
AATAGGAAAA 
CCTATGAGTG 
CAAGTGGAAG 
CiATCTCTATG 
TCATCCCT-lt 
TACAATCCCG 
CAGTATCACC 
CGGGTGAAGA 
GTGGAGAGCG 
TTGGGGATAG 
TATAGTAA.IVT 
CGGGTATCTG 
GTTCGCTATC 
GGTGTATAU'C 
GACAGACGTT 
ATGATCAAAG 
AAAAAGCTGA 



TTCGGTGGTA 
ACAACAACTT 
CGACTATTAG 
TCAATGCCGT 
TCGGTGCATC 
TGGGAAACAT 
CGGGAGTGCT 
GAGGCAGCGT 
TCGGTATAGG 
TTGGGCGATT 
TACATCTCTC 
TTCTTCTCAA 
TGCATAGCGA 
AAACCCAAaA 
GTTTTTCTTA 
GCAAAGGCGA 
TCGGCGGAGA 
GCTTCGGTTT 
AAGGATACCA 
CGTTCGTAAA 
AAAATGCrCT 
AACTGAAC 



TGCCCTGATC 
TACCGAGTCG 
TGGGCATTCC 
CAATCCTGCT 
TACCGGAATT 
TGAGTATTTC 
TCCTTACTCC 
CCAGTACACC 
TGCAACCCCG 
CACACACAGC 
GACGCT6TAC 
ATCAGATCGT 
GCTGACTCAG 
ATATATCAAG 
TGAAAAGAAA 
GAAATTTTAT 
GATCATACCG 
ACATGGTGAA 
TAT6GTAGGT 
T6TCTCTCTT 
GAAATTGACC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
760 
040 
900 
9€0 
1020 
lOBO 
JMO 
1200 
1260 
1308 



12) INFORMATION FOR SEQ ID WO: 104 



40 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2835 base pairs 
tB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



45 



(11) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



50 



(Ivl Ajn-i -SENSE: NO 



(vU ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa-lOHAS GIMGIVALIS 



55 



(ix) FEATURE: 

{A) NAME/KEY: misc_feature 
iB) LOCATION 1. . .2835 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104 



60 



65 



70 



75 



AATACCAT'i'Y 
CGATCGATTT 
GAAGCCCAAC 
CAAGCCAACG 
AATGAAAAAG 
TCATATGTAG 
ACGCTCAAAG 
GCTAAAGCGG 
ACCGTAGCAC 
GGATCCGATG 
AAAGAGTTTT 
AATAAAGTAC 
GGAGAAGAGG 
GGAACGCTTC 
CGGTTCGATG 
GGCTTTAGCG 
GGTCGACGCG 



CGGAGAATGG 
ATCAATTACT 
AAGCCGGAGT 
TACAGCTTGT 
GGTTGTTCAG 
GTTACACTAC 
ATATATCCAT 
CAGAGGTCGT 
AGGGAGCTTC 
GGAAGATCAC 
TCTCCAAAGA 
AGGTACTGAA 
AGACCGTAAT 
AGGCCGGCTA 
GAAATAAGCA 
AGATGGACAG 
GCTTCGGCW^ 



GAACGATAGC 
GTTGTCAATA 
AGCAGGTAGA 
ACAGAGTACC 
CCTGAAAftCG 
CCACGACGAA 
GAACGAAGAT 
GGTACGCAAC 
TATCGAGGAA 
CATCAACGGC 
TCCACAGGT6 
CAAACTGAGC 
CAACCTGACG 
CG6TACCGAC 
ATGGACATTG 
CGAGATGGGA 
TAGTGGAGCT 



ACCTATTTTT 
CTCCTTGCTT 
GTATTGGACG 
GGCCAAGTAG 
TCACAGGAGG 
AAAATATCTC 
GCCCGTCTTC 
GATACGCTCG 
CTGATCAAGA 
AAG6ACATTA 
GCAATAAAGA 
GAGCTCTCGC 
GTGAAGCCCG 
CAACGCTATA 
ATCGGTAGTG 
TCCATGAGCT 
GTTACGTCTT 



Ci-fTTCAAAC 
CTCTTGGTTT 
AAGAAGGCAA 
CCGTTGCCGC 
GTGACTACAT 
TTAGAAACGG 
TACAGAGTCT 
AATTCAATGC 
AGCTACCCGG 
GCAAGATCCT 
ATCTTCCGGC 
GGATGAGCGG 
AAAAAAAGAA 
TGGCCGGAGG 
CGAACAATAC 
TCTTCTCTCC 
CGTCGATGCT 



TTTTCTTATG 
CGTCGGGCTG 
CCCCATGATT 
AGGTGCCACT 
TCTGCGCGTT 
GCAAACCATT 
GACGGTGCAG 
CGGATCCTAT 
AGCAGAGATC 
TGTCT^ATGGC 
CGATATGGTC 
TTTCGATGAT 
AGGCCTCTTC 
GAAC6TCAAT 
GAACAATATG 
CCAAGGCGQT 
GGGCGGCAAC 



60 
120 
180 
240 
300 
360 
420 
4 BO 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
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TTCAGTGTCll 
GACAAGGCCA 
ATGGACGAAA 
CAATGGAAAC 
ATCGATGT^T 
AAAGGTTCTA 
ATCAGTCACA 
ACCGACGAAG 
AAGCAATTCA 
TTGGGTAAAA 
GATCGTGAGG 
GGACTCTCCT 
GCCAAAACGT 
CGGAGCGTAG 
ATGCTCCGAA 
CGCACGACAC 
TTCGTGACGG 
TTCTCGGACT 
ACATTCGACG 
CGTTACGAAA 
CTCAAGAACA 
AGCrrCATCA 
CTCACCTATC 
GCGAATAATA 
CAAGTTGCCC 
AACTCCGGTT 
TACAGCTTCC 
CAGCCG7CAA 
ACGATCGGAC 
GGATCTCGCA 
GGCAGACGAC 



AATTCTCCrC 
TAGAGACCAC 
ATATATTGGA 
CGTCCGAACG 
TCTTTAACGA 
TCCACCAAAC 
AGCTCAACGA 
ACGGAGATGG 
ACGACAACTC 
ACTACTTCGC 
TGTACCGACT 
ACAGTAACGA 
GGGACTACAC 
CCGGAGTAGA 
TCAACTACAA 
AACCATCCAT 
AAGGCAATCC 
TCGATGCCAA 
ACATCGTCCC 
ACGCCTCCGG 
GGGCATTTTC 
ATGACGATAA 
GCAACAATTG 
GTCTGAGCGG 
TAACCCTTCC 
ACAGCGGAGG 
TCCGTGACAA 
GTATCAGCCG 
GCTACGTGAT 
GCGATCATCA 
CGTCC 



TGCCCTTAAT 
CAAACGCGTG 
ACGCTCTTTC 
TACCGAAGTG 
CACATACGAG 
TACACAAGGA 
CGAAGGCCGT 
CATATATCAG 
CAACCTGCAA 
ACAAGCGATT 
GGGCGATGAC 
GTTCACCCAG 
CGTAGGATTC 
GCAGGACAAA 
ACCGAGCAGG 
CAATCAGATC 
CGGTCTGAAG 
AAcrrCAGCGA 
CAATACGCAC 
TACGTGGCAA 
TTTCAGGATG 
GAACAAAGCT 
GATCGATACJG 
ACAGAAAGAT 
CTATGGATTC 
ATTCAGTCTG 
GGCCGGTACA 
TTCTGCTTCG 
GGTGGACTTT 
GCGTGGCAAT 



ACAGGAGGCG 
GAAAATATCC 
TCrCACAATG 
GTATTCGAGC 
ACGAAAGATC 
AACAACTTCA 
ACGATCAGTG 
GCTGTGCTCC 
TATCGGCTTC 
CTGAACAGAC 
GGGCAATACT 
TATCGCATCG 
AATGTGGATC 
CTGGCTTTCA 
ACTACCAACC 
GCTCCCGTTC 
CTGAGCTATl' 
GCTTTCAACA 
TACGATCCGT 
GCGAATCITC 
TCCTTGTTCA 
CTCTCTTTCC 
AGTATCGGTG 
TCTCGCACAT 
CGTATCGACA 
GACGAATGGC 
CTGCGTGTCA 
GCCATCAATA 
ATCTACCGAT 
ATGAATCGTC 



ATSCACGCTA 
TCGCCGAAG*3 
GTCAGGCGCG 
CGGATCTTTC 
CCACCGGAAT 
GACTGAACGG 
CCTCCGTCAG 
AAAGCGTGGA 
GCCTCTCGTA 
GTTTCTCCCG 
CCATATTAGA 
GACTCAACCT 
CCAAC31GAAC 
ATCGTGTCAA 
TCCGAGTGGA 
AGGACATCAC 
CCAACAATGT 
TTGTTTTCTI' 
CTACAGGGAT 
ATGGGACACT 
ACAGGTTGGC 
OAACGAGGGA 
GCAATATCGG 
ACGATTTTGG 
GCGATGTTGA 
TTTGGAATGC 
ATGGCTATGA 
TAGAAGAGAG 
TCAACGCCTT 
CGGGCCCACC 



CGGATACTUVC 
GAATACTTAT 
ATTTAGGATG 
GATATCCAAG 
CTCTATCAAC 
AGAATTGGAT 
TGGCGGTCTG 
GACGAATCAA 
TGTGGAACCG 
TCGCAATTCG 
CAGTCAGTAC 
CAAGAAGATT 
TCTCAGCTAT 
TCTCTCCCCG 
CTACCGAGGA 
GAATCCGCTA 
GATGGCCATG 
CGGCAACTAT 
CCGTACCACT 
ATCGCTTCCA 
CGAAGGAQ^A 
ACGCCTGACG 
ATTCTATATG 
CGGCAATTAT 
ATACAATACG 
TTCGCTTTCA 
CATCCTCGGT 
CATGTCCAAT 
CAGTCGTGGT 
TTTCGGCGGT 



(2) INFORMATION FOR SLQ ID NO:10S 

(i» SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1236 base pairs 
(B^ TYPE: nucleic acid 
(C) STRANDEDNESS: doubl- 
ID) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DMA (gencinlc) 

(iii) HYPOTHETICAL: HO 

(iv) AHTl-SEi;SE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISJi; PORYPHYRa^ONAS GINGIVALIS 
(ix) FEATURE: 

(A| NAME/KEY: misc feature 

(B) LOCATIOfJ l..,1236 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:105 



GGAGAGTATC 

AGTTTGCTCC 

GACATCTTCG 

AAAGACTGGC 

GATCGGGACA 

TCCAAAGACA 

TTGGCTCGAT 

ACTGATGATG 

AAC/^iTTTACA 

TTCTCCGGCT 

GGATGGGGCG 

TACAATCCCT 

GGTTGGGGTG 

TCCAATGGTC 

GGAACGTCCG 

CAAAAAAATA 

TCGGGACGTA 

GGGCAAAAGC 

GGACGGAATA 



CTGCAAACAG 
TGCTTGGAGC 
AAGATGACAT 
AAAACCAACA 
TTGACGCCTA 
AGAAAAGAGA 
TCTATAAGCC 
GTGAGTATTT 
TCAACAGTCC 
GGTACAACTA 
GTTATTACCC 
ATGG7\ATCGG 
GCTATCCGGG 
AGCATTCCGG 
GT6CCAAACT 
AGTTCGGATT 
CCGGCCGAGC 
AGAATCGTCC 
TCCGCAGCGA 



TAACGACAAA 

GGTACTGCTG 

CTATACATCG 

GGACGGATAC 

CAATCGTAGA 

CTCCACTCGT 

GAATACGATC 

CCTCTATGGA 

TTGGTGCGAT 

TACGTGGAAC 

CGGATATAAT 

TATGGGTTGG 

AGTGA'l'ACAT 

AGCTTACTAT 

TGGGACAGGA 

GCAGTCGAAC 

CAATAGAGAC 

CGTATTCCAG 

GAGACAGGGG 



AAAGAGATGG 

ATTACGCTTC 

CGAAAAGAAA 

GGCGACGATA 

GATGGCCAGT 

TCITCTGTTC 

GTCATTTCAG 

GACGAATACT 

CCGTTCCCTT 

TATCCATGGT 

TGGTATTGGA 

GGATATCCTT 

CACTACCACC 

TCTTATGGCC 

CGCTACGATA 

AAACCCAATA 

CGAAATATAG 

CAGAATCAGT 

GAAAATAACG 



TTATGAAGCT 
CTGCGTACTC 
TACGTAAACA 
CGGAATATAC 
CCTACGATGG 
CCGGTCGCTA 
GTGCCGACAA 
ATGATGACGC 
ATACGTCATG 
TCTACTACGG 
GCTACTACTA 
ATGGCTGGGG 
ACTACCCCAA 
GACCGAATCG 
GAATTCAAAA 
ATAATCTGCA 
AAACGGTAAC 
CCGGCAATGA 
ATAGGACATT 



GATTAAAAGA 

GCAGAATGAT 

AAACCAAGTT 

AGTGGCTTCC 

GAAAAAGTTG 

TAGTCGCCGC 

TGTATATGTA 

GTCGTCTGTA 

GTATCCATCT 

TAGCCATATC 

TGATCCTTTC 

CAGCTATTAC 

GAAGACCTAT 

TATCAAAGGT 

TTCGTCTl'CG 

AAATGTCAAG 

TCCAAACAAC 

CCGACCGACC 

TTCGACTCCT 



loao 

1140 
1200 
1260 

i3;>o 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

I860 

1920 

1980 

204 0 

2100 

2160 

2220 

2280 

2340 

2400 

24 60 

2520 

2580 

2640 

2700 

2760 

2820 

2835 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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TCTCGTAGCA ATAGTAACGG TGGCTTCT^C A*-r;rf-T"rr-T-^' r-r-..^^™«„ - 

AGCGGAGGTG GCGGACGTAG TG^^SSS^ 2gC^ COICITCTTC CGGCTCTATG 1200 



(2) IJJFOW^IATION FOR SEQ ID NO: 106 

(i.) SEQUEIJCE CHARACTERISTICS: 
(A) LENGTH: 1803 base pa 
<B) TVPE: nucleic acid 
(CJ STRANDEDNESS: double 
(D) TOPOLOGY: circular 



(ii) MOLECULE ri'PE: DiJA (genomic) 
(ill) H*^POTIIETICAL: NO 



(iv) AIJTI -SENSE: NO 



(vi ) ORIGINAL SOURCE : 

(A) ORGAI-tlSM: PORYPHYRWOIIAS GINGIVALIS 
iXx) FEATURE: 

(A) NAItE/KEY: misc feature 

(B) LOCATION l...i803 



ixl] SEQUENCE DESCRIPTION: SEQ ID NO: 106 

A?^?CG^ AG?5§™ ^'SrlZ^^^ TATTATCATA TAGAAAAGAC TAAAAGTATT 

GT^^?Sg AcIg^acS^ ^S^^r*^ CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 12C 

^r^^^^^A GGG??S^T^ T^n?^??? AATCGTATCC TTACAAGCAT GCAATCCTCC 18C 

W^^§5r 13^3^^ ATGGGGCGTG GCGTTCCTCA CATGTATGTG 420 

GG?^?^S:i TcS?I^f S^^??!^'^ ATGGGAGACT TCTACGATCA GTTCGGTAGC 480 

TCTGATCTGG Arir-rraaTnT az-nX-^^i^ 11(-AATTCCG GCCGAGGCTA CCTACTGGGT 660 

i^AI^**22 ^2 T<5T AGAGCGTTGG AGCAGTGCCA TGCGCGACAA TGACTATCAT 72 O 

Tif^^^l^"^ GGGGATCGIT CGTTTCCAAA CACGAAGCAG ACGAAGATAT ATTTGTgS? 7 HO 

ACC^??C SISSS^A^ tS^?^^^ CCGCrrCAATG TCC^^A? S^C^'^GC 840 

cSS^-Ar TACGCAGAGT ATGGATACAA ATACAACGAT OOO 

ACAATGACTA TATCTACCAC GACGGACAGG C1*GCACTCCT CTCTGcStCA q^O 

In^^^^ AAGGGATGAG TATCCTGTTG CAGGCCAAAC SicTGAGiA CT^^G^^C lofo 

ATTATTCGGA TATAGATGTG CAGAxiiCTA ^^^Sl^ S^S^C llso 

GACTGG^C? ACG^S^GGC cS^rS^S^ CAATATTTGC ACACGAAGCA GGATCAGGGT 1560 

GAGCACTATA ATt^rtrl r^n^l^^. ATCCTGCCTT CTCTGATGCT TTCCCTCTCG 1620 

ATATCGGAGA GACCAAGAAA CATTATGTCA TGGGGTCTGT CACCTATACT IfiRn 

1803 



(2) TNF0RI-1AT10N FOR SEQ ID NO: 107 

<i} SEQCJEHCF. CHARACTERISTICS: 

(A) LENGTH: 756 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) KOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 



(iv) ANTI -SENSE: NO 



|vi) ORIGINAL SOURCE: 
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<A) ORGAIilSIl: PORYPHYRaiONAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NA[<t:/KEY: mi bc_ feature 

(B) LOCATION 1. . .756 

(Kl) SEQUENCE DESCRIPTIOH: SEQ ID NO: 107 

AGGGGGTCGA GTTCGGGCAT TTCAGCCCGT GGACGCGACA TGCGTTCTTT ATTTTTGAGC 60 

GCGTTGCGCA GCTCCTCTCT CCATGGTTCA GAGCGACGCA GTCGGATAAG TTCTTCTGTA 120 

GTCATGTCAA TAAGGCAGAA AATAAGGTTA TTCCATCTCT CGGTATGCGC CCAAACGCAT IBO 

GATCATCTCA TCGAAATCCA Cn'GGTGTGC ATCGAATTCG GGGCCATCGA CACAGACGAA 24 0 

ITTCGTCTGT CXTTCXTCACGC TTATACGACA AGCCCCACAC ATACCGGTGC CATCCACCAT 300 

AATTGTA'rrG AGAGAAGCTA TGGTCGGTAT CTCGTAACGT TTGGTCAGGA GAGAAACGAA 360 

iO CTTCATCATC ACAGCCGGCC CGATCGTAAC GCAGAGGTCT ACXTGTTTCCC GTTTGATAAC 420 

GCTTTCCACT CCATCCGTTA CGAGGCCTTT CGTCCCATAA GACCCATCGT CTGTCATGAT 4 80 

GATCACTTCA TCGCTATTGG CTCGCATTTG TTCTTCAA'3G ATAACCAGAT CTTTAGTTCT 54 0 

GGCAGCCAAT ACGACAATTA CACGGTTGCC TGCTTTGTGG AAAGCCTCCA CGATCGGGAG 600 

CAAAGGAGCC ACy^CCCACAC CGCCTCCGGC ACAAACCACT GTGCCGACCT TTTCGATATG 660 
CGTACTCTGT CCCAGCGGAC CTACCACATC CGTGATATAG TCGCCGACTT CGAGTTCGGC 
CAATTTCTTG GAAGATTTGC CCACGGCCTG AACCAC 



720 
756 



(2) IHFORIIATtON FOR SEQ ID NO: 108 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2370 base pairs 
(R) TYPE: nucleic acid 
fC) STRAWDEDtlESS: double 
(D> TOPOLOGY: circular 

Hi) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYRWON AS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...2 370 



60 



45 (xl) SEQUENCE DESCRIPTIOH: SEQ ID tiO:108 

CTCTTGITTT CTTCTCCTTT ACCCCGAATG GATCGTCCTA AGCCTTCATA TATTGTTCGA 

ATAGCAGCCA TTCTCTGCTT GTTTGTCGGC AGGCCTTTGT TTGCGCAGAG CTATGTGGAC 1^0 

TACGTCGATC CGCTGATCGG GACGCTAAGT TCTTTTGAGC TGAGTGCG6G CAATACCTAT 180 

CCGGTGATCG GTTTACCGTG GGGAATGAAT AGCTGGACAC CGATGACCGG TGTACCCGGT 240 

GACGGCTGGC AATATACCTA CTCGGCACAC AAGATTCGCG GATTCAAACA GACCCACCAA 300 

CCCAGTCCTT GGATCAACGA CTACGGCCAA TTCTCCCTTC TTCCCCTTAC GGCACCGCAG 360 

AAGCCATCAT CGAACGACTC CATAGCTCTG ACTAAATGGT GCAAGCAACT CTTTTCGGAC 420 

GAACAGACCT CGTGGTTCTC GCACAAAGCG GAGACGGCGA CGCCATACTA TTATAGTGTC 480 

TAriTGGCCG ATTACGACAC ACGCGTGGAG ATGGCTCCGA CCGAGCGTGC AGCTATCTTT 54 0 

CGCATACGTT ATTCCGGC5VA TACCGAAAGT GGCTCCGGTC GATGGCTTCG TCTTGATGCC 600 

TTTACCGGTG GTTCGGAGAT TAGCATCGTG GATCCTCACA CCGTAGTGGG CATATCTCGC 660 

AAGAATAGCG GAGGTGTGCC GGCTAACTTC GCCTGTTATT TCATCCTGCA GTCCGATACT 720 

CCTATGGCCG ATGTCCTGCT TGAGACAGAT ACCGGCAAGT CAGACGAAGG CACAAGGGCA 780 

TGGGCAGCCT GTCGCTTCGA TTCGCAAGAA G'lTACCGTCC GGGTGGCATC TTCTTTTATC 84 0 

AGTGTCGAGC AGGCCGAAAG AAATCTTGCG GAAGTCAAAG GGCAGA6TTT CGACCGGATC *»00 

AGACTTGCCG GTCGCGAAGC TTGGAATAAG GTGCTCGGAC GCATACATGT GGAAGGAGGA 960 

ACGAAGGATG ACCGCACTAC ATTCTATTCC GCACTCTATC GCTGTCTGCT TrrrCCCCGT 1020 

CGCTTCTATG AGGAGGATGC TTCCGGCAAT TTTGTGCATT ACAGCCCCTA CAATGGAGAG 1080 

DO GTACTTCCCG GTTATCTCTA TACCGATACC GGATTTTGGG ACACTTTTCG AGCCCTTTTC 114 0 

CCCCTGCTCA ATCTGCTGTA TCCCGATGAA AACATTAAAA TTCAGGAAGG TCTGCTGAAT 1200 

GTATATCGCG AGAGTGGCTT TTTCCCCGAA TGGGCCACTC CGGGCCATCG GGATTGTATG 1260 

ATAGGCAACA ACTCTGCTTC TGTTCTGGCG GATGCCTACC TCAAGGGTGT TCGGGTAGAA 1320 

GATACCCGTA CACTGATGAA CGGACTCTTG CATGCTACGA AACCCGTCCA TCCGAAAATC 1380 

TCCTCCACGG GTCGCAAAGG TTGGGAGTGG TACAACTCCT TAGGTTATGT TCCGGCTGAT 1440 

GCAGGCATCG ACGAAAGTGC TGCCCGTACG CTCGAATATG CTTATAACGA TTGCTGCATC 1500 

CTCCGACTGG GGCGCACATT GGGTTGGGAT AGAGCTGCAT TGGACACGTT GGCrCATCGT 1560 

TCGATGAACT ATCGTCATCT GTTCGATCCG GAAACCAAAC TCATGCGCGG TAGAAATCAG 1620 

GATGGTAGTT TCCGGACACC TTTTTCCCCT TTCAAATGGG GAGATGTATT CACGGAGGGC 1680 

75 AATGCCTGGC ACTAGACTTG GTCGCTCTTT CATGATGTGC AGGGGCTTAT CGACCTGATG 174 0 
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t2t IWFORMATIOM EX3R SF.Q ID NO: 109 



(i> SEQUENCE CHARACTERISTICS: 

(A) LEHfTTH: 858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS; double 
20 CD) TOPOLOGY: circular 



tiil MOLECULE TYPE: DHA (genomicj 

(ill) HYPOTHETICAL: NO 

(ivi AMTI -SENSE: NO 

(vi) ORIGiWAL SOURCE: 

(A) OKGAIIISH: PORYPHYRCflOMAS GINGIVALIS 



t2) INFORMATION FOR SEQ ID NO: 110 



(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1134 base pairs 
(O] TYPE: nucleic acid 
(C) STRANDblUNESS: double 
60 tD) TOPOLOGY: circular 



(11) MOLECULE TYPE: DNA (genomic) 

(iiil HYPOTHETICAL: HO 

(iv) AIJTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROf^ONAS CTMGIVAtlS 



(Ix) FEIATURE: 

(A) NAUE/KEY: miscfeature 
tB) LOCATION 1. . .1134 

75 ix±) SEOUEt^CE DESCRIPTION: SEQ ID NO: 110 



WO 99/29870 
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GGAGGAGATC CCCCUTTCGT GTCTATGCTC GATTCGGTAT TCAATACTCC TCCTATGTTC 1600 

GATGAGAGCT ATTACGGATT TGTCATCCAC GAAATCAGAG AGATGCAAAT AGCGGATATG lfl60 

GGCAATTATG CTCATGGCAA TCAACCCATA CAGCATATGA TATATCTGTA TAATCATGCC 1920 

GGTCATCCAT GGAAACCTCA GGAGAGACTA CGCGAAGTGA TGGGGCGGCT CTATCGTCCT 1980 

5 ACTCCGGATG GGTATTGCGG CGATGAAGAC AACGGACAGA CTTCGGCTTG GTACGTTTTC 204 0 

TCTGCTTTAG GCTTCTATCC TGTTACACCC GCTACGGATC AGTATGTGCT CGGTTCGCCG 2100 

ATTTTTTCCA AGGTAATACT CTCTTTTCCC GACGGACACA AAACGGTGTT GCATGCTCCG 2160 

GCCAACAGTG CCGATACGCC TTACATCCGC TCGATCAGCG TAGAAGGAAA AGAATGGAGC 2220 

TGCAATTACC TGACTCACGA ACAGCTTCGC TCTTCTGCAT CCATTCAATG GATGATGGAC 2280 

10 ACGAAACCCA ATTATAATCG TGGTATGAAG GAAAGTGACA GACCTTATTC CTTCTCCACG 2340 

GAGCAACAGC GTCGCGCTAA TCACAGTAAT 23*?0 



PCT/AU98/01023 



( ix ) FEATURE : 

(A) IaIAI-IE/KEY: mlsc_feature 

(B) LOCATION 1. . .858 

35 txi) SEQfUENCE DESCRIPTION: SEQ ID NO: 109 

ATTTGTGGCA GTAAAATGAA TCTATCGGGA TTACAATCTT TCACTATGAT GAAATCCATG 60 

CGCAGCGTGC TGTTGCTACT CTTTCCATTG TCTTTGATCA CTGCTTTGGG CTGTAGCAAT 120 

AACAAAGCTG CCGAATCGAA GTCTGTCTCT TTCGATTCGG CGTATCTCGA ACGCTACATC 180 

40 CCTCTGCGGG CAGACATAGA TACGCCATCG CTGCATGTGA TGATCAGCTA CGTCTATCCl' Z40 

TCGGGAGATG ATATGCTCAC AGAGATTTTC AACGGTTTGC TCTTCGGCGA CAGCCTGATG 300 

GATTCCTCTT CCCCGGAGAA TGCCATGGAA GGCTATGCAC AGATGCTGGG AGAAGACTAT 360 

CGCTCTAACA ATGCCGAACC CAATCTGCAA GGGCTTCCTT CTGACCTTl"f GGACTATATC 420 

TACAAGCAGG AAAATACCAT CGCTTATTGC GATACGGGAT TGATCTCCAC GCGCATCAAT 4 80 

45 ACATATACTT ACGAAGGCGG TGCACATACG GAGAATACAG TCCGGTTTGC CAACATCCTT 54 0 

CGCACCACCG GCAAGGTGCT CGAAGAGCGA GATATATTCA AGATCGACTA TGCGGAAAGG 600 

CTGTCCGCAC TCATCATAGG ACAATTGGTG CACGATTTCG GCAAGACCAC ACCTGCCGAA €60 

TTGGATGCAA TAGGTTTCTT CAACGCAGAA GAAATACAGC CCAATGGCAA TTTTATGATC '720 

GATGACAAAG GTCTCACATA CTGTTTCAAT GAGTATCAGA TAGCTGCTTA TGCCAGAGGT V80 

50 GCTGTCTATG TCCGTCTCGG ATATGACGTA TTGGCTCCTT TGCTAAGGGA TGATTCCCCA 84 0 

CTAAAGCGTT ACTTGCCG 858 
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GGGATAATAT CTGTTCTTTC ACATGTGGTT 
GCTGCTGGCA CGC7\TAGTGT GTATTCGATT 
AGGTCTATGG GTATTATCTC AGCCTGCCGT 
5 AGCGGTCTGT CGAGGGCAAA T6TAGGTATT 
AAAAAGACyVA AACTGCAAGA AAAAAATGTT 
AATAATTTGA TACCTTCGCT CCCATTTAGA 
AAAAAAACTA CTTTGACAGG ATCGATATGT 
AATGCCCAA7 CGAAGTTAAA GATCAAGAGC 

10 ACGGCCGGAA ATGGTTTTGG TGGCAATATC 
GTACACCACA GCATTCTCCC CGAAGGGTTG 
GCACACTGGG AAGAGGCCGG AAGTCCGAAG 
GGTATTGrTG AAAAGGTAAT AGTCTTCGAA 
CTCGGCCGCT TCCTCATCAG TQTGGGGATA 

15 GGATGGACCA ATGACGCCCA TATTCTTTTC 
GACACCTACA CAAGAGCTGG CAGTGACCTT 
AGCAATGGCT TTTCACCGGG GATCGGACTC 
AAGAGCAAAT ATGATTTTCG CCTCGGTTTC 
TATCCGTATC GTAATTTCAA GCTGGATGGA 

20 ATCAACCACA TCGGCCATGT GGGCTTCAAC 



GACCGTCCAC AATGGGGTGC TTCTCCCGAA 60 

CTACATCCCT CCGGCX3GTAT TATCCGGATC 120 

ATCGCAATTC TTGCCGGCAA GCACCCCCGT 180 

CTTTCGTACA ATCCAGAAAA CACGCCCGAG 240 

TTCCTCCAAA TCCGGCTCCG TCAATCATTT 300 

ATCGATAACA CAAAAAAAAT CACTGAAATG 360 

GCTTTACTCC TGTrrrTGGG TCTd'CGGCC 420 

ATTGAGGCAG CTACCACTTT CAGTTCGGCC 480 

TTCGGCATGG ACATGAGCAT ACGGATGAGG 540 

GATTTTTCGG TAGGAATACA TGAAAGAAGA 600 

CTCATGTATA CGAAtGTCCC AAGTATCATT 660 

GACGCAGAAG ACTTTTTTGA CAAAAAAGCT 720 

TCCTATACCA AGCATCTGGG AGCGTATTGG 780 

TCACC6ATAC CCAAGAGCAA GGTCCACTAT 84 0 

GTACrrCAGT CCGAAGATGT TGCCACAGTG 900 

AAAAGTl'CrA TTTGGTGGAA AATGCX:CATC 960 

AGCCTGGGCT Aa*GAGTATCT GAACCTGCTA 1020 

AATAAGCCGC TTTCAGCACT ATCTCCTCGC 1080 

TTTACCGTGG GTCTTTGGAC TAAT 1134 



(2) INFOWtATlOK FOR SZQ ID NO; 111 

25 ii) SEQUENCE CHARACTERTSTICS: 

(A) LENGTH: 380 7 base pairs 
<B) TYPE: nucleic acid 
(C) STRATA DEDf/ESS : double 
(U) TOPOLOGY: circular 

30 

(li) tlOLECULE TYPE: DMA (genomic) 

(itt) HYPOTHETICAL: HO 

35 (Ivl ANTI-SEMSE: NO 

tvi) ORIGINAL SOURCE: 

(A) ORGAtUSH: PORYPHYROHOHAS GINGIVALIS 

40 tix) FEATURE: 

(A) HAI4E/KEY: ml sc_f eaturs 
(DJ LOCATION 1,..3807 



45 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 111 



GTAAAATGTA TGGGCAAGTA TAAAAGAGCT AAGTACCGCT ATTGGCTTTT TCCTTTCTGT 60 
TCGGATTATT ATACCTTTGA GGGAGTTACT TTTTTATGCG CATCTGACGA TATGACAACC 120 
AAGAAACCCC AAGCCATTTT AGACTTAGAG AAGGCCTATA ACATTGAAAT TCCTGATCTC 180 
TCCTCACAAG AAGGGATAAG CTGCTCGGTA AATAGATATT TCAAGCAAGA TTCCTCCGGT 2-10 

50 GCAGTCCTTG AGCTTTGCTT GCGAGAATGT CAGATAGAAA GCATGACTTG GCTTATTGAT 300 
TTTCCTGCTC TAAAAAAGCT TGATCTATCG TATAACCAAA TCAGTAAGCT AGAGGGTCTA 360 
GAACGTCTTA CTTCGTTAAC AAAACTTCGT CTAAGAAGTA ACCAAATCCG TAAACTAGAG 4 20 

GGCCTGGATA GTCTCACCTC GCTAACAAAA CTTTCTCTCT CCGATAACCA AATCAGTAAG 480 
CTAGAGGGTC TGGAACGTCT CACCTCGTTA GCGGAGCTTT ATCTTTTGGA TAACCAAATC 540 

55 AGTAAACTAG AGGGTCTGGA ACGTCTCACG TCCTTAGCAA CGCTTGAACT ATCGGGTAAC 600 

CAAATCCGTA AGCTGGAGGG TCTGGAACGT CTCACGTCCT TAGCAACGCT TGAACTATCG .660 
GGTAACCAAA TCCGTAAGCT AGAGGGTCTG GAACGTCTCA CTTCGTTAAC AAAGCTTCGT 7 20 

CTAAGAAGTA ACCAAATCAG TAAGCTAGA6 GGTCTGGAAC GTCTCACGTC CTTAGCAACG 780 
CTTGAACTAT CGGGTAACCA AATCCGTAAG CTGGAGGGTC TGGAACGTCT CACGTCCTTA 84 0 

60 GCAACGCTTG AACTGTCGGG TAACCAAATC AGTAAGCTAG AGGGTCTGGA ACGTCTCTCT 900 
TCGTTAACAA AGCTTCCTCT AAGAAGTAAC CAGATCAGTA AACTAGAGGG CCTGGAACGT 960 

CTCACCTCGC TAACAAAACT TTCTCTCTCC GATAACCAAA TCAGTAAGCT AGAGGGTCTG 1020 

GAACGTCTCA CCTCGTTAGC GGAGCTTTAT CTTTTGGATA ACCAAATCCG TAAGCTGGAG 1080 

GGCCTGGAAC GTCTCACCTC GTTAACAAAG CTTCGTCTAA GAAGTAACCA AATCAGTAAA 114 0 

65 CTAGAGGGCC TGGATACTCT CACCTCGCTA ACAAAACTTT CTCTCTCCGA TAACCAAATC 1200 

AGTAAACTAG AGGGCCTGGA ACGTCTCACG TCCTTAGCGG AGCTTTATCT TTTGGATAAC 1260 

CAAATCCGTA AGCTGGAGGG TCTTGATGGT CTTGCTTCCT TAACAAGGCT TAGTCTAAGG 1320 

CGGAACCAAA TCAGTAAGCT GGAAGGACTA GACAGACl'AA AGGTTTTGAG AAAACTTGAT 1300 

GTTTCGGGCA ATGATATTCA ATCTATTGAT GATATTAAGC TAl'TGGCTCC GATTCTGGAG 14 40 

70 CAAACTTTAG AAAAAG«3A6 AATCCATGAC AATCCATTTG TTGCATCATC AGGCTTGATA IS 00 

CTCTCTCCl'T ATGATAATCA TTTGCCGGAG ATTAAAGCTC TTCTTGAAAA AGAAAAAGAA 1560 

AAACAGAAAA AGACTTCAGT TGAATATCAC CCATTTTGCA AAGTAATGCT ATTGGGAAAT 1620 

CATTCTTCGG GTAAAACAAC ATTTCTTAGT CAATACGATA CAAATTATAC GTATCAGAAA 1680 

AATACACATG TGTTGTCGAT ACATCGAAGC /\ATAACCCTA ATGCGATCTT TTACGACTTT 174 0 

75 GGGGGACAGG ACTATTATCA TGG6ATTTAC CAAGCCTTTT TTACCACCCA ATCGTTATAC 1800 
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CTTCTCTTTT 
CAGACTCITA 
TGTATGTCCG 
ATTATCATTC 
GCCGAGAATG 
GTACATGCGC 
TCAATTCAGA 
GATAATAAAC 
GCTGAAAATG 
CCIAGGGGAGG 
CCGGCAGCTT 
GGAACAGTTC 
TTTGAAGAAG 
GTATATGAAG 
GAAGCCTATA 
CGTTTTATCC 
GCTCTAAAGC 
CAAAC6CTTG 
CAGATCTGGA 
AGAAAGACAT 
GATATGTATT 
ACGAGAAGCA 
TCCTGTGCCC 
GACGATGAWV 
GATAAAGAGC 
AATCTGGCTA 
GAGACCTCTC 
GATAAGTTGA 
GCCGACTGTA 
CATGAGTTGC 
TGTAGArrCG 
ATCAATCTTTG 
CATCGTGATG 
ACAAAACAGG 



GGGATGCTAA 
ATTTCAATCG 
TTGGAiGGAAA 
AGACTCATGC 
GAGTATTGGA 
TCAACTATCT 
TCACAGAAAA 
ACATCCCTAT 
ATCTTTACAC 
TGCTTTACTA 
TTGTCCAAAT 
CTAAAGACAT 
ATGGCCAAAA 
ATAAGGACTG 
AATGGCITAC 
CCTTTGGCCT 
GGTArrGGCO 
AGCAAGAAGA 
TCAAGCn'CGA 
CAGCTAAGGA 
GGAACAATAT 
CrATTCGTGA 
AAGCGGATAA 
GCAAGACTAC 
GGGTGCGACA 
CTGCAAAACA 
TGCAATTl-l'T 
CTAAGTTTGA 
TAATCGCTTT 
CTGTATTTCG 
AAGACGATGA 
G.WKAGAGGG 
AAAATTGGGT 
AGGTAAATAC 



CAAG'jATCGA 
CCCCTATTGG 
TCCTGATGGC 
COATGAAACG 
AGAAATCTAT 
GAATGAGCGG 
AGATAAGGGA 
CTCTCTCGAA 
CATAGAGTAT 
TCGTGAGAAT 
GATTCATGGA 
TTTTGAATGC 
TGGTAATATG 
CTATGTGATA 
TTTGGGATTC 
GATCAACCAG 
AGATCAGGTC 
AGAGAAAGAG 
CTTTACCGAC 
TATGCAGCGG 
CCCTCCGAGG 
AACAAACAGA 
AGATTTGACG 
GGCGAGGATP 
AGTATCGACT 
GATCTTTATT 
CAAACCCTTG 
AACACCTATT 
GATCAGCCAA 
GGAGTATAAC 
GTTCCTTCGG 
AAAAACCATT 
GGCAGTAGTC 
AGATGAA 



AACTTTGTGA 
TTAGGACAGA 
AAGGACACAC 
GGCGCTAAGC 
GTATCCTTAG 
GTGCGAGAAG. 
TTGTACGAAG 
GCTCTTGCGG 
CTACAGACCG 
CAGAAGCTGA 
GAAATCCTCC 
AAACTGCATA 
ATCTTGCAGC 
CCGGGCl'ATC 
GAGAGGCCCA 
ATTATAGCCT 
ATCTTCACAG 
GG-n-rGCCCA 
TTGGCCATAT 
AAAGAGGCTA 
GAGCAAATAG 
AAGAAGAGAC 
GAGTCTCATT 
GCAGCCTATC 
CGTCCCTACA 
TCCTATTCCA 
GAGAAGAATG 
CACCCTGAAA 
CGCTATCTGG 
AAGACCATAG 
GAGAAATATT 
AAAGCTTATG 
AGAGAGTTCA 



GCGTAGATGA 
TAGCCTATGC 
CACAGACCAC 
AGCAAACCTT 
AGCCCAAGGC 
TTGTOGCAAG 
CTCTTCCCAC 
CTCAATTGAA 
AA7TGAACCA 
ACAATTATGT 
AAAAAGACAA 
ATCTAAGTTC 
TATTATTGGA 

TcccrrpGCA 

ATTTTGTCCT 
ACTATGGCCG 
CAGGCCGTGA 
AGACGAATGC 
CCOTATTCAT 
CTATCCTCAG 
GAGATAAGGA 
CCATCCAGGA 
ATATCCATTT 
CCTTGAAGAA 
AACATCTTTC 
AAGAGGATCA 
GTCAGATCGA 
TAAGAAAGCG 
CCACGGATTA 
TGCCGATATT 
TTGCTCAGAA 
ATAGTATTAC 
AAGAGAAGAT 



TAAAGAATAT 
CTGCAATCGT 
AGACGATACA 
AGGCl'GTGCA 
GAATAGTGCC 
CAGGAGTAAA 
AATCGCCGGT 
TAAGGGAAGA 
GCTTAGTCTG 
CTGGTTAGAT 
CATCAATAGA 
CGGAAGTATA 
AGAGCTGATC 
TTCCGATGAC 
CAAATTCGAA 
GGAAGAAGGT 
GATGGATAGG 
CGAGGATTAT 
CAAAGAGCAG 
TGATATGTTG 
TACGGAGCAA 
TCTCTACCTC 
GGGCACGCTG 
CGGCGTTATC 
CGTCAATAAA 
GACTGAACTG 
GATCTACTAT 
TAT-J-GTCGAA 
CATCCTGGAT 
GATCAAGCCT 
AGCTCAAATA 
GGCATCAGCC 
ATTAAGAATA 



1920 
1980 
20*10 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3190 
3240 
3300 
3360 
34 20 
3480 
3o40 
3600 
3660 
3720 
3780 
3807 



(21 IWrORMATION FOR SEQ ID HO: 112 

ti» SEQUENCE CHARACTERISTICS: 

(A) LENGTH; €93 base pairs 

(B) TYPE: nucleic acid 
(CI STRAW DEDMESS : double 
tD) TOPOLOGY; circular 

(li) MOLECULE TYPE: DNA (genomic) 

{ill) HYPOTHETICAL: NO 

(iv) miTI-SEUSE: HO 

ivi) ORIGINAL SOURCE: 

{AJ ORGAIIISH: PORYPHYRa-tOtlAS GINGIVALIS 

(ixj FEATURE: 

<A) NAME/ KEY: iniac_f tjature 
(B) LOCATIOH 1. . -693 

{kD SEQUENCE DESCRIPTIOH: SEQ ID HO! 11 2 



AAGTTTATGA 
AGCAGTTCCG 
AAGGTGAGTT 
GATGCGAACG 
GATATAGCTA 
TATGCACACC 
ATACCAACCT 
ACGATCGTCT 
TGCATCCGAA 
TCTCACGATG 
AAGATTGTGG 
CAGATTTGTG 



TGAAAAAAGC 
CCCAAACAAC 
TGAACTTAGG 
GTCTCCCTTT 
CCACTTATTA 
CGGGCGGCAT 
ACAATGCAGT 
TGATACCCAA 
CCGAAGCCGG 
GTAACGAAGT 
'GGGACGTATC 
CCCAAACAGT 



ATTTGTTTTC 
GACGAACAGT 
GGTCCCCCCC 
CGAAATACCT 
CATAGCGAAT 
CGTGAGGGTA 
CTGCACCCGG 
GCCCAAAGTC 
GAGGGAATTl* 
AACGATAGGC 
TCAAAAGTAT 
ATCACAACAA 



GTACTACTGG 
AGCCGGAGTT 
GTAAGCACAG 
ATCTCTTTCA 
AGCGAGGCAA 
GAAGGTCGTT 
ATTACATTCG 
TCGCTGCCTC 
ATCCTTTGCG 
GGTAAACCIT 
GCCGTGGGGG 
AAA 



TTTGCCTATT 
ATTVTACAGG 
AGGTTTGGGG 
GTCGTTTCAA 
CTTTGAATGA 
TTTGGAAAAT 
AAAATCAAGA 
ATGTGTCGGA 
AAGAAGACGA 
TCTTGCTCAA 
TAGGAGAAAT 



CTCCTCGTTC 
ACGAATCGAG 
AATGACCCAT 
CAGCCAGGGA 
ATGGTGCGAC 
GACTTACAAC 
AATAGAAGGA 
ATCGGTGCCT 
CACCTTTGTG 
TACCAACGTA 
TCGATTCCTG 



60 
120 
180 
2*10 
300 
360 
420 
480 
54 0 
600 
660 
693 



{?.) INFORJlATIOt) FOR SEQ l^ HO: 113 



SUBSlllUt H SHEET (Rnie 26) (RO/AU) 
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10 



(i) SEQUENCE CHARACTERISTICS: 

{A) LENGTH: 168 9 base pairs 
(B) TYPE: nucleic acid 
(CI STRANDEDNESS: double 
(DJ TOfOLOGY: circular 

(11) MOLECULE TYPE: DKA IgenoraicJ 

(iil) HYPOTHETICAL: NO 

{XV) AtlTI-SEHSE: NO 



15 



20 



tvl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOflAS (SINGIVALIS 

(Ix) FEATURE: 

(A) NAME/ KEY: mi5C_feature 
{B) LOCATION 1. . .1689 

f.xi) SEQUENCE DESCRIPTION: SKQ ID HO: 113 



25 



30 



35 



40 



45 



50 



TCGCTAATCA 
ACAAAAACTA 
ACCCTCTGTC 
TACAATCAGG 
GTAATSACCC 
GACCAAGGTG 
CAAGACrrCG 
GAGCTCATGA 
AACGCGCAGA 
GTGGATATAG 
GCrrTGGCTT 
GTGTTCTCTC 
GAGAAGAAAA 
AATGCCTGGC 
ATCGGTTTCT 
AAAGTGAGTG 
AACAATACGA 
TCTGAATATT 
AAAGGAAAAA 
AGTGAGACCA 
GCCAAAAAAG 
ATTCATAACA 
TACACACCTC 
TATCl'GCCGG 
GAAGACATCG 
GCTGTGATTA 
AGAGTAGTCG 
GCTAAGGGTA 
ATCGTGGAA 



ACAATTCAAG 
TGAAGACAAA 
CTCTCGCCCA 
AAAAACTGGT 
ACAiSTGGATA 
CCACATACCA 
ATATCGTGGT 
ATAAGCCCGG 
ATGCGAAACT 
CCTCCAACTA 
TCGCTTACAC 
TGAATC3GAGG 
TTGACAAGGT 
CGCTAATGGG 
TGTCGAACTT 
AAAGCGACAT 
TCAATGGGGA 
CGGATTGGGA 
CTCCGACTAT 
ACTCGGGGCT 
AAGAGAAC6G 
AAGATTTGTG 
AAGTAGACAT 
GCAAACGGAT 
TAATGCAAGA 
GCCT<3CCGAC 
CTGAGGCTTC 
CGTACATACT 



AGACTGGAGG 
AGTTTTACGC 
AGCGCAGACG 
ACAAACGAGG 
CGACACCGGC 
AAAGTTGAGG 
AACGGGTAAG 
AQGATATAAG 
CGTGTATAAG 
TCGTTCGCCT 
CGGC7TCAAC 
GCAAAATTTC 
GGATCTCTCA 
TGTGGTATTC 
TGTCGACAAT 
6TCGTTCAGC 
GA(3TTGCCAC 
CATTCGGTAT 
GGATGATCTG 
GGGGTATGAC 
TACGAACACG 
GAGCGACACA 
CAATCCGACC 
CGTTTGGl'CT 
AGGCAGCATG 
GGCAGCAAAC 
TrTCTCCGGC 
CAAGGTCGTA 



GCTCCTGTCC 
AAATTCGTGG 
ATGGGAGGAG 
ATGAGTGTGG 
AATAGCAATG 
GATTGGGATC 
AATGAATCCG 
AGTAGAGTTG 
CAAGACTTCr 
TCTTCTCTTA 
AATACGCACA 
AATAAAAACT 
TTGGGTAGCA 
GAAATGAATA 
GATCCCGAAT 
CCCAAAATCC 
AACTTCATGA 
GTATATCCCA 
GTGGAAGCTT 
AAGAACGCCA 
CTGAAATACC 
TTTACGTATA 
AAGGGTCTCG 
GATACGCAGT 
AAGCTCTACC 
TGCAAGGCTG 
AACGAATACA 
TCCGATACGG 



GATTGAGTCT 
TGGCGGCTTT 
ATGATGTCAA 
CGGACAACGG 
TGAAGATCTT 
CATCGGATGA 
ACATCAAGAT 
CGGTCTTCAG 
CCAATGTGCA 
ACAATGGTGG 
AAATAAGTTT 
TACTCTTCAG 
CCTCTGAATC 
AACAAGGGGG 
TTCAGTGGTC 
AAATGTTGCT 
TTACGTACAG 
AGAAATCGTT 
TCCTTACAGC 
ATCACTACCT 
GCTGGGCCAA 
CATCATCl'GC 
TGTGCTGGTC 
GGACCCATGC 
CGAATCCGGC 
TTGTTTACGA 
GGCTGAACGT 
AGCGTTTCGT 



CAAAAAAAAG 
CGCCGTCGCA 
GGTGGTCCAG 
ATGCATCTAT 
CCGCTCCAAA 
TTATCAGTTT 
TTGGTCGGTA 
TCGCGATGCC 
GTTGTACGAT 
CAACCCTTTT 
TGTGGACTAT 
TCAAGATGGA 
CATGGGTCAC 
AAAAAGCGAT 
AGGCCCTATA 
GGACGAGGAT 
C6ATTATGAT 
CAAGTATGAA 
TTCGTACCAG 
GATTACATAT 
TTATGACAAG 
CAATGCTCTC 
ATGGGTGGAA 
CAACGGTGTA 
TCAAGAATAT 
TATGCAGGGC 
GCAGCACTTG 
AGAGAAGCTC 



60 
1?.0 
180 
Z40 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
12 60 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 
1609 



(2) INFORMATION POR SEQ ID NO: 114 

55 U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEONESS: double 
(DJ TOPOLOGY: circular 

60 

(ii) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

65 (ivl ANTI-SEtlSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI/ISM: PORYPHYRaiONAS GINGIVALIS 

70 (Ixl FEATURE: 

(A) NAI-IE/KEY: misc feature 

(D) LOCATION l..,235B 



75 



(Xl) SEQUENCE DESCRIPTION; SEQ ID NO: 114 



SUBSTrnJTE SHEET (Rule 26) (RO/AU) 



'iui -^sii Q r::: o. ii3 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



ATCGTTTACC 
AACTTCCTCC 
GTCTCTTCGG 
GGTACCACGA 
GTCAGCGGAT 
TCGGCCGACA 
CCCGAACTGA 
TGCTATGGCT 
GACAAATACG 
GCTTTTCCCG 
TCGG-SAGCCT 
ACGTTCTGCT 
CGACCGATCT 
GATTCGACTG 
GACATCGGTG 
GCCGTTTCAT 
GTGTTCAAAA 
ATAGAGCGTT 
AAGACCATAT 
GCCAATGGAG 
TACACCGTGT 
TATCCGGAGA 
TCGCTGCCGA 
CTTATCGTCA 
TATGAAGCCA 
AACGACGACT 
TCGCATGCGC 
GGGCATAAAG 
AATAAGGAGT 
GATCCCAAAC 
AACTATGCCT 
AAGGTTTTTT 
GAGCCGCIACA 
CACAAATTGA 
GGTAATGACG 
CCTGACTGTC 
ATTCGGCTCA 
AATCAACCGA 
CATGGAACAA 
CTAAGCAATC 



TTTGTCACTG 
TCTT1TGTTG 
GTAGACGACT 
ATCCGGGAGC 
CGACAGAGAA 
ATAGTTACTG 
GTGGAATACT 
CTTCGCTCAG 
GTATAGATGC 
AAGGAGAAGG 
CTGTTCGATT 
ACAATCCGCA 
CTGCCGGCTA 
CAGGGAAATA 
TCCGATTCTC 
TCGTCAGCGA 
GTGTCGGAGG 
GGGAGGAAGC 
TCTATACCGC 
AATATCCTAT 
TCTCTCTTTG 
AGCAGTTGGA 
GATGGGAGCT 
TCAATGACAC 
TGAAARAAAA 
ATCTCACCCT 
TGGAATACTA 
AAGACGCAGC 
ATGGTATGCT 
AGGGTGAAAA 
TTTTCGTTCC 
CGGAAAGGTT 
TCGCCTATCC 
CCCGGGAGIT 
ATGCCGGTAC 
CGGGCAGCCC 
ATCCGCAGTA 
CAGATTCCAT 
GGCATATCAG 
GTCCTCGA 



TATGAACCAC 
TCTGTCGCTG 
GACAGAATAT 
AGTATTGCCC 
TCGCTTCGAC 
CATCGGTTTC 
GCTGATGGCC 
TCGAGAATAT 
AGCCGTGACC 
CCATATCCTG 
CTTAAACGAC 
AGCAGTTTTT 
TTGGAAGAAG 
TAAGCAGTAC 
GTTCAACTGC 
AGCCAATGCG 
GAATCCGGCC 
CCTCGGTACG 
ACTCTATCAC 
GATGGGCAGT 
GGACACGTAC 
TATGGTACGG 
GTACGGACAG 
TTGGCAAAGG 
TCCTTCTTCG 
CGGCrrCGTA 
TCTGGCCGAC 
TCTATTCGGA 
GTGTCCA'rTG 
CTTCGAGCCT 
CCACGATATA 
GCAGAAAGTC 
TTACCTCTTC 
GATAGACAAA 
GATGAGTGCT 
CACCTATACA 
TTATCCTCAG 
TTACATCCAT 
CCATGCCGAT 



AGACGATCAA 
ATAGCGTTTG 
GTCAATCCCT 
AATGGGT'l'GA 
AAAGATTCGC 
AGCCATGTGA 
ACTTCCGGCA 
GCGCGCCCGG 
GTAACCGAGC 
CTGAACCTGG 
TCCACAGTCG 
CGTCAGTATT 
CAGCCTCCTA 
GACGGCTACA 
GATCAGGGGG 
CTCTATAATC 
AAGGCTTTCT 
GTGCAAGTGG 
CTGCTGATAC 
GGCAAAACGG 
CGCAATGTAC 
ACACTGATCG 
GAGACCCTGA 
GGCCTTCGTG 
GCAGGAGCGA 
CCGCTTCGCG 
TGGAATCTGT 
AAACGCTCGT 
CTGCCGGATG 
AATCCCGGTT 
CAAGGGCTTG 
TTCGATGAAG 
TCXTTATl'TCC 
CATTTTTGCA 
TGGCTTGTCT 
CTGACCTCGC 
GGGGAGTTGA 
ACGGTTTCTC 
TTGGTGCGCT 



AAACCATGCT 
CTCCCGATGC 
TTATCGGAAC 
TGAGCGTTAC 
GTTGGTGGAG 
ATCTGAGTGG 
CATTCGATCC 
GAGAATACAA 
GGACTGCTTT 
GACAGGCCCT 
TCGGCAGCAG 
TCGTACTTCA 
TGACAGTGGA 
AGCGTGAGAT 
AAAAGATCTA 
TGGAAGCGGA 
CCGCTATACG 
AAGGAGGCAC 
ATCCGAATAT 
GTAATACGGC 
ACCCGCTGCT 
ACATGTACCG 
CGATGGAGGG 
CTTTCGATAC 
CCCATCCGAT 
AACAGTACGA 
CCCGGTTTGC 
TGGGCTACAG 
GATCATTCCT 
TCCACGAGGG 
CCCGGCTGAT 
GATATTATGA 
CCAAGGAAGC 
ATGCTCCTAA 
ATTCCATGCT 
CGGTATTCCC 
TCATTACGAC 
TTGGCAATAA 
GCGGTCACCT 



GACGATCCGA 
ACAAAGCTCT 
GGCCAATTAC 
CCCTTTCAAT 
TGCGCCTTAT 
AGTAGGCTGT 
TGATTACTGC 
GGCTGTATTG 
GACCGAATTT 
AAGCAATGAA 
GCTGATGGGG 
GGTGAGTCGG 
AGCCCAATGG 
GAGCGGTGAT 
TGTACGATCG 
GCAAGAAGAG 
CTCTCGCGCT 
ACCGGATGAA 
CCl'ACAAGAT 
TCACGACCGC 

CTGcxrrccTc 

AGAGAGCGGG 
CGACCCCTCG 
GGCJVACGGCC 
CCGTCCTGAC 
CAATTCCGTA 
CCACGCACTT 
ACACTATTAT 
CACTCCTTTC 
CAGTGCTTAT 
GGGAGGAGCA 
TCCGACCAAC 
ATGGCGAACG 
CGGCTTGCCC 
GGGATTCTAC 
CCGAGTTAGG 
C7VATACAGAG 
AACACTTCCG 
CCGTTACGAA 



(2) IWrORMATION FOR SEQ ID NO:H5 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 2442 base pairs 

(B) TYPE: nucleic acid 
iC] STRANDEDNESS: double 
(O) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AliTI-SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGAl^ISM: P0RYPHYR0I10NAS GINGIVALIS 
(ix) FEATURE; 

(A) NWiE/KEY; misc featur-- 

IB) LOCATIOM 1...24 4 2 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 115 



CCCCGTTTTC 
AAATCAAATC 
AAAATAAGAT 
CTCCAAGCTC 
GCCGTAGAAA 
AAAGAAGCTC 
GTAAGCAGCA 
AATATCGACA 
ATCGATCTGA 
GCAGGAGGAT 



ATCCCGGAGG 
AGTCATCGTG 
TCAGCCTCTT 
AAGAGGAAGG 
CGCCCAAGAA 
CCCATGAGGC 
TAGCCTACTC 
TCl'TGGACGA 
TAGACAAAAC 
TCGGCCTCTC 



GACACACGCT 
CTTTGGCGGT 
GCAGGCTCTT 
TATTTGGAAT 
AGTCTTTGCC 
AAAGATCT*rC 
CGAGCAACTA 
AGCAGGCCGT 
GCTCAATCGC 
CGTTCTGGAT 



CAGCTTTGCA 
GGCACAGCTC 
GTCGTCl'GCT 
ACCCTCTTGG 
GTAGCCAACG 
GACCGTATCA 
AAATCCTTGG 
GTGACCAACG 
CTTTTGATCG 
GTCGCCGAAG 



GAAATGGGAG 
CGCGATTTTT 
TATTGTTCAC 
CTATCCACAA 
GAGTACTTTA 
GCGGACTCAG 
TCATATACTA 
TACCl'GCATT 
TAGGCAACAG 
CTCGCATACC 



AAGAAACCAA 
TATTATGTGC 
CTCTTTTTCT 
GACGGAAAAA 
CTCGGTGGGC 
CGATACATCG 
TGCATCAGGC 
GAAAGACAAT 
GGCTTATTTG 
GGCTACCTAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2358 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
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GCCAAGGGAA CTAAGGTGAC CGATGTGGCT 
AAAGAAGGGC AGCTCTTCAT CGGAAAAGAG 
ACAGCCTTGT CTTTGAATTT GCCGATGGGC 
GACATCTGTT TCCTGCTCGC CGATGGCCGT 
5 CCGGAGCTAT TtjCTCTCTTC CTCCGCCGAT 
TTCaTCTTGTG CCGAGAATCG AATTTATTTC 
CCTATAGCCG ACGTCCTTGG TGTCGGTGCC 
TTGGGAGAAG AAGGTTTGGC TTCACTTCTr 
CCTGTAGCAT TCGACGGACC GGGGGACAAT 

10 CGTCTGTATG CAGCCAGCGG ACTCTGGGGA 
AAGCTATACG ACGGCAACCG ATGGACTAAC 
GGCGGCGGAT TCAGTTTCAA TGATGCTATC 
CACTTTTTTG TCGGTACATG GGGAAACGGT 
GCTCGCTATT CGGGAAACGA AACTGCTATC 

15 AAAGCGATTG CCTTTGACAA TAAGGGCAAC 
AACATCTTCA TGTACGATCC GCAGAGTAGC 
GCCAATCTGG CCTCCTTCGG CAATATGATT 
ATCCTTCAOC GTAGTGGCGG ATCCACGCGC 
ACACCGGAAA CGACTTCGGA CGACAGCCAT 

20 GGGGCAGCCA TAGGACATAA GACTATCTAT 
TGGATGGGAT CGGATATAGG CATTTTCGGC 
ACTTCTACCC CTATCGCTGT TCGGCCGGTC 
CTGGACAAGG TGACGGTGAC AGACATCGTC 
ACCCAAGGGA CAGGACTCTA TCrCCTTTCG 

25 ACCGTAGAAA ACAGCCCTTT GCTTTCTAAC 
AACGGACTGC TGTACATCGG TACGGCGGAC 
AGTGGATCAG CTTCCGAACT GGACGGCGTC 
TATCCCGATG GCGTCACCAT TGCCGGACTG 
ACXACCGGCA GACTGCTATA CCAGACTGAG 

30 CGAGGTGCCG ATGGCAATAG GGTA(3CTTCG 
GTATC6AAAA AGTCCAAACT AATTCGCTTC 



AACrrGGACA ATGATCGCTT GCTGATGCTC 660 

ACCGATAACC TGCAAGATCC GGCCGCATGG 7^0 

TCGGTCACCG GTCTGGGCAT TGTCGGGGAA 780 

GTATATGTCG CTTGCAAACXA ATCCTTTGAG 84 0 

TCACGACTGT ATGTGACGGA TCGTGGTCTG 900 

ATAGAAAAAG GTCGCAAAAC GACACAATTT 960 

ATGAACGAAA GCAATACGGC ATACATAGCA i020 

CTCGCAGAGG GAAGTACGGC CGAAGCCATG 1080 

GATTTCTACG AGATGCGGTT TAGTCACGGA 114 0 

ACAAACCTGA TGGGACATGC CGGTATG6TG 1200 

TTCGACAAGA AGACCGTACA GGAACAGTTG 1260 

GATATAGCTG TTTCCAACGG AGACCCCGAT 1320 

CrG-lTCGAAT TCAAGGATGG CAAAGCGATA 1300 

GCAGAATGTA ATCCCG6AGA TGCCCGTGTG 14 40 

CTCTGGGGGA CGCTCGGTGC CGTAGGCAAG 1500 

ACATGGCATT CTTTCAGCTA TC03GATGTA 1560 

ATCCTACCCA ACGGAGACAA ATGGGTAAAT 1620 

AAAGGTGTCT TGATCTTCAA CGATCGGGGT 1680 

CTTTACGTCG AGGAGTTTGT CAATCGCCTC 174 0 

GCAATGGCCG TCGATCATAA CGGCTCT6TC 1800 

GTCTACAA7G CAGCCGGAGT ATTGTCCTCG 18 GO 

GGAGGAGAAG AACCCAATTT GTACTATGTG 1920 

GTGGACAAAC TCAATCACAA ATGGGTTGCC 1960 

GAAGATTG-rA GTAAGATCCT CGCGCAATTT 204 0 

AAGATAC7AT CCCTGGCCrT AAATGACGAT 2100 

GGAC-fGATGA CGTTCCAAAC GGGTACGGGG 2160 

TATGTATACC CCAATCC3CT AAGGCCGGAA 2220 

CAAGCCGGCT GTACTGTCAA AATCACCGAT 2280 

AGCGTAACCA CCGAAGTCAA ATGGAATGCT 2340 

CGCGTArATG CCGTTGCAGT GTACGATCCG 24 00 

GCAGTGATTC GC 24 42 



35 



45 



i2) IH FORMATION FOR SEQ ID HO: 11 6 



11) SEQUEMCE CHARACTERISTICS: 

lA) LENGTH: 3486 ba^e pair? 
(B) TYPE: nucleic acid 
Id STHAllDEDHESS: double 
40 tD) TOPOLOGY: circular 



(ii) tlOLECULB TYPE: DMA (genomic) 
till) JTi'POTHETICAL: NO 
(iv) W^TI-SEHSE: MO 



<vi) ORIGIIIAL SOURCE: 

(A) ORGANISM: PORYPHYRWOHAS GIKGTVALIS 

50 

{L'JL\ FEATURE: 

(A) HAME/raY; mlsc feature 
(Bl LOCATIOM 1, . .3466 

55 (xlj SEQUE1>ICE DESCRIPTIOK: SEQ ID NO: 116 



GCTATTTCTC AGATGAAACG AATACTTCCA 
CTTGCTTTGC CTGCGAAAGC GCAACGAGer 
GCTTCGGGAC ATTGGGTCAA GATACGTGTC 

60 GAACAGCTCC GTGCCAATGG CTTCTCCGAT 
GGAGGGGTGC TTCCCGAAGA TCTGAGCCGG 
GTACTCCUTC AGGGCAATGC GCTGTATTTC 
AATCCGGCCA AAACCACCAT GGAGCATACG 
TTCCTGTCGC ATGCTGCCGG AGCACCTTTG 

65 ^-CGGCCGAGG CTTTGATCGA CTACTACGAT 
TCGCCCAAAG AATCGGGACG AGATCTGTAT 
ACGGTCAAGT TCCCTTTGAG GGGCAACACC 
TCATACATAG CCAAGGCCAG ATCGGCCGGT 
GGCATTCTGA TCTTCAGCGA TCCTTTTTCC 

70 TTGGCCGGCA AGAAGCGTCG TCTCl'ATCAC 
TTGCGCrrCG ACGCGAACTA TAGCATGACA 
GTGGCTACAC AGAACGACCT CCGGTACGAT 
AATTTGCCCG TTTTGGGGGG CGAGTCCTGC 
CTGGTGGTTT TGCAGGCCAA TTCTTCCCTG 

75 GGGGATAAGA CCATTGAGTT CGTGGCTCCG 



ATAGTCGCAT TCCTTTCTCT CTTCCTTGCC 60 

ATGGGGAAGA CGGCCGACCG TTCGCl'AATG 120 

GATGCAAGTG GAGTGTATCG CCTTACGGAC 180 

CCGTCCAA'3G TAGGTGTGTT CGGTTATGGT 240 

ATCACGACAG ACGATTTGCC TCCGGTACCG 300 

TATGCCGTGG GCCCGGTGAC ATGGTTCTAC 360 

GTGAATACAT ACAGTACGCA TGGCTACTAC 4 20 

CACATGTCCC AATATACGGG TGGAGGTGCG 4 80 

GAGCTGATGC TCCATGAACA GGAATTGTAT 54 0 

GCCGAGTCTT TCAGTCCAGT CAATACGCGT 600 

CG'-TCGTCTG GCGAACTCGG TACCGTATTC 660 

GGCGGCCGTG AGATGTCGCT CTCGGCGAAT 720 

ATGACATG3A ATGAAGTCTC CAATTCCTAT 780 

AGTACGCCGA TGAACAGCTT GGTCAATGAG 840 

GGAGATGCGG TCAATCTGGA TTTCATAGAG 900 

GGCGCACCCA TGCATATCAG GCGGTTTTCC 960 

CGGTTCGTTA TCAGTGAGGT GCCGGAGTCT 1020 

ACAGCATCGC TTGTTCCCGT TAAGACTGTC 1080 

CCGAAGGG7C AGGATCGTAG OACTATCAAT 114 0 
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ACGTTrTATG CCGTGGACTT GTCACAGGCT 
AATCAAAACC TGCATGGAGA GGAAATCCCT 
CTCCTTGAGG CTGATCGACT GGCCACCTAT 
GTCGTGTTGC AGGAACAGGT GTTCAACGAG 
5 TACCGCCTLT TTGCCAAAAT GTTCTACGAC 
ACCTTCCCGA TGCAAATGCT TCTCTTOSGT 
GTAGCTTGGC AGAAACCGTA TCTCCAACAA 
AATTCGACGA ACGTAAACAG TTATGTGACG 
CCGGCCrCGG TCAATATCGG TTCGCGCAAT 

10 CGTACTCCGG CCGAAGCTCG CATCGCAGTG 
GAGAGTGGT6 CCrGGCGTAT TCGTGCCTGT 
ACCGAGACTT CCCGTTTGAT CGATACCGTC 
CGCGCCTTTC AGGACGTATA TCCGCATGTC 
GCAAAGAAAA AGATGCTGGA AACCCTTCAG 

15 CATGGCGGTC CTGCCGGATG GTCGGACOAG 
TTCAATTATA AGCATATGCC CATTTGGATT 
AGTCAGACGA CCTCGGCAGG GGAGGAGGTT 
ATGTTCTCGA CTACGCGTGT CGTTTACAAT 
CTTCGGCGTA TGTTCGAGAA AGCTAAG6AT 

20 CGATCGGCCA AACAGGGGAT GCTCAGTACT 
TTCTTTCTGA TGGGTGATCC GTCCGTGCGT 
ACCGCAATCA ACGGGCAGGA TCCCGAAGGG 
GAACGGGTAG CTCTGAAGGG TAAGGTAACC 
AGTGGCAAGG TTTTCCTGAC CGTCTTCGAT 

25 GAGGGAAACG ATCTCTCTCT TGTATATTAT 
CCCGAGGTGA AAGACGGACT CTTCGAAACT 
TCCGAGCACG AAGGCCGGAT CAATCTTTAT 
ATGGGGGTAG ACTTCTCCAT CAQAGTCCAA 
AATACACCGC CTGAAATCAT AAGCTGCTTC 

30 GAGGTTAATC CTACTOCTCT GTTTATGGCC 
ACGGGTAGCG GAGTAGGGCA TGATATTACG 
TACAACCTCA ATGCATATTT CACAAGTTCG 
TTCATGATAC CGOCTTTGGC CGAAGGAGAT 
TTCAATAATG CCGTCCATCA TGACTTTTCA 

35 GTGGCTGATG TGATTCTATT CCCGAATCCG 
CACAATCGCC CCGGAAGCGA TTTGAACGTG 
CTTGTGAACA GTTTGCCAGT CAAGACCTAT 
AAGTGGGATC TGACCTCCAA ATACGGAGTG 
TGTGTCGTGA ACTCTCCCGG AGGACAGACG 

40 GGACAA 



TCTGCTCCGG AGATCCTCGG AGCGGTACCC 1200 

GATCTGATCA TTGTCTCTAC TCAGGCGCTC 1260 

CGTAGAGAGA AAAACGGGCT GAAGGTTTTG 1320 

TTTTCGGGTG GAACTCCCGA TGCTACAGCA 1380 

AGATGGAAGG CAAATGCACC TGTGGGAGAC 14 4 0 

GATGGGGCTC ATGACAACAG GAAGGTCTCC 1500 

ACGGAGTTCT TGCTGACATT CCAAGCCGTC 1560 

GATGATTACT TCGGCTTGCT GGATGATCAG 1620 

TATAATATGG CTGTAGQGCG ATTCCCCGTA 1680 

GACAAGACCA TCCGATATGA GGAGGATCGA 1740 

TTTGCGGCAG AGAACGGGGA CAAGCAOGCA 1800 

AAGCGTTATG CTCCTGCCAT CATGCCGGTA 1860 

ATCGAGAACG GGTTGCACAG CATTCCGGGT 1920 

TCGGGTATTA TCCTGCTTAA TTATGCTGGT 19ttO 

CATTTGCTSA CX3CTCAACGA TATACACAAA 204 0 

ACTGCCACCT GCGACTTTGC CAACTATGAC 2100 

TTCCTCCATG AGAAGAGTGG CACTCOSATC 2160 

ACGCAGAATG AGAAGATCAA TGGTTTTATG 2220 

GGGCGTTATC GTACGATGGG CGAGATTATC 2280 

GTTTTCCCOS ATTCGATCAA CCAGTTGAGT 234 0 

ATGAATCTTC CTACCCACAA AGTGCAATTG 24 00 

CAGTATGGAA CTATTATGCT CAAGTCTTTG 2460 

GATGAAAAGG GGACATTCGA CGAGACATTC 2S20 

GGCAGAAAGA AAATGACAGC ITTGGAAGAG 2580 

GACTATCCTA ACGTGATGTA TGCCGGTATT 2540 

TCGri'TATCG TACCCAAGGA TGTGAACTAT 27 00 

GCTTATAACG AGAGCACAAA GGCGGAAGCC 2760 

CCGGGTATTC CTGATGAGGT AACGGAAGAT 2820 

CTCAATGAGA GTACATTCCG ATCGGGAGAT 2880 

GAAGTATTCG ACTTGAATGG AATCAATATC 2 94 0 

CTTTGTATCG ATGGCCGTGC CGACCTGACC 3000 

GCTACGGATG CAGGTGTGGG CACTATTCTC 3060 

CATACTGCCC GACTGACGGT TTGGGACATT 3120 

TTCAGAGTGG TAGATGGCAT TGCTCCGGAT 31 BO 

GTACGCGAGA GTGCTACGTT CCGAATCTTC 324 0 

GCCGTGGAGA TCTATQACTT CACCGGTCGT 3300 

TCGTCTTCCT ACGGAGAACC TATAGAGATC 3360 

AAGATCGGAA ACGGATTCTA CCTCTATCGT 3420 

GCCTCCATGG CCAAGAAAAT GATCGTGGTA 3480 

3486 



(2) IMFORJIATIOK FOR SEQ ID NO: 117 

45 (i) SEQUEHCE CHARACTERISTICS: 

(Al LENGTH: 2 919 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIiDEDHESS: double 
(01 TOPOLOGV: circular 

50 

(iij MOLECULE TYPE: WA (genomic) 

(iii) HYPOTHETICAL: NO 

55 (iv) AUTI -SENSE: MO 

(Vl( ORIGINAL SOURCE; 

tA) ORGAJ^ISM: PORYPKYROMWAS GTWGIVALIS 

60 (Ixj FEATURE: 

(A> MA14E/KEY: misc_£eat.urc 
(B) LOCATION 1. . .2919 



65 



(yi) SEQUENCE OESCRIPTION: SEQ ID MO: 117 



TTTTACACCG GAATTGACTT GCATATAGAA AGCCAAATGA AGAAACT^TT CCCATTATTA 60 

TTACTCATAT TGTCGATTTT GGTCGGATGT GGAAAAAAAG AAAAACACTC TGTAACTGAA 120 

ATCGCCCGAG AGAAAAAGCG TATTACTGCA TTGCTGTACG AAAAAGAACT CCCCACTGAT 180 

TCTGTTAAAC AGCTTTACGA GAACAGCGTA CAGAACAAGA ACCTTGT6GG ACAAATGTTA 240 

70 TTCGCGATCG AGGTCGGCAA ACGGATGCGT AATATGTCCC AATACACAGA TGCGATGCTA 300 

TATCACCAAA AAGGGTTGAA CGCTGCATTG AATCTAAGGG ACACCATCGT AGCCGCACAA 360 

GCATGGAATC ATCTCGGAAC GGATAGCCGT CGTATCGGTG CTTTGGCAGA AGCTTCGGAT 4 20 

TATCATTACA AGGCTCTTTC CTTGATAGAA TCTTTTAGCG GAAACCAGAA TAGGCCI'GCG 4 80 

ATCAAAGCCA GATCGGCGGC CCTGAACGGC ATCGGCAACA TCAATCTTGA GTTAGGATAC 540 

75 CATGATGAGG CCGAAAAGAA TTTCCT6AAA GCACTGCAAG GTGAGAAAGA ACTCGACAGT 600 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



CCTCrCGGGC 
TACGACAACjG 
CTGATGGGTA 
TATCAAAAGG 
CGATGGCACT 
GAAAGGCTCl' 
ACTTCACATC 
TACAAACAAG 
GCGCACAAGG 
AAAGAGCTTG 
CTTCTTGAGCA 
GCATACCGTC 
GGCTTTTTCA 
AATGAGAAAA 
ATAATAGACA 
AAGATCAGAA 
CAGATTCTCA 
CAACCCGAGA 
TCCAATCTTT 
CTGGCAAAAA 
ATAGATAAAA 
GAAAAGCATG 
GCTACGATCA 
ACACAAAACC 
GTCATGCCTG 
AATCATCGCT 
AACnTGCTCG 
AAAGAGGGTA 
ATGCCGATAA 
CACATTCCCA 
AAAAGCGGTG 
ATCGAGCAGC 
erGGATCGGA 
CTTGCTGCCA 
GACTTGGCCG 
GTAGGTTGCT 
CTTGCCGATG 
GCTTACTTCT 
AAACTCCTTG 



AAGCTATCAA 
CrCG'i'ACCTA 
TCGGACTCTG 
CTTTGCAAGA 
GGCTGAATTC 
ACCAGCATTT 
TGATAGAAAT 
CCCTCGAAGC 
TCAGCAGCAT 
AAGAAATACA 
CTCTTTTTGC 
AGGGCAAGAA 
CCGGTATTAC 
TGAGTTCAAG 
GACAGAGTAG 
GCGGAGTATC 
TCGATTCGTT 
GCAAACCTAT 
TGTCCAATGC 
CCAAGAATGA 
CTGATCAGGC 
GATCAGGCGT 
AAGTGGAAAG 
AGTCCTCTTC 
TCCACATCGC 
TCGAGGACGA 
TCAAACTACT 
TAGCCCTCGC 
TGGATGGGAT 
'II'GTCGCTTT 
TAGTCTCTTA 
TTCTGAAAGA 
AGCCGGAGGA 
AAGATGCAGT 
AAAAAATGTG 
CCACCATCGG 
AGAGCAAAAA 
CTCGCACCTT 
CCATGCCGGG 



CTATGCGAAC 
CTTCCTCTTG 
TAGCATCy\AT 
GTATGCCACG 
CTGTATCCCG 
CATTTCTTTC 
ATACAATCTT 
ATTCTGTCTG 
ACAAGAAACG 
GCAAGTAAGC 
CCTTTTCATC 
GCATAACAAG 
ACACGAATTT 
TCCTGATCTC 
CCATATGCTG 
CACGCCCGAA 
TGCACCATAC 
TGTCGTGGAC 
CATCAAGTAT 
AAAAAATCTG 
TCATATCTTC 
CGGACTCTCG 
CCAGCCGGGG 
GGCAGAGATT 
GCCCGATGAC 
ACGTCCGACC 
CCTTTGCGAT 
TACCGAGCAT 
AGAAATGACA 
GACGGCCAAC 
TCTATGCAAG 
CCGTGAGTTG 
GGAGCCTCAA 
GTCCGGTGGA 
CATGAGTCCA 
CTACATACAG 
CATCTCCGAC 
CAAACGCTAC 
GAGCGACAAG 



CTCGGACGTA 
TCTCTGGAGC 
CTCGGAGAAG 
GCATACAAAC 
ATGGCACGTA 
GCCGAAGGGA 
CAATACGAGA 
AGCAAGACGT 
CGATTCAACT 
AAGGCAAAAC 
TOGATTCTTT 
CTGATCAAAG 
CGTACGCCTA 
CAAGCATCGG 
AATTTGGTGA 
TGGCGGAATG 
GCACAGGCTC 
TTCGTCCCCT 
TCTTTAGCCXj* 
ATCATACGCG 
GACATCTTCT 
TTl'ACCAATA 
AAAGGAAGTG 
CTTCCTTGGC 
TCACCGACAT 
ATACTGCTCG 
CGCTACAATG 
ATTCCCGACA 
ATCCGGATGA 
AGTACCGAAC 
CCATTCTGTC 
CTCAAGAAGT 
CCGATAGATG 
ATCAAACAAA 
TCCCAACTCA 
CACATCAAGA 
ATTAGCATTG 
ATGAACTGCT 
GAGACAGTT 



TCTATCGACA 
AGAACAATAT 
TAGACGAAGA 
TGATGGAACA 
TCAATCTCAA 
CTGCGAAAGA 
ATCTCGAGCG 
TGAGCGACAG 
ACGAACGAAA 
AAGAGAAATC 
TGATTTCTGT 
AGACGGATAA 
TCACCGTCAT 
ACAGAACCQA 
ACCAGCTGTT 
CCGACATCGT 
AAGACATAAC 
CCTACTTGCA 
GAGGGAGAGT 
TTGCAGACAA 
ATCGAGGACA 
TACTGGTCGA 
CCITCACCAT 
TACCCTCCTC 
CTCCGATGGT 
TCGAGGACAA 
TGCTATCCGC 
TTArCATTAC 
AGCAATCGCC 
AGGACAGATT 
CGGAGGAGCT 
TCTATATGCA 
ACAGCAGTAT 
ATCCGGATTT 
ACAGAAAGCT 
TAAAATTGGC 
AGGCAGGCTT 
CTCCCTCCCA 



ACGCAAAGAA 
GGCAGAGAAT 
AAAAGGGGAT 
GTTGTCCQAT 
ACAAGGTAAC 
AATTAATTCG 
TAAAAAAGAA 
CATGTCCATT 
CAACTCCCAA 
GAAGTTTATC 
TCTGACATAT 
ACTTCGCTCC 
ACAAGGTTTG 
GCTGCACAAG 
GGATATTTGC 
CTCCTTCGTA 
CTTGGAGCTA 
AAAAATCATA 
GGTCATATCT 
TGGCATAGGA 
CTCCGCTACC 
AAACCTTCGA 
CAGTATTCCT 
CGATGACATT 
AGCAGerCTG 
TAAGGATATC 
CGCAAACGGA 
GGATATTATG 
TCTGCTCTGT 
GGAAGGAATC 
TTTGATGCGG 
AAAACTCATG 
GCAGTTTCTC 
TTCCGCTCAA 
CACGAGTGTC 
CTGCAAGCTC 
TTCGGATCCG 
ATATCGGCAA 



(2) INFORItATia^ FOR SEQ ID NO: 118 

(i) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 1689 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDF.nNESS: double 

(D) TOPOLOGY: circular 

(ii) >IOr.ECULE TYPE: DNA (genomic) 
(iiij HYPOTHETICAL: HO 
(IvJ AHTI -SENSE: NO 

tvil ORIGINAL SOURCE: 

<A) ORGANISM: PORYPHYRa^ONAS GIHGIVALIS 
(ix) FEATURE: 

(A) NAME/KEY; misc feature 

(DJ LOCATION 1...1689 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118 



CATCATAAAA 
TTGGGACTCT 
GGTCAATTTG 
CCTGTGCGTG 
GGTATCCCGA 
GCGAGTGAAA 
TCTAATTGCG 
TGGTTCGCAA 
CGTCCTAACG 
GGGATGAAGC 
CAGTCACATA 



CATATCAAAC 
TCCAACTGCC 
CAACAGAAGA 
CTATCGCTGA 
TGGAATTGAT 
GCCAAAAAAA 
ATTTCATCAT 
TGTACGATAC 
ATGATGAATT 
TCAAGCAGAC 
TCGCATATAC 



AATGAAAAAG 
CGCAATCGCC 
GATGCAACGA 
GTACGAACGC 
CAAAGAGCTG 
CACCGTTATA 
1*GCGAAAACT 
GAACAAAGTA 
CCCCAAATAC 
CGCTGGCAAC 
GGAGAACTCC 



CTTTTACAGG 
CAAACGCAAA 
GCATTCCAGG 
TCTGCAGCCG 
GCCAAGAACG 
ACCCAGTACA 
GACTCTTACT 
GGTCTCGTGG 
GAAGCACAAT 
TACATGACGG 
TCTCTGTCTC 



CTAAAGCCTT 
TGCAAGCAGA 
AAACGAATCC 
TTTTGGTACG 
ACAAGGTGAT 
CCCAAAGCGG 
GGACACGCGA 
ACTTTATTTA 
ATCTGGGCAT 
ACGGATATGG 
AAGCTCAACT 



GATTCTGGCA 
CCGAACAAAC 
CCCTGCAGGT 
CTACCCGTTC 
TACCATTGTC: 
TGTGAATCTC 
CTATACCG3T 
TAACCGCCCT 
CGAGATGTTC 
ATCCGCTGl'G 
AAATCAAAAG 



660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
24 00 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2919 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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10 



15 



20 



ATGAAAGACT 
ATCAACCATU 
GTGCCTGACA 
CAGACCTGCG 
CCGTACACGA 
TCCGTGGACA 
GGTGTCAAAG 
CACGAiSGTAG 
GCAGGCCCTG 
CCGGTACAAT 
ATGCAATCAA 
GAATACTATA 
GAACCTGATC 
GCTGCCAAAG 
AGTTTGAATA 
6CTGCAATGA 
CAGGCTCCGG 
ATTCrcAAA 



ATCTCGGCAT 
TGGACTGTTG 
ATCACCCTCA 
CATGGGGAAC 
ACTCTCTGAT 
ACGATGCTCT 
GGGCTTCAGG 
CGGATAAGGG 
ATTATAAGAT 
GTTACTATCG 
CAGGTCACTA 
TCTCTGCCGC 
CTTTCAAGTT 
CTCTTCGTGC 
TTGCCGGCAC 
CCAAGGAATT 
GCACATATGT 



CACACAYCAT 
GGGCAA6TAT 
GCACCAAGCC 
GAAGTACGAG 
TCTGAACAAC 
GAACGTCTAT 
AACACCTTGG 
CTATCTCTAT 
CGAAGCAGAT 
TATCAATGGT 
TACTTATAGC 
TGACAATAGT 
TACGTGTATG 
ATGGTTCAAC 
ATATCGGATA 
AGTAGCAGGG 
TCTGGTTGTT 



GATGTGGTAC 
TTGGCACCGA 
CTGGAAGATA 
GTATATCGCG 
AGGGTATTTG 
AAGACGGCAA 
TTAGGAACAG 
ATCAAGCACT 
GTCGTCTCAT 
TCCGGTAGCT 
TTTACAGGTC 
GGTCGCAAAG 
AACGAAACCA 
GCCGGTCGTT 
AAGCTTTATA 
ACGAGTGTCT 
GAAGGAAATG 



AAGATCCGAA 
ACAAAATCCT 
TGGCAGCCTA 
CTTTGGCCAC 
TTCCTGTCAA 
TGCCCGGTTA 
ATGCCCTGCA 
ACCCGATACT 
GCGCCTVATGC 
TTAAGGCTGC 
TTAACAAGAA 
AGACTTATCC 
ATACATGTAC 
CAGAACTGGC 
ACACCGCAGG 
TCAGTATGGA 
GAATCCGTGA 



CGGCGAATAT 
CATCAGGAAA 
CTTCGCAGCA 
CAATGAACAA 
TGGCCCCGCC 
CGAAATTATA 
TTGTCGTACT 
GGGCGAACAG 
TACTATCTCG 
TGATATGACG 
TGATAAGGTA 
CTTTATCGGC 
TGTGACCGGA 
TGTTTCGGTA 
AGAAGAAGTC 
TGTGTATTCT 
GACAATGAAA 



720 
780 
840 
900 
960 
1020 
1080 
ll-tO 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
16G9 



t2) IKFORl-lATIOr^ FOR SEQ ID HO: 119 



25 



(i) SEQUENCE CHARACTERISTICS: 

(A) LBUGTH: 1311 base pair^ 
(E) TYPE: nucleic acid 

(C) STRANDEDNF.SS: double 

(D) TOPOLOGY: circular 



30 



MOLECULE TYPE: DNA (genomic) 
(HI) HYPOTHLTICAL: NO 



35 



(iv) Af(TI -SENSE; NO 



|vi) 



ORIGINAL SOURCE: 
(A) ORGAtlISM: PORYPHYROMOfJAS GINGIVALIS 



40 



45 



50 



55 



60 



65 



(ix) FEATURE: 

I A] tWlE/KEY: misc^feature 
iB) LOCATIOM 1. . .1311 

Ixi) SEQUENCE DESCRIPTIOtJ: SEQ ID NO:119 



ACCACAAATA 
ATTGCATTGC 
GGGATTCGCA 
CATTCCATAG 
CGAGAGGAAA 
CTCACTATCT 
GATGTAACGA 
TTGGACTTGA 
AGTTTGGACC 
ACTGAGCTGA 
CTGACGGAGT 
CGGATCACGA 
AATGAGCTTT 
TCGAACAAAC 
CATTCCAATC 
TCTCCCGGCG 
AAGAATATCT 
GACTTCAACG 
GCAGTAGATG 
GAGATCCCCG 
GTCTATAGTT 
GCCACTTATT 



GAAAACCAAA 
TGACGATGGG 
TGTCTGTCAC 
AGT^AGAAAGG 
TAACCGTATT 
ATGCl'AATAC 
AAAACCCTTAA 
CGCAAAACCC 
TGAGTGGCAA 
AGACCGATAA 
TGGAACTCAG 
AACTCGATCT 
CGACCTTGGA 
TTACATCCTTT 
AGATCGCAGG 
CAGGCGCTCA 
GTACCGTAAA 
GTGATTCTGA 
CTCCCACTGT 
AGTCTCTTTT 
TCGCGGTAGA 
TGTTCCGTCT 



TACTAATATG 
ACATGCTGTG 
TACAACCAAG 
CATCTGGATC 
CGATGAGGCA 
GACCCGATTG 
TCTGACCTAT 
AAAGCTGCTG 
TCCGGCTTTG 
CAACCCCAAG 
TGCCAATCCT 
GAGTGCCAAT 
TCTTTCCAAG 
GAATCTGTCG 
TGAAGAAATG 
GAGCAAGTTC 
GGATGTGGAA 
CAATATGCTT 
CAGGATATAT 
AGGGCAGGAA 
GTCTCTTCGT 
CGATAACTAT 



AAACTTTCAT 
CAGGCACAGT 
GCCGTAGGCG 
GATCTCAATG 
TACCACGAAT 
GGCTGTCGAT 
CTCGCATGCC 
CGAGTTTGGT 
ATCATCCTCG 
TTGGCCTCTC 
CGTCTCAATG 
CCTCTATTGG 
AATTCGGACG 
GGGGl'GAAGG 
ACGAAAGTCG 
GTCGTTGTAG 
AAAGCrAAAA 
rCATACGAAG 
CCCAATCCGQ 
GCTGCTTTAT 
CAGAACATTG 
ACCACTAAGC 



CTAAGAAAAT 
TTGTTCCGGC 
AAAAAATCGA 
GGGATGCCAC 
ACACGATCGG 
C-i'ACCGGTGC 
CGAAAAATAA 
GCGACTCTAA 
GCTGTGACAG 
TTTGGTGTTC 
ATCTTTGGTG 
TAACACTTTG 
TTGCTTACCT 
GACTGAGTGT 
TGAATGCTTT 
ACCTCAAGGA 
GTAAGAACTG 
GAAGTCCGAC 
TAGGAAGATA 
ACGATATGAA 
ACCTGACACA 
TCATCAAACA 



CTTAGC7VATC 
TCCCACCACA 
ATTGrrGGTT 
TTACCAACAA 
GACGCA7VACC 
AACGGCTGTC 
TCTGAAATCA 
CGAAATAGAA 
GAATAAGCTG 
TGATAATAAC 
CTTCGGTAAT 
GTGCAGTGAC 
TTGGT6TTCA 
TTTGGTTTGT 
GCCCACACTA 
CACTGATGAG 
GCGAGTATTT 
ATCGAACTTG 
TGCGCTCGTC 
TGGGGTAAAA 
TCTTCCCGAC 
G 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1311 



70 



75 



(2) INtORflATION FOR SEQ ID NO: 120 

(I) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 954 bare pairs 

(B] TYPE: nucleic acid 

(C) STRANDEDHESS: double 
iD) TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ii"Hv rf^ii ^t=«» (r'=« «il li^i .>t^ If" ij f 'I'j 

1LJ -^1! Zli O ..J^JC^ Cjj Oi ill O O iL-, 1J...I1 4-ji lUS 
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(il) lIOtECULE TYPE: DtJA (genomic) 

tiii) HYPOTHETICAL: NO 

5 tlv) AIITI-SEHGE: NO 

(vil OR I GI HAL SOURCE: 

(Al ORGAIJISM: PORYPHYRa^tOHAS GIWGIVALIS 

10 (ix) FEATURE: 

(A) NAME/ KEY: miff cofeature 

(B) LOCATIOM I. . . 954 



15 



35 



45 



50 



(xl) SFQUENCE DESCRIPTIOIJ: 5EQ ID HO: 120 



AATCATTTGA AGACAAATAT CAAGATGAGA AAAACAATAA TTTTCTGCTT GTTGCTCGCC GO 

CTATTTGGCT GTTCrrGGGC ACAAGAAAGA GTCGATGAAA AAGTATTCTC CGC7W3GAACA 120 

AGTATTTTTA CGGGCATCCT TtiAAAAGGTG AAAGCACCGC TTATGTATGG AGATCGTGAG 180 

CiTATGGGGTA TGGCTCGTGC GAGCGAGGAT Ta'CTTTTTTA TACTTCCCGT TACGGATGAC 240 

20 CTCACTCCCG TGCTTTTCTA TAACCGTCTT ACAAACGAAC CCTGCTTTGT GTCAGACCAA 300 

G6AATAACT6 AGTATTTCAA ATTCGCTCAA GAAGGTGATT ACATTGAAGT CGAAGGAAGC 3 50 

TCTGTATTCA TGGCGAATCT TTTGTACTAT CGTTTTTTCC CGACAAGAAT TACCTCCTAT 4 20 

AATGCTCCCA TTGAAGGTGT TGTGAGCAAG ACGGGAAATC CTGCTTTTAC AATCCCGATG 4 80 

CTOCCGGGGG TTTCTGATTG CATAGAAATC 1*CAAACAACC GCAAAGTCTT TCTGACCAAT 54 0 

25 CAATTAGGG<3 TTGTAAACAT CACTGACGGG ATGOAACCl'C CGATTATTGC CGGAGTCTCT 600 

GCrrcCTATG GATCTTCCGT CCGGGT6TAT GGTCATGTCT CACAGCGGTG GGACATCATA 660 

GGCCATTGCT ATTTGGATAT CTACCCAACC AATTGCTATC CGCTCAGCAC GAAACCCGTT 720 

GCAGGAGACG ATGAGGTTTT TGTCAAACAA CAAGGCAGGC AAATAGAGAT CGATAGCAAC 7 80 

AGCCCCATAG TCCAAGTGGT CGTATACGAT CTTGAGGGGA AAAGTGTTTT TCGCAAAAGA 04 0 

30 ATGACCGAAA ACGCTTATAC CCTATCCTTT AGAGCACCCA TGCTCGGCTT TATGACCATC 900 

ATGATCGAAA CACAAAATTC GATTATCAAT AAAAAACTTA ATGTTACACA GCi'A 954 



U) IMEX3RMATTOM FOR SEQ ID NO: 121 



(i| SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI/DEDNESS: double 
40 (D) TOPOLOGY: circular 



(11) MOLECULE TYPE: DMA (genomic J 

(iltl HYPOTHETICAL: VO 

Uv) AHTI-SEIISE: NO 

(vi) ORIGINAL SOURCE: 

(A» ORGAiriSM: PORY PHYROMQHAS GINGIVALIS 



(IX) FEATURE: 

(A) NAME/KEY: mlsc_feature 

(B) LOCATION 1. . .1383 

55 (xl) SEQUEUCE DESCRIPTION: SEQ ID NO: 121 

AGACGGGCAG TTAAAATCCG ATCACCTCCC CATATCCACT CATTATTTGT ACGTAAATGT 60 

CTTTTTTCAG ACTTTAAATA CTTATATTTA TCCCGAAAAA TTACCCAAGA GAGATTGGGT 120 

CGATTGTCGA TCCGTCTTAA GTCATACAAT CCAATAATCA TTATTGAAAT GAAAAAAACA 180 

60 ACCATTATTT CTTTGATTGT CTTCGGTGCT TTCTTTGCAG CCGTGGGCCA AACCAAGGAC 24 0 

AATTCTTCTT ACAAACCTTT TTCGAAAGAA GATATTGCCG GAGGAGTTTA CTCTCTCCCG 300 

ACTCAAAATC GTGCGCAGAA GGACAATGCC GAGTGGCTTC TTACAGCGAC CGTCTCCACA 360 

AACCAGTCTG CAQATACTCA CTTTATCTTC GATGAGAACA ACCGCTATAT CGCTCGTGAC 42 0 

ATAAAAGCCA ATGGGGTAAG AAAATCCACG GACTCCATTT ACTACGATGC CAAC6GGCGA 4flO 

65 ATATCGCATG TGGATCTTTA TATCTCGTTC AGTGGCGGAG AGCCTGCACT CGACACCCGA 540 

TTCAAGTACA CCTATGATGA CGAGGGAAAG ATGACCGTGA GGGAAGTATT CATGCTGGTA 600 

ATGGATCCGA ATACACCTAT CTCACGCTTG GAATATCATT ATGATGCACA GGGCAGACTG 660 

ACCCACTGGA TTTCITTTGC TTTCGGGGCA GAATCCCAAA AGAATACGTA TCACTATAAT 720 

GAAAAAG-jTC TGTTGGTCAG CGAAGTGCTG AGCAATGCAA TGGGGACAAC CTATTCAGAC 780 

70 ACCGGCAAAA CGCAATACAG CTATGACGAT GCAGATAATA TGGTGAAGGC CGAGTACTTC 84 0 

GTCGTCCAGC AAGGAAAGGC ATGGCAAGTA CTCAAAAGAG AGGAATACAC CTATGAGGAC 900 

7\ATATCTGCA TACAATATTT GGCTATTAAC GGTACCGACA CAAAGGTGTA CAAGCGAGAC 960 

ATCGAGAGCG ATAAGTCCAT CTCCGCAAAT GTCATTGACA TTCCGTCAAT GCCGGAACAG 1020 

ACCTGGCCTA ATATGTACGG ATTCAACGCA AAGCGACTGA AAGAGACTTA TTCCTCCTAC 1080 

75 GAAGGAGATG TGGCTACTCC TATATTCGAC TATATCTATA CGTACAAGGC TCTTACCTCA 114 0 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



U i53 J " 



J . iii' -iT-'ii tf-"^ 

; o 113 lu ii-s o ic: o s^is ^ 



WO 99/29870 
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ATGGCAACAC CTTCGACAGA AGCTCAGGTA GCAGTCTATC TCAATCCGTC AACCGACCGG 1200 

TTAGTGATTC TGGCCAACGG CATCACACAT CTCAGCATGT ACGACrTGCA GGGTAAGCTT 12 60 

ATCCGTGATT GTGCCTTGAG CGGCGATAAG GTGGAAATGG GTGTCGGATC TTTGACCAAA 1320 

GGGACATACC TGCTTAAAGT GAATACGGAT CAGGGAGCCT TTGTGACAAA AGTCCiTGATT 1380 

CGA 1383 



10 



15 



20 



25 



(2) IllFOWlATIOH FOR SEQ ID HO: 122 

ii] SEgUEJJCE CHARACTERISTICS: 

(A) LENGTH; 131:3 basa pairs 
(Bl TYPR: nucleic acid 
(C) STRAtlDEDNESS: double 
{V» TOPOLOGY: circular 

(li) MOLECULF. TYPE; DNA (genomic) 

tliij HYPOa'HETICAL: NO 

(iv) A1>(TI-SEI-ISE: NO 

( V I) ORI GI rJAL SOURCE : 

(A) ORGAI/ISM: PORYPHi ROf lOHAS GINCIVALIS 

(ixt FEATURE: " 

[hi HA11E/KE\: mi£C_f eature 
IB) LOCATION 1. , .1353 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xij SEQUEIfCE DESCRIPTION: SEQ ID NO: 122 



ATGGCAAAAG 
CCCGAGATGC 
GAAGGTGTTA 
CTGATGCACC 
ATCGCGGTGA 
CTGAACGAAT 
AAGGAGCTTT 
GTGGCTACAC 
TTGGCTCCGG 
GATGCCTTGG 
CTGGGCTTGC 
GGCGTGCTGA 
GCTACCGACC 
ATGATTGCCA 
TGCAATGTCT 
ATCGATGGCA 
CGGTGTGACC 
GCTGCACCCC 
AAAAACAAAG 
AGCAACGAGT 
ATTATAGCAT 
GACTTTGCCA 
GAGGGCTTGG 



TTATAAAAAC 
TGGCCGAACC 
TCCCCAAGGT 
ACAAGGCATA 
ATCGCGGTGC 
ACGAGTCATT 
TACTGTCGAG 
CGGATATAGC 
ACTTCGATTT 
CCAAACTCAC 
ACAATGCAGA 
TCAATCATAC 
TGATCGTGAT 
TGACCGGCTC 
TTGCTTCCTT 
ATGTGCTGAC 
AGATCACGGT 
GTCTCGATCA 
AGTACGTACT 
ATGACCGCGT 
TCGACATCGA 
CTTGCGAATT 
ATATGCTCTA 



AAAAAAAGGC 
GGCCCAAAGT 
GACGGCTCGT 
TCCGGAGATG 
CAAGCGCAAG 
CCCTGTCGGG 
CGGTATGTGG 
TCCACGCGAC 
CATCGTTCGA 
GACAGGAAAG 
AATCGTAGAA 
GAAGCCAATC 
CGGACGTTTC 
AGACGCTGCA 
CCCCGGCCGA 
CGGTAAGAAG 
GATCCCCGAA 
GTACAGCATG 
CGATGCCCG-j 
TTTCCCGATG 
CAAGATGGAG 
TGTGGATACA 
TAAGGAAATG 



CTTGCACTTA 
CCTACTTACG 
CCGGGGGATA 
AAGTTTACAA 
GTGTTGAGCA 
GATCCGTCTG 
GGTTTTATTA 
ATTTATATTA 
GGAGAAGAAC 
GTGTATGTGG 
GTACACGGAC 
AATCGGGGCG 
CTGCTTACGG 
OCTCACGGAT 
CTGACAATAA 
CTCTGCGAGA 
GGCGACGATG 
AGCAGAGCTT 
ATCAAGGGTG 
GACATCTATC 
GACTTAGGCA 
TCCAAGATCG 
AAT 



ATCTGAAAGG 
CGCTCGTGCU 
AGGTGCGTGC 
GTCCGGTTAG 
TCGAGGTGAA 
CCCTCTCTGC 
AGCAACGTCC 
CTGCCAACTT 
GCGCCCTGCA 
GCCTGAAGCC 
CTCATCCGGC 
AAACGGTGTG 
GCAAAGCCGA 
ACGTCCGTAT 
AGGAATCTCA 
AGGAGCCT7T 
TGGACGAACT 
ATTTCTCTTG 
GCGAACGTQC 
CGGAGTATTT 
TATATGAAGT 
AGCTGCAGCG 



AAAACCGCTG 
CGACGATTTT 
CGGCTCAGCA 
CGGCGAAGTG 
ACCGGACGGA 
CGAACAGATC 
TTACGACATA 
TACTGCACCA 
GACTGCCATC 
GGGTTCATCT 
AGGTAACGTG 
GACGCTCAAG 
TTTTACCAGA 
TATGCCGGGT 
CGAGCGTGTG 
CCTGTCAGCC 
CTTCCGGTGG 
GTTGCAGGGG 
TATGATCATG 
GCTCAAGGCT 
GGCTCCGGAG 
TATCGTTCGC 



(2) IIIFORHATION FOR SEQ ID NO: 123 

Ii> SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 585 ba?* pairs 
(D) l-YPE: nucleic acid 
fCl STRAtiDEDNESS: double 
tOI TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(lii) HVPOTHETICAL: NO 

(Iv) AllTI-SEHSE: HO 

fvi) ORIGINAL SOURCE : 

(A) ORGAttISM: PORYPK-i'RaiONAS GIl/GTVALlS 

(Ix) FEATURE: 

(A) NAIIE/KEY: inlcc_f-ature 

(B) LOCATiat 1 . . .585 



60 
120 
180 
240 
300 
360 
420 
430 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
108 0 
1140 
1200 
1260 
1320 
1353 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(xll SEQUEHCE DSSCRIPTION: SfclQ ZD NO:123 

ATGGATAAAG TGAGCTATGC TCTGGGATTG AGCATCGGTA ATAAT1TCAA GTCTTCGGGC 60 

5 ATCGACAGCG TCGTTATGGA TGATTTCATG CAAGGTCTGT CTGATGTACT GGAAGAAAAA 12Q 

CCCCCTCAGC TCTCGTATGA CGAGGCCAAti CGCGAAATAG AGGCGTATTT CATGGATTTG 180 

CAGCAGAAGG CTGTCAAACT GAACAAAGAG GCCGGAGAAG AATTCCTCAA GATAAATGCA 24 0 

CACAAGGAAG GTGTGACGAC CTTACCGAGC GCICTTGCAAT ACGAAGTCAT TAAGATGGGA 300 

GAGGGCCCGA AACCCACCCT TTCGGACACG GTAACCTGTC ATTATCACGG TACGCTCATC 360 

10 AACGGTATCG TTTTCGATAG CTCTATGGAC AGGGGAGAAC CGGCCAGTTT CCCTCTAAGA 420 

GGAGTTATAG CCGGCTGGAC GGAGATTCTT CAATTAATGC CTGTAGGATC CAAGTGGAAA 4flO 

GTAACTATAC CGAGCGATCT GGCGTATGGA GATCGTGGTG CCGGCGAACA TATCAAACCG S4 0 

GGTAGTACGC TCATTTTTAT AATCGAATTA TTGACTATCA ACAAA 585 



15 



25 



(2) IllFORMATION FOR SEQ ID HO: 124 



ii) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 819 base pairs 
20 (B) TYPE: nucleic acid 

(CJ STRAWDEDNESS: double 
(D) TOPOLOGY: circular 



(ill MOLECULE TYPE: DUA (genomic) 
<iii) HYPOTHETICAL; HO 
<iv) AMTI-SENSE: NO 



PCT/AU98/01023 



30 (vil ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRQMONAS GINGIVALIS 

Ux) FEATURE: 

(A) NAtlE/KEY: mtsc_t«a ture 
35 (B) LOCATIOH 1...619 

<xl) SEQUENCE DESCRIPTION: SEQ ID UO:124 

ATGAAAAAAG CATTACTTAT TGGTGCTGCT CTTTTGGGAG CAGTCAGTrr TGCAAGTGCT 60 

40 CAGTCTTTGA GCACAATCAA AGTACAGAAC AATTCAGTAC AGCAACCTCG TGAGGAAGCC 120 

ACTATTCAGG TTTGTGGAGA ATTGGCAGAG CAAGTTGACT GCATTGGGAC AGGTAATTCT 180 

GCAATCATAG CCGCTGCAGC GAAATTTGAA AGCGATGATC TCGAAAGCTA TGTTGGCTGG 24 0 

GAGATCATGA GTGTTGATTT CTTCCCTGGA TATAAAGCGT GCAAGTACAC ATCTGCAGTC 300 

TGGGCTGATG ATATGACCAT TTTGGGCCAA TCAGAAGATA GTGATCCCGA AATGCAGACT 360 

45 ATCAACAATC TTGCTCTCAA GACTAGTGTC AAGATTGAAG CCGGCAAGAA TTACATAGTT 420 

GGTl-ATATTG CTAATACCGC AGGTGGACAT CCTATCGGAT GTCATCAGGG CCCTGCCGTT 480 

GATC.GTTATG GAGATTTGGT TTCTATATCA GAAGATGGTG GTGCTACTTT CCCTCCGTTC i>4 0 

GAATCTCTTC ATCAAGCAGT TCCTACCTTA AATTACAACA TCTATGTCGT TGTTCATTTG 600 

AAGAAGGGTG AAGGTGTTGA GGCTGTTCTT ACCAACGACA AGGCTAATGC TTATGTTCAG 660 

50 AATGGCGTTA TCTATGTAGC CGGAGCTAAT GGTCGTCAGG TATCTCTGTT CGACATGAAC 720 

GGTAAGGTTG TTTATACCGG CGTTAGCGAA ACGATTGCAG CTCCTCAGAA GGGCATGTAT 780 

ATCCTCCGTQ TAGGTGCTAA GAGCATCAAG Cl'GGCTATC 819 

55 (2) IHFORMATIOH FOR SEQ ID MO: 125 

IM SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1662 base pairs 

(B) TYPE: nucleic acid 
60 to STRAtJDEDMESS: double 

{D) TOPOLOGY: circular 

(it) tlOLECUl.E TYPE: DNA (genomic) 

65 (lii> HYPOTHETICAL: NO 

(iv> AMTI-SEHSE: NO 

(vl) ORIGINAL SOURCE: 
70 (A) ORGANISM: PORYPHYROHOHAS GIUGlVAtiS 

(ixt FEATURE: 

(A) HA}^E/KEY: misc_Ieature 
O) LOCATION 1. . .1662 

75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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*xl) SEQUENCE DESCRZPTIOtJ: SZQ ID HO; 125 



ATGCCAAGAA TTATGAAATT AAAAATTGCA 
GTTTTATTTA GCXrCTCTGGC CAAGGCCCAG 
5 GAGACGATGT CCACCCTATC AGGATATTCA 
GACAATGGAT GGATCTATGT GATGTTGGAT 
CTGTATCGTT CCAAAGACGG TGGTGCTACT 
GTGCCTTATG ACTTCGATGT CTCGCATTGC 
GATATCRATG TTTGGACAGT OATGACAGCA 

10 GGCGTTTTGC TGATGCATCG CCATGATGCA 
AAGOATl-rCC CTAATAATAG ACTGATGGGT 
TCTCCTTACG GTTTGQGGGG CGATCCTTTT 
AGCGATCACA GCTTCTTGGA CTATA'ITTTT 
AAGCGTATTT ACACAAGACC CCAAAAACTG 

15 AGTACATCTC CTTCTCTTGG ATTTAATACT 
AATAAGAACC TTGATGGCTT CGACATTGGT 
CGCTATGCGT GGTCTGAACC GATAATAATA 
CCTTTGGGAG CACTAAGTAT AGAGATCCAA 
GT6G&TGGAG AACGCTCCCA TAACTTCCTG 

20 AAGCAATCTT TCAATTAITC TCCCGGACAT 
CACTGTATAC. GTATTCCGGC TTTGGCATAC 
TTTCAAGATC ACAATCTAAT GAGATACAGA 
TATGGTTGGA GTTGGCCATA TGTATATGCA 
CCGCAAGTAG CACTCAATCC TACCAATGGA 

25 AGCCCATATG ATGAAACCAA ACCACATCCT 
TCXTGATACGG AGTGGGTACA TGCTCTGGAC 
ATGAAGCTCT ACCCCAATCC TGCCAAAGAA 
GGGCACGAGG CAGTCGTATA CGACATGCAG 
GGGAAAGAAT ATAAGCTGAA TGTGCAGTAT 

30 GTAGCGGATA GGGAGTATTT CGTGGAAAAA 



CTCAGACTGC TGCTGGCGAC TTTTGCCATA 60 

ATGGATATTG GTGGAGACGA TGTATTGATC 120 

GAGGATTTTT ATTACAAGAT GGCTGTGGCA 160 

TTCTCTCGTA T'lTATTTTGA TGATGTCAGG 24 0 

TACCAAAAGT TAGGGTCrTT GGGGTCTTTG 300 

GATTTTATTG TAACGGGAAA GGATGAAGAT 360 

TTCGAATATG TAGGTGGTAC TATTGGCAAT 4 20 

GATATCAATA ATACAGAGTG TGTGTACAAG 4 80 

CTTAGCCATCG CCTCCAACTA CCGTOCGCCC 54 0 

GCTCTCGCTG TCCCCGTTAG TGGCTCCGGA 600 

TCGTTAGA7G GTGGAGT7\CA CTTTCAGCAA 660 

ACTATCAATA GAGTAGACCT TTCATTAGGC 120 

TCGCCACTAA TGGGAGTCGT ATTCGAAATG 180 

TTCATTTCCA ACTTTGTGGA CTATGATCCC 340 

GAAGAAGACT GTGGATGGAC TGATTTTAAT 900 

ATGATGITGG ATGACAATTC GGATAATACC 960 

ATCACTTACC CGGGCCArPA CGTATATCCG 1020 

ACACCGACAA AGAAAGATCT GCTCTTPAAA 1080 

GATAAGGAAG GCGATCGTTA TCTGACTA':rr 1140 

TGGATCAAAT ACGATGACAT TAACTCTTTT 1200 

AAAGAAGCTA AAGATAAAAA GAGGCGCCGT 1260 

AAGGCTTGTT GGGTATGGCA TACTCGCAAG 1320 

ACTCCTGTAA TTATTAAACA TTTCCTATGG 1300 

GTGGGGGACG TATTGCAGAA GGAGGGTTKGC 14 40 

TATGTTCl'GA TCAACCTACC CAAAGAAGGG 1 5 00 

GGCCGAATCG TGGAGAAA6T TTCATTTTCA 1560 

CTGTCCAAAG GTACGTACAT GCTGAAAGTT 1620 

ATCATTGTAG AG 1662 



(2) INFORWATIOH FOR SEQ ID NO:i26 



35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pair* 
(B» TYPE: nucleic acid 
(CJ STRAMDEDMESS: double 
(D) TOPOLOGY: circular 

40 

tli) MOLECU'LE TYPE: WW (genomic) 
(iii) HYPOTHETICAL: NO 
45 Uv) ANTI-SENSE: NO 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROTiONAS GIUGIVALIS 

50 (ix) FFATURE: 

(A) NAME/KEY: niisc^f eature 

(B) LOCATION 1 . . .1650 



55 



<xl» SEQUENCE DESCRIPTION; SEQ ID NO: 126 



ATGAAATTAA AAATTGCACT CAGACTGCTG CTGGCGACTT TTGCCATAGT TTTATTTAGC 60 

CCTCTGGCCA AGGCCCAGAT GGATATTGGT GGAGACGATG TATTGATCGA GACGATGTCC 120 

ACCCTATCAG GATATTCAGA GGATTTTTAT TACAAGATGG CTGTGGCAGA CAATGGATGG IBO 

ATCTATGTGA TGTTGGATTT CTCTCGTATT TATTTTGATG ATGTCAGGCT GTATCGTTCC 240 

60 AAAGACGGTG GTGCTACTTA CCAAAAGTTA GGGTCTTTGG GGTCTTTGGT GCCTTATGAC 300 

TTCGATGTCT CGCATTGCGA TTTTATTGTA ACGGGAAAGG ATGAAGATGA TATCAATGTT 360 

TGGACAGTCA TGACAGCATT CGAATATGTA GGTGGTACTA TTGGCAATGG CGTTTTGCTG 4 20 

ATGCATCGCC ATGATGCAGA TATCAATAAT ACAGAGTGTG TGTACAAGAA GGATTTCCCT 480 

AATAATAGAC TGATGGGTGT AGCCATCGCC TCCAACTACC GTGCGCCCTC TCCTTACGGT 540 

65 TTGGGGGGCG ATCCTTTTGC TCTCGCTGTC GCCGTTAGTG GCTCCGGAAG CGATCACAGC 600 

TTCTTGGACT ATATTTTTTC GTTAGATGGT GGAGTACACT TTGAGCAAAA GCGTATTTAC 660 

ACAAGACCCC AAAAACTGAC TATCAATAGA GTAGACCTTT CATTAGGCAG TACATCTCCT 720 

TCTCTTGGAT TTAATACTTC GCCACTAATG GGAGTCGTAT TCGAAATGAA TAAGAACCTT 180 

GATGGCTTCG ACATTGGTTT CATTTCCAAC TTTGTGGACl' ATGATCCCCG CTATGCGTGG 84 0 

70 TCTGAACCGA TAATAATAGA AGAAGACTGT GGATGGACTG ATTTTAATCC TTTGGGAGCA 900 

CTAAGTATAG AGATCCAAAT GATGTTGGAT GACAATTCGG ATAATACCGT GGGTGGAGAA 960 

CGCTCCCATA ACmCCTGAT CACTTACCCG GGCCATTACG TATATCCGAA GCAATCTTTC 1020 

AATTATTCTC CCGGACATAC ACCGACAAAG AAAGATCTGG TCTTTAAACA CTGTATAGGT 1080 

ATTCCGGCTT TGGCATACGA TAAGGAAGGC GATCGTTATC TGACTACTTT TCAAGATCAC 114 0 

75 AATCTAATGA GATACAGATG GATCAAATAC GATGACATTA ACTCTTTTTA TGGTTGGACT 12 00 
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TGGCCATATG 
CTCAATCCTA 
GAAACCAAAC 
TGGGTACATG 
CCCAATCCTG 
GTCGTATACG 
AAGCTGAATG 
GAGTATTTCG 



TATATGCAAA 
CCAATGGAAA 
CACATCCTAC 
CTCTGGACGT 
CCAAAGAATA 
ACATGCAGGG 
TGCAGTATCT 
TGGAAAAAAT 



AGAAGCTAAA 
GGCTTGTTGG 
TCCTGTAATT 
GGGGGACGTA 
TGTTCTGATC 
CCGAATCGTG 
GTCC7KAAGGT 
CATTGTAGAG 



GATAAAAAGA 
GTATGGCATA 
ATTAAACATT 
TTGCAGAAGG 
AACCTACCCA 
GAGAAAGTTT 
ACGTACATGC 



GGCGCCGTCC 
CTCGCAAGAG 
TCCTATGGTC 
AGGGTAGCAT 
AAGAAGGGGG 
CATTTTCAGG 
TGAAAGTTGT 



GCAAGTAGCA 
CCCATATGAT 
CGATACGGAG 
GAAGCTCTAC 
GCACGAGGCA 
GAAAGAATAT 
AGCGGATACG 



(2) IME*ORJlATION FOR SEQ TD NO: a 2*7 

til SEQUEHCF. CIlARACl'ERISTICS: 

(AJ LEliGTH: ll^O base pairs 
tB) TYPE: nucleic acid 
tC) bTHAIlDEDMESS: double 
(D) TOPOLOGY: circular 

(ii) nOLECULE TYPE: DHA (genomic) 

tiii) HYrOTIiETICAL: HO 

Uv» A1(TI-SEHSE: HO 

(vi) ORIGIllAL SOURCE: 

(A) ORGAIIISH: PORYPHYROMOHAS GIHGIVALIS 

Ux> FEATURE: 

IA> NAMG/KEY: mi6c_ feature 
IBt IK)CATlorJ 1.,.1170 

(XiJ SEQUa^CE DESCRTPTION: SEQ ID NO: 127 



ATGAAACGAT 
GCl'CAGTCAC 
TATCGACTGG 
G7\AGAAACGA 
TCGCTTTTCT 
TTAAAACAAG 
AAGTCGCCGA 
GATATGCCTT 
AACTATCTGA 
GAATTGATCG 
CTTCGTGACA 
TATACATACG 
ACCCTTGCCA 
ACGCATGAGG 
CAAACGTATT 
TATACCGATA 
TATCCTCCTC 
CTATTCGACA 
ATCGGAGTTG 
AAAACCATAC 



TACTCCCCTT 
CCCGAATCCC 
ACAAGATCAG 
TACCCACTAA 
ATGAAGACGA 
CGGAGAAGTA 
CAGACGAAAC 
TTGAGATTAC 
ATGGAAAGAT 
AAACGGGTAG 
GTGTATTTCT 
ACAATAAGCA 
ACAACTTCGA 
AGTTCTTCCG 
TCAATAACTC 
TGCA6GGTAA 
CTGCCACGGA 
TGAACGGGAA 
CATCTCTTCC 
GTGCAAAAGT 



TCTCCTTTTA 
TCAAGTGGAT 
TGTCCCGGAT 
AATACAAACG 
CAGGTTGGTT 
TGTATACGAC 
GCCAATAAAG 
GACAGAGATG 
TGCCCGAATA 
AATGGTATAT 
TCCTCTTCAA 
TAATTGTATT 
ATACGACACC 
TCCTCTTCTT 
CGCAGAAGGC 
TGCACTGACC 
TTTTCTGCGT 
GCTCATC?VGA 
GAGAGQCACT 
ATCGCTCAGA 



GCAGGACTCG 
GTACACACTC 
TCTCGTCAGA 
ACCACAGGAG 
CAGGTGCGCT 
GGTTCTAAGC 
AAAGTTAGCT 
AGCGATGGCT 
GATATCATGA 
GAGTTTGATG 
AACAAGTGGG 
CGTTGGGAAC 
ACTATCCCTC 
CCCAATTTTA 
TTGTCAGAGG 
GATGTTGCCG 
ATAGAAGGTT 
GCTACCGAAT 
TACATCGCAG 



TAGCXGTAGG 
GCATCGCAAG 
TATTCGATTA 
GTGCAATTAC 
ATTTTGACAA 
TGGTCCTTCG 
ATCACTATCT 
ATrrTGAAAG 
CTCAACAGAA 
CCAATAATGA 
TAGAAATGTT 
AAGACGAATT 
TGTCGTCTGT 
TGAAGCATAT 
TATGCGATTA 
TGAACGAATC 
CGCAACTGCT 
TGACAGGCGA 
AAATAACTGC 



AAAC6TGTCT 
AAATGCCCGT 
CTTCTATAAA 
AAGGATCGAT 
TAACCTTGAA 
AGAAATTCGC 
CTGT&3CAOC 
CCATACGCTT 
CCCATCGGCC 
TGCTGTACTG 
TACTCACCG7 
CGGCACCCTC 
ATTGTTCCCC 
GCGTACGAAG 
CAACTACTTC 
GATCAAGATT 
TCGCCTTTCG 
TTTGGCCATT 
TGCAAACAGC 



(2) IMFORI-tATION FOR SEQ ID NO: 128 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1233 bass palra 

(B) r/PE: nucleic acid 

(C) STRANDF.DNESS: double 

(D) TOPOLOGY: circular 

fii) HOLECULE TYPE: DNA (genomic) 

tiii) HYPOTHETICAL: NO 

liv) 7U7TI-SEHSE: NO 

(vi) ORIGIHAL SOURCE: 

I A) ORGANISM: PORYPHYROflONAS GIHGIVALIS 

tlx) FEATURE: 

(A) NAME/KEY: misc feature 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1650 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
84 0 
900 
960 
1020 

loeo 

1140 
1170 
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iBJ LOCATIOt-J 1, . .1233 



<xl) SEQUEWCF. 0ESCRIPl*IOt>4: SEQ ID MO: 128 



10 



15 



20 



25 



ATGAGACA<iC 
GTCGGTTGCA 
TCTTTGGACG 
ATGGCACAGG 
AAXCATTAGA 
TTTGTGGATT 
GGATTTTGGG 
GTCTACTTTT 
AAATTCTCCT 
CTGCCCGAAA 
GGACACAGCC 
GTGGTCGGCT 
ATCATAGTAT 
CATTATCTGC 
CCGAAGCCCT 
GTTTACCGCA 
GAACTGGTCG 
CGTCCGGGAG 
GCTGTCGGAC 
TATCCGCTCA 
TTGCTGAAC3 



AIT'l'ATCTCT 
GAACAGTCCG 
GCTTCCGACC 
AGGGATTGAG 
GCATGGCTAC 
CGCTACTGGG 
GCGGCGAGCC 
GGGTAGQATC 
CCACCGTTCC 
AGGAGCGACC 
AAACGCCCGA 
ATTTCCGCAA 
CCGATCACGG 
CTGCGGACAG 
CCTATACCGA 
AGGGGGAGGT 
TGATTCCGGA 
CGGCACATGG 
CCGATTTCCG 
TATGCAGGCT 
GCCTGATCCG 



ATTTCCTTTT 
ACAAACACCT 
GGACTATACC 
CGGGTCGCTC 
GGGGCTTTAC 
CATCTTTCGT 
GGTTTGGAAT 
CGAAACGGCT 
GTTTCGTGAC 
GCGCTTGCTC 
AAGCCCGCrG 
GCGGTTGCAC 
TATGGCCACG 
TTTCCTCTAC 
GCGAGCCTAT 
GCCCAAGCGT 
CATAGGCTCC 
CTATGACAAC 
TCCGGGCAGT 
GTTGGGl'ATA 
AGACAAACGA 



ATCTTGTTTC 
AAGCAGTCGG 
GATCGGGCAC 
CAACCATGCT 
CCCGATCATC 
ATATCCGACC 
ACGGCCGCAC 
GTGAACGGAA 
CGTGCCGACT 
ATGTGGTACA 
ACACTGGCAA 
TCTCTGCCCA 
TACGAAAATG 
ATGGCCACCG 
GAGATCCTGC 
TT6CGCTGTG 
ACCGTCTTTT 
CAAGCACCGG 
A&3GTGGAAA 
GAGCCTGCAC 
CCA 



TGCTTCTTGC 
AACGGTACGT 
GTACACCGGC 
TCCCCTCGCT 
ACGGTATCGT 
GAAAAfiCCGT 
GCCAAGGCAT 
ATCGGCCGTG 
CCGTCATCGC 
TCGAGGAGCC 
TGGTAGAGCG 
TAGCCGCACA 
ACAAATGTGT 
GGGCCTTCAC 
GGGCCATTCC 
GCACCAATCC 
TCGCAATAAA 
AAATGCGGGC 
ACCTGCCCAA 
CCAACGATGC 



CTTCTCTTAT 
CGTAGTCCTG 
GTTGGATCGG 
TACATTTCCC 
AGCCAATGAG 
GGAGACCCCC 
CCGTACCGGT 
GCGGTGGAAA 
GTGGCTCGGA 
GGATATGATC 
GTTGGACAGT 
GACCGACTTC 
CAATCTGTCG 
CCACTTGTAC 
ACATATATCG 
TCGTTTGGGC 
TGAAGACGTT 
TTTACrcCGG 
TATCACCATC 
GGACGAAACG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1233 
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(2) IHFOWIATIOH FOR SEQ ID HO: 12 9 

(i) SEOUE»CE CHARACTERISTICS! 

(A I LENGTH: 738 base pairs 
(B) TYPE: nucleic acid 
IC) STRAIIDEDT-IESS : double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(IIV) HYPOTHETICAL: MO 

liv) Al-JTI-SEMSE: NO 



45 



(vl) ORIGIWAL SOURCE: 

(A| ORGAIJISM: PORYPHYROt^tONAS GIHGIVALIS 

(iz) FEATURE: 

(A) NAIIE/i<Ey: mi£C_f eature 
(B| LOCATIOIt 1 . . .738 



50 



55 



60 



65 



Ui) SEQUENCE DESCRIPTIOJ; : SEQ ID HO: 129 



ATGAAAGTAG 
GCCACGTACA 
TGTTGCGGCC 
CGATTCGAAG 
GCTTTCGTGA 
GCAGCCAAGG 
CCCTCCCGAT 
CATCTGTCCA 
GAGATGGTGC 
GGCGGTATGT 
CTGGATCACA 
CATATGCAGG 
ATTTTAGCAG 



GTTTcrrrcAT 

AACTGCTGAA 
AGCCTATGGC 
AGCTGTTCGA 
AAGAAAACTA 
TTCGGGATAT 
TCGCCCATAA 
CCCCCAGTGA 
AGGGCATAGA 
ACTCGGTGGA 
TATCCACAGG 
GAGTGATAGA 
CAAACTTA 



CCCCTGTTAT 
GAtSTTTGGAC 
CAATGCCGGA 
GTCCTATGAT 
TGATCATATC 
ATGCGAGTTC 
GGTGAGCCTG 
AGTGCACCGA 
GGTATTCGAG 
GGAGCCGGAG 
TGCGGAGTAC 
CAGAGAGAAA 



GTCAATGCAG 
ATAGATGTCG 
TTCGAACAGA 
GTAGTC6TAG 
CTCAGACCGA 
TTGCACGATG 
CACAACAGTT 
CCGTACCACA 
CCCAAGCGAA 
GTATCCACCT 
ATCACAGGGC 
TTGCCGATCA 



TGTATCCGGA 
ACTACCCGAT 
AAGCTCAAAA 
GGCCATCGGC 
CAGGACATGT 
ACCTGAAGAT 
GCCACGGTGT 
ACAAGGTGCG 
TAGACGAATG 
GTATGGGGCA 
CGGACAGCTC 
AGACAATTCA 



AGTGGGTATC 
GGATCAGACA 
GCTGGCTTTC 
CAGTTGCGTT 
CTGCAAGTCG 
CACCAGCCTC 
GCGCGAACTG 
CCGGCTATTG 
CTGCGGTTTC 
TGACAAGGTG 
GTCCCTCATG 
TGCAGTAGAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
738 



121 INFORMATION FOR SEQ ID NO: 130 



70 



(1) SEQUENCE CHARACTERISTICS: 

(At LENGTH: 738 base paics 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(DJ TOPOLOGY: circular 



75 



(11) MOLECULE TYPE: DNA (genomic) 
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tiil) HYPOTHETICAL: HO 

(lv> AI^TI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

lA) ORGANISH: PORYPHYROf^OHAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: mlsc feature 

(B) LOCATION 1. . .730 



(Xi) SEC?L»E^^CE DESCRIPTION: SEQ ID 1X0:130 

ATGGATATTG TAAGTATGGC CGATAAAGCT CTTGTAGTGG AGATGAGAGA TGTGACGrrr 
rS^^SSS^ AAAACGTCAT TTTTCAAAAT TTGAATCTGA CcS???^^ ^^^C 
???^^G AAGAGCACTT TGCTGAAG^ ^A^^ 

UAWiT&cCTA TCTCTGCCGG TTATGCCCGC GTGATAGATT ATGATiTGGC AAAm-n-&aa 
TGCCCTATCT GCGCAGGAAT TTGGGCMTG TGTTtS^ 5?^^^^ 
20 fl^f^ CTACTGTTGC GGAGAATTTG GATTTCGTTT TgSaSc^AC GGaS2^ 
J^^SS^ ATCGCGAGCA GOGTATCSAG GAGGTTTTGA CCOSTQTGGi AATCT??2SS 
AGASACCGCA CGAACTGTCC GGAGGGGAGC AACAACGtS? GGGTAW^ 
J^AGCTTTGC TGGCGAAGCC TGCGTTGATC CTGGCCGACG AACCCACAGG CAACCTcS^ 
TCGGTGACCG GATTGCRGAT CGCTTCTCTG CTCTACGAAA TCAGTAAgS MgS^I 
25 S^^^r^ GCACGOlpAA CAGCAGCCTG CTGTCGCATC TGCcS^ ^^^^^^SS 



(2> INFORMATION FOR SEQ ID NO: 131 

(1) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 723 base pairs 

(B) TYPE: nucleic acid 

(C) STRAJ-IDEDNESS : double 
<D) TOPOLOGY: circular 

(il> MOLECULE TYPE: DMA (genomic) 

ilii> HYPOTHETICAL: NO 

40 

(Iv) AhiTI-SENSE: NO 

(vl ) ORIGINAL SOURCE: 
45 ORGANISM: PORYPHYRQMOMAS GINGIVALIS 

( ix ) £-EATURE : 

(A) NAIJE/KEY: mlsc feature 

(B) LOCATION 1. . .723 

50 



(XI) SEQUENCE DESCRIPTION: SEQ ID NO: 131 

SSSilS JA^^^'^GT AGTGGAGATG AGAGATGTGA CGCTCTGTCA GGAGGAAAAC 
SJ^UIS ^TTTGAA TCTGACCCTT TCCGCCGOAG ACTTCGTCTA 

55 ^^^rl nSSS?^^ cAcrrrGCTG aaggctttgt atgct^aggt ^a?^ 

?^T^S^ §^^GGG ^r^J^^^' CTGGCAAAGT .TGAAACGGAA 

GGAATTTGGG CATTGTGTTT CAGGATTTCC AGTTGCTria r^r:r-iii-v-^«^ 
JIEr^"^ CGTTTTGCGA Gc?^^^ SSJ^ 
f^^^^'^ TCGAGGRGGT TTTQACCCGT GTGSGAATGT CTCGGAaSg- TTaSm^ 
60 If^?^^ CffTGTGGGTA TaS^E ^^^^ 

wvr TGATCCTGGC CGACGAACCC ACAGGCAACC TCGATTrrfiT -ar-r-r:/-!!^-!./- 

SSISSI 51^°^^^ cgaaatca(^ aagS^ ^^tI^ ^t^^Ig 
SSSI'^'^''^'' gcatctgccg gcacggacat tggccgttcg taaSa^^g^ 
gatgcctcct ctttggtcga gctgagtgca gatgctgttt cpS^^SJ^ l^^^^ 



70 



(2) INFORflATIOC^ FOR SEQ ID HO: 132 

(1) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 696 base pairf 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY': circular 

'^^ tii) MOLECULE TYPE: DHiA (genomic) 
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(lii) HYfCri'HETICAL: NO 

liv) AMTI-SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORCyVNISM; PORypHyR0MOI>rAS GINGIVALTS 

iix) FEATURE: 

(A» NAME/KEY: misef eature 
<B) LOC:ATIOt4 1 696 

(xi) SEQUENCE DESCRIPTlOWi SEQ ID NO: 132 

ATGAGAGATG TCWCGCTCTG TCAGGAGGAA AACGTCATTT TTCAAAATTT GAATCTGACC 60 

CTTTCCGCCG GAGACTTCGT CTATCTGATA GGCTCAGTGG GATCGGGGAA 6AGCACTTTG 120 

CTOAAGGCTT TGTATGCTGA GCTTGCCTATC TCTGCC6GTT ATGCCCGCGT GATAGATTAT IBO 

GATCrrGGCAA AGl'TGAAACG GAAGCAGTTG CCCTATCfGC GCAG6AATTT GGGCATTGTG 24 0 

TTTCAGGATT TCCAGTTGCT GAACGGACGT ACTGrTGCGG AGAATTTGGA TTTCGTTTTG 300 

CGAGCTACGG ACTGGAAAAA CCGAGCCGAT CGCGAGCAGC GTATCGAGGA GGTTTTGACC 360 

CGTGTGGGAA TGTCTCGGAA GGCTTATAAG AGACCGCACS AACTGTCCGG AGGGGAGCAA 420 

CAACGTGTGG GTATAGCCAG AGCTTrtJCTG GCGAAGCCTG CGTTGATCCT GCCCGACGAA 480 

CCCACAGGCA ACCTCGATTC GGTGACCGGA TTGCAGATCG CrTCTCTGCT CTACGAAATC S40 

AGTAAGCAGG GCACTGCAGT ACTTATGAGC ACGCACAACA GCAGCCTGCT GTCGCATCTG 600 

CCGGCACGGA CATTGGCCGT TCGTAAGAAT GGCGATGCCT CCTCTTTCGT CGAGCTGAGT 660 

GCAGATGCTG TTTCAAGAAA AAATACGGAA ATAGAT 696 



(21 INFORMATION FOR SEQ ID HO: 133 

(i) SB<:^Bl>SCE CHARACTERISTICS: 

<AJ LENGTH: 657 base pairs 
IB) TYPE: nuel«ic acid 
(C) STRANDEDNESS: double 
(D> TOPOLOGY: circular 

till HOLECULF. TYPE: DNA (genomic I 

tiil) HYPOl'HETICAL: WO 

iiv) Aim -SENSE: NO 

(vl ] ORIGINAL SOURCE: 

(A) ORGAJiISM: PORY PH YROHOi JA5 GINGTVALIS 

Ux) FEATURE: 

(A) NA/'IE/KEY: mlac_feature 
{ B ) LOCATI ON L * . . 657 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 133 

ATGATTGAAA TCAGCAACCT CACCAAGGTT TTCAGAACAG AAGAAATAGA GACGGTAGCC 60 

CTCGATGGCG TATCGCTCAA AGTGGACAAA GGCGAATTTA TCGCCATAAT GGGGCCTTCG 120 

GGATGCGGTA AGTCCACTCT GCTCAATATC CTCGGCCTTC TCGACAATCC CACTTCCGGT 180 

ATCTACAAGC TCGATGGGGC AGAAGTGGC5C AACCTCCGGG AAAAAGACAG GACTGCC6TC 24 0 

CGTAAGGGCA ATATCGGCTT CGTATTCCAfi AGCTTCAACC TCATCGAAGA GATGACGGTA 300 

AGCGAGAACG TGGAGTTGCC GCTCGTCTAT CTGGGTGTGA AGGCCTCCGA GCGGAAAGAG 360 

CGAGTGGAGG AGGCACTGCG CAAGATGAGC ATCAGCCACC GGGCCGGCCA CTTCCCCAAT 420 

CAGCTCTCCG GAGGACAACA GCAGCGCGT6 GCTATC6CCC GTGCCGTQGT GGCCAATCCG 490 

AAGCTCATCC TCGCCGATGA ACCCACGGGT AACCTCGACT CCAAAAACGG ACCCGATGTC 54 0 

ATGGAACTGC TCAGAGGTCT CAATCGCGAA GGTGCAACCA TCGTCATGGT GACGCACTCC 600 

GAGCACGATG CACGTAGTGC CGGCCGCATC ATC7\ATCTGT TCGACGGTAA GATTCGC 657 



(2 J IMFORHATION FOR SEQ ID NO: 134 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1785 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDMESS : double 
(D) TOPOLOGY: circular 

(iij HOLECULE TYPE: DNA (genomic) 

(tli> HYPOTHETICAL: NO 
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liv) WJTI -sense: no 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GTIJGIVALIS 

Uxj FEATURE: 

(A) NAt-IE/KEY: mxsc_feature 

(B) LOCATIOTJ 1...1785 

(Xl) SEQUENCK DESCRI priori : SEQ ID MO: 134 



15 



20 



25 



30 



35 



40 



ATGAAAGAAT 
ATCTTGTTCT 
GCAACGGGAG 
TCTTTCXrCTG 
GTATCCXTTCT 
GGTATCTTCC 
CGCGCGTTGC 
ACCrCAAAAGG 
ATGTTGGGGC 
TTCGGCGTGA 
ATGCTCAACA 
TGGGACAAGA 
ATGTTTGCC3 
GTGGATGAGC 
GGAGAGAAAG 
AACAAAACGA 
ATAATAAAGA 
ATCAAGGCAG 
GGAGGTAGTG 
AAGCCTA'l'CG 
GCAGCCAACA 
ATGTTCCCGA 
ACATCCAAGT 
CTCATCCAAC 
CGCAACCGCA 
PAAGCTCTTG 
GCCGCGAAGC 
AACn'CTTCG 
ACCATTCTCT 
CCTTCGGGCA 



TTTTCAAAAT 
GTATCTTTCT 
GAACCATTCC 
AGATCGTATC 
CACAGGCAGT 
TCGATCTGGA 
AGGATTTCAA 
GTTACTACCT 
TTATCGGGAT 
AGATGGAGAT 
GGATGAGCGA 
TCACATCCGA 
ACAAAGGCGA 
TGGCTTACCG 
ATGAACTTCG 
AAGGCAGTCG 
AGCCGTTCGA 
CAGCCGATGA 
CTTTCACTTC 
TGGTCTCCAT 
OTATCGTGGC 
ACTTGGCGGG 
ATGCAGACTT^ 
GCTACATAGA 
CCAAGGCACA 
CACTCGGTTT 
TGGCTCAGCT 
AAGAGTTGCT 
CCGATCCGGA 
TACAGGCACG 



GTTTTTCGCC 
ATTTATCTTT 
GAAGATCGAA 
GGCCAATCCC 
CGAAGCCATC 
caiACCTTTCC 
CATGTCGGGC 
CTCCAGTATT 
TGCGACCCAA 
CTTCAAGGTA 
TGCCAATCGC 
TATTGCAGAG 
AATGTTCGGT 
TACCGATGTG 
GTTCGTATCG 
GATCGCCGTT 
CACTGACGGT 
CGATGATATC 
CGAACAGATA 
GGGCGACGTA 
AGAGCATACG 
CGTAGCCAAG 
GGGCAACACC 
GCAGGGCTAC 
GATCGACAGC 
GGTGGATGAG 
CGGTGGCAAC 
CTCCTCATCA 
AATAGAAGTT 
TCTCCCCTAT 



TCGATCCTCG 
TTCGGCATCG 
GCAAACTCCA 
TGGAGCATGC 
GGCCAACCCA 
GTCGGTATGG 
AAGTTCGTCG 
GCAGACAAAC 
ACAATGTTCr 
GGCACCTACA 
GAACAAATCA 
TCGCGCAAGA 
CTTGCCCA'JA 
GAGAAAGAAC 

crrrcTCAGG 

CTCTTTGCCG 
AGCTCCATCA 
AAAGCCGTAG 
TGGAAGCAGG 
GCAGCCTCGG 
ACTCTGACCG 
AAGATAGGAG 
TTCGCTCCGA 
GACCTCTTCC 
ATCGCTCAAG 
CTTGGAGGTT 
TACAGCATAG 
GCAGCGGATA 
CTGCGCGAAC 
TACTTCATGC 



GGGTTATAAC 
TAGCCGGTAT 
TCCTACATAT 
TCACAGGCAA 
AAAATAATCC 
CATCGGCAGA 
TATCCTATCC 
TCTACCTCAA 
ACAAAGATGC 
AGGCAGCCGT 
CCACATACAT 
CGGCAATGGA 
AAGCGGTGGA 
TCAAAAAGAT 
TTCTGGCCAA 
AAGGTGAAAT 
CACAAGAACT 
TACTTCX3TGT 
TAGCCGATCT 
GCGGATACTA 
GCTCCATCGG 
TGAATATG6A 
TGACGGTCGA 
TCACTCGCGT 
GCCGTGTATG 
TGGACACAGC 
AGTATGGCAA 
TGAAGTCTGC 
TCCGCTCCAT 
CGTAC 



GGCAGGAATC 
TGCCTCCAAG 
AWACAATTCT 
AGACGAGTCC 
C7VACATAACC 
GGAATTGCGT 
CGACAGATAC 
TCCGAAAGGA 
CCTCGACAAA 
AGAGCCATTC 
AAACGGGCTT 
TTCCGTGAAA 
GATGAAGCTC 
6TCCCAACGC 
TGGCCCGATG 
AACGGAAGAA 
CGCCAAAGAA 
CAATTCrCCG 
CAAGGCCAAA 
CATAGCCTGC 
CATATTCGGC 
CGTCGTACAG 
AGATCGTGCC 
ATCGGAAGGC 
GCTCGGCQAC 
TATCAAACGG 
GACCAAGCGC 
CATCCTGAGT 
GCCGCCCCGT 



60 
12C 
180 
240 
300 
360 
420 
480 
540 
600 
660 
"7 20 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1785 



45 



50 



55 



60 



65 



70 



75 



{2> IHFORMATIOH FOR SEQ ID MO: 135 

ti) SEQUENCE CIIARACTEHISTICS: 

(A) LEHGTH: 1767 base pairs 
tB) TYPE: nucleic acid 

(C) STRAMDEDHESS : double 

(D) TOPOLOGY: circular 

til) MOLECULE TYPE: DNA ( genomic > 

Uii) HYPOTHETICAL: NO 

(iv) AtlTI -SENSE: NO 

(vl) ORIGINAL SOURCE; 

lA) ORGAT^ISM: PORYPHYROt40t-fAS GINGIVAMS 

(ix) f-KATURE: 

(A) NAME/KEY: misc^Ieature 

(B) LOCATION 1. - .1767 

(Xi) SECJUENCE DESCRIPTTOr^ : SEQ ID NO: 136 

ATGTTTTTCG CCTCCATCCT CGGGGTTATA ACGGCAGGAA TCATCTTGTT 
CTATTTATCT TTTTCGGCAT CGTAGCCGGT ATTGCCTCCA AGGCAACGGG 
CCGAAGATCG AAGCAAACTC CATCCTACAT ATAIIACAATT CTTCTTTCCC 
TCGGCCAATC CCTGGAGCAT GCTCACAGGC AAAGACGACT CCGTATCQCT 
GTCGAAGCCA TCGGCCAAGC CAAAAATAAT CCCAACATAA CCGGTATCTT 
GACAACCTTT CCGTCGGTAT GGCATCGGCA GAGGAATTGC GTCGCGCGTT 
AAGATGTCGG GCAACTTCGT CGTATCCTAT GCCGACAGAT ACACCCAAAA 
CTCTCCAGTA TTGCAGACAA ACTCTACCTC AATCCGAAAG GAATGTTGCG 



CTGTATCriT 
AGGAACGATT 
TGAGATCGTA 
CTCACAGGCA 
CCTCGATCTG 
GCAGGATTTC 
GGGTTACTAC 
GCTTATCG'3G 



6C 
120 
180 
24 0 
300 
360 
420 
480 



SUBSTrrUTE SHEET (Rule 26) (RO/AU) 



iT'-h -3"=^ iP-ii ^Zr*' "^£1' 

o "si^ oi o> o 
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10 



15 



ATTGCGACCC 
ATCTTCAAGG 
GATGCCAATC 
GATATTGCA'j 
GAAATGITCG 
CGTACCGATG 
CGGTTCGTAT 
CGGATCGCCG 
GACACTGACG 
GACOATGATA 
TCCGAACAGA 
ATGGGCGACG 
GCA<iAGCATA 
GGCGTAGCCA 
TTGGGCAACA 
GAGCAGGGCT 
CAGATCGACA 
TTGGTGGATG 
CTCGGTGGCA 
CTCTCCTCAT 
GAAATAGAAG 
CGT CTCC CCT 



AA-OiCAATCTT 
TAGGCACCTA 
GCGAACAAAT 
AGTCGCGCAA 
GTCTTGCCGA 
TGGAGAAAGA 
CCCTTTCTCA 
TTCTCTTTGC 
GTAGCTCCAT 
TCAAAGCCGT 
TATGGAAGCA 
TAGCAGCCTC 
CGACTCTGAC 
AGAAGATAGG 
CCTTCGCTCC 
ACGACCTCTT 
GCATCGCTCA 
AGCTTGGAGG 
ACTACAGCAT 
CAGCAGCGGA 
TTCTGCGCGA 
ATTACTTCAT 



CTACAAAGAT 
CAAG'SCAGCC 
CACCACATAC 
GACGGCAATG 
GAAAGCGGTG 
ACTCAAAAAG 
GGTTCTGGCC 
CGAAGG"I'GAA 
CACACAAGAA 
AGTACTTCGT 
GGTAGCCGAT 
GGGCGGATAC 
CCGCTCCATC 
AGTGAATATG 
GATGACGGTC 
CCTCACTCGC 
AGGCCGTCTA 
TTTGGACACA 
AGAGTATGGC 
TATGAAGTCT 
ACTCCGCTCC 
GCCGTAC 



GCCCTCGACA 
GTAGAGCCAT 
ATAAACGGGC 
GATTCCGTGA 
GA6ATGAAGC 
ATCTCCCAAC 
AATGGCCCGA 
ATAACGGAAG 
CTCGCCAAAG 
GTCAATTCTC 
CTCAAGGCCA 
TACATAGCCT 
GGCATATTCG 
GACGTCGTAC 
GAAGATCGTG 
GTATCGGAAG 
TGGCTCGGCG 
GCTATCAAAC 
AAGACCAAGC 
GCCATCCTGA 
ATGCCGCCCC 



AATTCGGCGT 
TCATGCTCAA 
TTTGGGACAA 
AAATGTTTGC 
TCGTGGATGA 
GCGGAGAGAA 
TGAACAAAAC 
AAATAATAAA 
AAATCAAGGC 
CGGGAGGTAG 
AOAAGCCTAT 
GCGCAGCCAA 
GCATGTTCCC 
AGACATCCAA 
CCCTCATCTA 
GCCGCAACXrC 
ACAAAGCTCT 
GGGCCGCGAA 
GCAACTTCTT 
CTACCATTCT 
GTCCTTCGGG 



GAAGATGGAC 
CAGGATGAGC 
GATCACATCC 
C6ACAAAGGC 
GCTGGCTTAC 
AGATGAACTT 
GAAAGGCAGT 
GAAGCCGTTC 
AGCAGCCGAT 
TGCTTTCACT 
CGTGGTCTCC 
CAGTATCGTG 
GAACrrCGCG 
GTATGCAGAC 
ACGCTACATA 
CACCAAGGCA 
TGCACTCCGT 
GCTGGCTCAG 
CGAAGAGTTG 
CTCCGATCCG 
CATACAGGCA 



S40 
600 
660 
720 
780 
840 
900 
960 
i020 
lOftO 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1S60 
1620 
1680 
1740 
1767 



25 12) INFORllATION TOR SEQ ID MO: 136 

tli SEQUEUCE CHARACTERISTICS 

(A) LEU^JGYH: 939 base pa ire 

(B] TYPEi nucleic acid 
30 (C) STRAWDBCXiESS: double 

(D) TOPOLOGY: circular 

Ui) liOLECULE TYPE: DMA (genomic 1 
35 (tii) HYPOTHETICAL: NO 

(iv) AffTI-SEUSE: NO 



40 



(vi) 



ORIGXHAL SOURCE: 
(A) ORGANiaiJ: PORY PHYROHONTVS GTMGIVALIS 



45 



Ux) FEATURE: 

(AJ NAHE/KEY: misc_feature 
(Dl LOCATION 1. . . 939 

txl) SEQUENCE DESCRIPTIOri: SEQ ID MO: 136 



50 



55 



60 



65 



ATGAGAGCAA 
ATCGGACAGG 
GAAGCTATCC 
ITGGTGGAGA 
ATCAAATGCC 
GAATGCTTCC 
CTCCAACCAC 
ACGGGGTTGC 
ATGGTGGAGC 
AAGTCTTACC 
CTCAAGCACG 
ACCTTCGATT 
GATGCTCCCG 
GCATGCTCCA 
ATCTCCCAAC 
GAATATATAC 



ACATTTGGCA 
CTCAGAGTCG 
GACAGATGAG 
CAGGAGCCGG 
ACAAATCGTG 
GCAGCTATTC 
GCTATCGTCC 
AACGCTGTGG 
TGTATGAGCT 
CCGGGTCCAA 
AAGCTTACTT 
CTTTGGCCGA 
TGGATTTCCC 
TCTCCAAACA 
GCTATGGCAT 
CCCAAGAGGG 



GATACTTTCC 
AAACCGTACA 
CCGCTACAAT 
AGCCAGTACA 
GACCGGCAAG 
GGCCGCTCGC 
CCTGTTCAAA 
CTATGCCACC 
ATATGCTTTG 
CAAAAAATCC 
CAGCTACGGA 
AGAGTTCGAC 
GATCGAAAAG 
CACACAGCAC 
CCGGATGAAG 
CGATATACTG 



GTTTCGGTTC 
TACGAGGCTT 
ATACCGGCAA 
CTGGCCAGCG 
CGCACCTATC 
GAATCGTATG 
CTCGACAGAG 
AATCGGGGCT 
GATCGCGAGA 
CATCAAACGA 
CTGCTCTACA 
ATGAGAGCCT 
GGCGAn'GTGA 
GTAGTGCGTG 
AACCTCTACA 
CGCTTGCiSC 



TCTTTTTCTT 
ATGTGAAACA 
CCATCACCAT 
TACACAACAA 
GTACCGACGA 
AAGATCATTC 
AAGACTATCG 
ATGCCAATCT 
AGTACCCCTC 
CCAAGCAGAA 
TCATAGCCAA 
CCAAACTGGC 
TCTATCTGGA 
TGGGCGATTC 
AGCTCAACGA 



CGGGACAGCG 
GTACGCCGAC 
AGCACAGGCr 
TCACTTCGGG 
TGCGCCGAAC 
CCGATTTCTG 
GGGCTGGGCT 
GCTGATCAAG 
ATGGTTCCAC 
GCAGAOCGGA 
GCAAGGCGAT 
CAAATACAAC 
GAAAAAGCAC 
GATGCACAGT 
CAAGGATGGC 



60 
120 
IBO 
240 
300 
360 
420 
4 00 
54 0 
600 
660 
720 
780 
840 
900 
93 9 



12) IHFORItATTOIl FOR SEQ ID 110:137 



70 



(1) SEQUEMCE CHARACTERISTICS: 

(AJ LENGTH: 156? base pairs 

(B) rrPE: nucleic acid 

(C) STRAW DEDNESS: double 

(D) TOPOLOGY: circular 



75 



(li) HOLECULE TYPE: DNA (genomic) 



SUBSTITUTE SHEET (RiiJe 26) (RCVAU) 



U™U "^ifi a . 



ir-^: ^=11, li^^, risq, 

.111 u Oi tir: iUii 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(ill) HYPOTHETICAL: NO 
(iv) AHTI- SENSE; NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: POR'/PliyRa^ONAS GINGIVALIS 
t ix ) FEATURE : 

(A> NAME/KEY: misc feature 

ID) hOCJKTim 1. . .1569 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 137 



ATGGACGGAC 
AAAGTAGAGG 
TATCATAAGC 
ATCTACAAAC 
AATGATCGTA 
CGTCTTGTCG 
TTGTTGAGCC 
CATCAGAAGG 
ACGAACATGG 
GGTTTGGCTA 
GGTCGTTCCG 
CATCTGATGC 
AACGAAGGAG 
AAGGTGGAAG 
AATTCGCAGC 
GGTATGGATG 
GAAGCCAATG 
CCTATCACGC 
GATGAAGCTT 
GTGGTTCATC 
ACGGATGGTA 
GGGAAAAGCA 
TGGAAAGAAC 
GAAAACAAAG 
GTAGAAGCCA 
GAGGCTCCTT 
CAACAACGT 



GTCGATATTC 
CTGCGACACA 
TGGCAGGGAT 
TGGACGTTGT 
TCTATAAGAC 
AAGAGGGCAG 
GTATGTTACG 
AGGCCGAGAT 
CCGGTCGTGG 
TCATTGGTAC 
GCCGTCAGGG 
GCCTCTTTGC 
AAGTGCTCGA 
AGAACAACTT 
GTGAAGTCAT 
TACTCAATAC 
ATTTCGAAGG 
AAGAAATATT 
ACAAGTCTTT 
AGGTATTCGA 
AACGTGTCTA 
TCATCAAAGA 
ATCTGCGTGA 
ATCCACTACT 
TGAACCGTAA 
CCCAAGAAGA 



GGATCGCCTC 
GACATTTGCA 
GACCGGTACT 
AGTTATTCCG 
GGCACGTGAA 
ACCTGTACTT 
CfTGCGTGGC 
TGTAGCTCAG 
TACCGACATC 
GGAAAGGCAC 
TGATCCCGGT 
CACAGAAAAG 
AAACAACATG 
CGGTATCCGT 
TTATACCCGT 
CATATACGAC 
CTTCAAGGAft 
CAGAGGTAAG 
CCAACGTAAG 
GACCCAAGCC 
TAACATAGGA 
ATTTGAGAAA 
GATGGACGAG 
TATCTATAAA 
GACCGTAGCG 
6CTGGAACAC 



CATCAGGCTA 
ACTATCACTT 
GCTGAAACTG 
ACAAACAAGC 
AAATATGCAG 
GTCGGTACTA 
ATCCAACACA 
GCCGGTCAGA 
AAGCTCTCTG 
GAATCCAGAC 
TCGTCCATAT 
ATTGCATCAT 
CTGAGTAAGT 
AAACATCTGC 
CGCCGTCATG 
GTATGT7VAGG 
GATCTGATGC 
AAAGCAGAAG 
ATGGATCTGA 
GCCGTGTACG 
TGCAATTTGC 
GCTATCGTAC 
CTTCGTAATT 
CTCGAATCTT 
ATCCTAATGC 
AGGCGGCAAA 



TCGAAGCCAA 
TGCAGAACTA 
AAGCGGGAGA 
CTATCGCCCG 
CAGTTATCGA 
CTTCGGTGGA 
ATGTACTCAA 
AAGGAACTGT 
CCGAGGTTAA 
GAGTGGACAG 
TCTATGTTTC 
TGATGGATCG 
CCGTGGAGCG 
TTGAGTACGA 
CTTTGATGGG 
CTCTGATTGA 
GTGCACTCGC 
AGCTGACCGA 
rCGCAGAAGT 
AGCGCATTCT 
GTGAAGCGGA 
TGCATACTAT 
CCGTTCAGAA 
ACGAACTGTT 
GTGCTCGCAT 
TAGAAATCCG 



AGAGCATGTG 
TTTCCGCATG 
GCTTTGGGAC 
TAAGGATATG 
AGA3ATTGTA 
AATATCCGAA 
TGCCAAATTG 
TACCATCGCA 
GAAAGCCGGG 
ACAGCTTCGT 
CCTTGAAGAT 
TTTAGGTTTC 
TGCTCAAAAG 
TGATGTAATG 
AGAGCGTATC 
CAATTATGCA 
GATAGAATCT 
TATGCTTTTC 
GGCCCACCCT 
AATCCCCATT 
1-GAAACTCAA 
CGATGAGTCT 
TGCCAGCTAC 
CCGCAAGATG 
ACCGGTACCG 
ACATGCAACC 



(2) INFORMATION FOR SEQ ID NO:138 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1125 base pairs 

(B) TYPE: nucleic acid 

(C) STRA/^DEDHESS: double 

(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DHA (genomic) 
liii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 
(ix) FEATURE: 

lA) NAME/KEY: raise feature 

(B) LOCATION 1...1125 

txi) SEQUHMCE DESCRIPTION: SEQ ID NO: 138 

ATGAATTTCT TAAAAAAAGA ACCGTTTAAA ATATTCTCTA TGATTTATCT 
ACAATAACAA ACC6TGCCGG TACAGAACGC GCCGTGA^S I^^I^fl 
GCCAATGGTC ATCGCGTATC ATTAGTCAGC GTTTGTAC^ A^S^2^ 
CAAGTAGAAA AAGGAATAGA AGTACACCAT cicG^^^J^ 
GCCCGCAAAA CAGTATATTT CAAGGCTTAT CGAAGGATAA AA^ctI?? 
GAACCGGTTT TATTGATAGG GACl'AATATT TTTA^C^?^ 2^^SctC 
AACAGAGGCA GAATATTTAC GATCGGATGC GAAcAtA^CT OTA^SS^ 
ATTACAAAAC GCATAAGGGG GTTTCTGTAT TCAGGGCTTG SgCCGT^G- 

aaaagagatc agcaatcgtt cgaggcaatc itacgtSac ^^SIS" 

CCCAATCAAG TTTCATTTAC TACAGTCCAA AGAG^T^A ^SS^i 



gctgttagat 
caacctgcat 

GCCTTCCTTC 
CAATGCATTA 
CAAGAAGCGT 
TCAGATCAGT 
TGCCCGCCCT 
AGCACTGACA 
ATATGTCATA 
ACAAATGTTG 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1569 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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GCGATTGGCA 
GTGCTGCGAr; 
ATGCTACGTA 
ACACCGGAAA 
GAAGGACTAC 
GATTGTCCGA 
ATGGAAGCAC 
CGTAAGAAAA 
TATGAATGTT 



GGCTTACCTA 
AAAGGCCTGA 
AAGAAATTGC 
TTCGCAAATA 
CIAATGGTACT 
CCGGCCCGAG 
ATGAAGACTT 
TGGGACAAGA 
GGAAGAAACT 



CCAGAAGGGT 
TTGGAAGCTT 
ATCTCGCAAT 
CTACGAATCA 
TCTCGAAGCA 
GGAACTGATC 
CGCGGATAAG 
ATCAGAGTTG 
ATTCGTCGAA 



TTTGAATTCA 
ATCATAGTOS 
ATGGAGTCGC 
TCTGCTATTT 
GAAGCATATG 
GAAAACGGTC 
TTACGCTTAT 
ATGCSTCAAAT 
ATCtSGCTACA 



TGATAGAAGA 
GAGATGGCGA 
AAATAGAAAT 
ATCTAATGAC 
CACTACCTAT 
GCAATGGTTT 
TGATGGATGA 
CCTACTCTCC 
TGAAT 



TGCATCACGA 
AAATGAATCG 
ACATCCATCT 
GTCCCGTTTC 
AATCTCATAC 
CCTTGTGCCA 
TGAAACTCIT 
GGCAAATATC 



660 
720 
780 
840 
900 
960 
1020 
1000 
1125 



(2) IHFORHATIOH rOR SEQ ID NO:139 

(t) SEQUENCE CHARACTERISTICS: 

<A1 LEHGTH: lOeS bas« pairs 
*B) TYPE: nucleic acid 

(C) STRAilDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) HOLECliLE TYPE: DHA (genomic) 

<iii| HYPOTHETICAL: NO 

(iv) AllTI-SEHSE: HO 

tvi) ORlGUmL SOURCE: 

lA) ORGANISM: PORYPHYROriOt-lAS GItJGTVALIS 

{Ix) FEATURE: 

(AJ WAIHE/KEY: misc_fs3ture 
<B) LOCATION i. . .1086 

Ixi) SEQUENCE DESCRIPTION: SEQ ID MO: 139 



ATGATTTATC 
AACTTGGCTA 
AAAGAAGGAG 
AGGCTTTATG 
AAAGCCCTAT 
ACAATTTTGT 
TCTTATGATA 
GATGCCGTTG 
CGCTCTAAAG 
ACTACTCACA 
ATGATAGAAG 
GGAGATGGCG 
CAAATAGAAA 
TATCTAATGA 
GCACTACCTA 
CGCAATGGTT 
TTGATGGATG 
TCCTACTCTC 
ATGAAT 



TGCTGTTAGA 
ACAACCTGCA 
AGCCTTCCTT 
GCAATGCATT 
ACAAGAAGCG 
CrCAGATCAG 
TTGCCCGCCC 
TAGCACTGAC 
CATATGTCAT 
AACAAATGTT 
ATGCATCACG 
AAAATGAATC 
TACATCCATC 
CGTCCCGTTT 
TAATCTCATA 
TCCTTGTGCC 
ATGAAACTCT 
CGGCAAATAT 



TACAATAACA 
TGCCAATGGT 
CCAAGTAGAA 
AGCCCGC7VAA 
TGAACCGGTT 
TAACAGAGGC 
TATTACAAAA 
AAAAAGAGAT 
ACCCAATCAA 
GGCGATTGGC 
AGTGCTGCGA 
GATGCTACGT 
TACACCGGAA 
CGAAGGACTA 
CGATTGl'CCG 
AATGGAAGCA 
TCGTAAGAAA 
CTATGAATGT 



AACCGTGCCG 
CATCGCGTAT 
AAAGGAATAG 
ACAGTATAl-r 
TTATTGATAG 
AGAATATTTA 
CGCATAAGGG 
CAGCAATCGT 
GTTTCATTTA 
AGGCTTACCT 
GAAAGGCCTG 
AAAGAAATTG 
ATTCGCAAAT 
CCAATGGTAC 
ACCGGCCCGA 
CATGAAGACT 
ATGGGACAAG 
TGGAAGAAAC 



GTACAGAACG 
CATTAGTCAG 
AAGTACACCA 
TCAAGGCTTA 
GGACTAATAT 
CGATCGGATG 
GGTTTCTGTA 
TCGAGGCAAT 
CTACAGTCCA 
ACCAGAAGGG 
ATTGGAAGCT 
CATCTCGCAA 
ACTACGAATC 
TTCTCGAAGC 
GGGAACTGAT 
TCGCGGATAA 
AATCAGAGTT 
TATTCGTCGA 



CGCCGTGATC 
CGTTTGTACA 
TCTCGGAATT 
TC6AAGGATA 
TTTTATCAAT 
CGAACATATC 
TTCAGGGCTT 
CTTACGTGGA 
AAGAGATGCT 
TTTTGAATTC 
TATCATAGTC 
TATGGAGTCG 
ATCTGCTATT 
AGAAGCATAT 
CGAAAACGGT 
GTTACGCTTA 
GATGGTCAAA 
AATCGGCTAC 



60 
120 
100 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1086 



(21 IMFORI-iATIOM FOR SEQ ID HO: 140 

li) SEQltEMCE CHARACTERISTICS; 

(A) LENGTH: 1920 base pairs 

(B) TYPE: nucleic acid 
(CI STRAHDEDNESSr double 
(D) TOPOLOGY: circular 

(ill ItOLECULE TYeE: DHA (genomic) 

tilil HYPOTHETICAL: NO 

(iv) Aim -SENSE: NO 

(vij ORIGI^^AL SOURCE: 

<A) ORGAKISIl: PORYPHYHWOHAS GIHGIV7U.IS 

(Ij:) FEATURE: 

(A) NAflE/KEV: miffc f eature 

(B) LOCATIOTJ 1...1920 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID HO: 140 



ATGGGAAAAA TCATTGGAAT TGACTTAGGC 
5 GGTAACGAAC CTATCGTTAT TACftAACAGT 
GCTTTT6TGG ATGGTGGCGA GCGTAAGC3TG 
AATCCGACCA AGACGATATA CTCTATCAAA 
TCCAGAGAAG TGGAGAGAGT GCCATTCAAG 
GTAGATATAG ACGGTCGTCT CTATACGCCG 

10 ATGAAGAAGA CGGCCGAAGA CTACCTCGGT 
CCCGCATACT TCAACGACGC TCAftCGTCAG 
CTGAAAGTTC GCCGTATTGT GAACGAGCCT 
AAGTCCAATA AGGATATGAA GATCGCTGTC 
TCTATCTTGG AATTGGGCGA CGGCGTTTTC 

15 CTOGGAGGAG ACGACTTCGA CCACGTGATC 
CAGGAW5GTG TGGATCTTCG CCAGGATCCT 
GAAAAAGCCA AGATAGAGCT CTCCAGCACT 
ATGCCGGTGA ACGGCATCCC CAAGCACTTG 
CAGTTGGCCG ATCGTCTGAT TCAGGCATGT 

20 GCCGGTATGT CACGTGGCGA TATCGATGAA 
CCTSCTATTC AGGAGATTGT GGAGAAGATC 
CCCGACGAAG TGGTAGCTGr GGGTSCC3CT 
AAGGATGTCT TGCTGTTQSA CGTTACCCCC 
GTGATGACTC GCTTGATCGA TGCCAATACC 

25 ACCACAGCAG TGGACAATCA ACCTTCGGTA 
TTGGCTAAGG ACAATAAGAG CATCGGCCGT 
CGTCAGACAC CGCAGATCGA AGTAACGTTT 
ACGGCTCATG ACAAAGCTAC CGGCAAGAAG 
TTGTCCGATG AT'JAGATCAA GCGCATGAAG 

30 AAGAAAGAGA AAGAACGTAT CGACAAGATC 
GAAAAGCAGT TGAAGGAGTT GGGAGACAAA 
ACCGCTCTCG ACAAACTGAA AGAAGCACAC 
GCCATGGCCG AACrGCAAAC CGCTCTTTCC 
GGAGCAGCCC AAGGTGGCGC ACAACCCGGT 

35 GCCGGTGATC AGCCCTCTGA CGACAA6AAC 



ACAACGAACT CTTGTGTCTC TGTATTGGAA 60 

GAGGGCAAGC GCACAACGCC CTCGGTAGTG 120 

GGCGATCCGC CCAAGCGTCA GGCCATCACC J 80 

CGCTTCATGG GCGAAACTTA CGATCAGGTT 2*10 

G-rAGTACtSTG G-SGACAATAA TACTCXX3CGC 300 

CAGGAAATT-r CGGCCATGAT CCTTCAGAAC 360 

CAGGAAGTAA CCiGAGGCCGT GATCACTGTG 420 

GCAACGAAAG AAGCAGGAGA GATCGCCGGC 4 80 

ACGGCAGCTT CTCTGGCCTA CGGTCTGGAC 54 0 

TTCGACTTGG GTGGCGGTAC CTTCGATATC 600 

GAAGTGAAAT CGACCAACGG TGATACGCAC 660 

ATTGACTGGC TGGCAGAAGA GTTCAAGTCT 720 

ATGGCTATGC AGCGTCTGAA AGAAGCTGCC 180 

TCATCTACGG AGATCAACCT CCCCTATATC 940 

GTGATGACGC TTACAAGGCC TAAGTTCGAG 900 

GTGGCACCCT GCGAAACGCC CTTGAAA6AT 360 

GTGATTCTCG TAGGTGGTTC CACACGTATT 1020 

TTCGGTAAGG CTCCGTCCAA GGGTGTGAAT 1080 

ATTCAAGGCG GTGTTCCGAC CGGTGAGGTA 114 0 

TTGTCGCTCG GTATCGAGAC TATGGGAGGC 1200 

ACTATCCCGA CGAAGAAGAG CGAAAtCTTT 1260 

GAGATTCATG TACTTCAGGG TGAGC5TTCT 1320 

TTCAACTTGC ACGGTA'ITGC TCCGGCGCCC ] 3fl0 

GACATCGATG CCAACGGTAT CCl'GAATGTA 14 4 0 

CAGAATATCC GCATCGAAGC CTCCAGCGGT Z500 

GAAGAGGCGC AGGCCAATGC CGAACCAGAT 1560 

AATCAGGCCG ACAGCATGAT CTTCCAGACG 1620 

TTCCCGGCCG ACAAGAAGGC TCCGATCGAT 1680 

AAAGCACAGG ATGTAGCTGC TATCGATACA 1"740 

GCAGCGGGCG AAGAGCTTTA CAAGAATGCC 1800 

CCGGACTTCG GCGGTGCTCA AGGTCCCTCT 1860 

GTCACAGACG TAGACTTCGA GGAAGTGAAG 1920 



{2» INFORMATION FOR SEQ ID NO: 141 

40 ti) SEQUENCE CHARACTERISTICS I 

(A) LENGTH: 1347 ba5« pairs 
IB) TYPE: nucleic acid 

(C) STRAtJDEDtlESS: double 

(D) TOPOLOGY: circular 

45 

Ui) MOLECULE TYPE: DNA (genomic) 

Ciii) HYPOTHETICAL: HO 

50 (iv) ANT JL -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISII: PORYPHYRa4aiA5 6IHGIVALIS 

55 (ix) FEATURE: 

(A) HAME/KEY: inisc_£eature 
(B> LOCATTOI4 1...1347 



60 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 141 



ATGCQCTACG ACTTAGCTAT CATCGGTGGA GGGCCGGCCG GTTATACGGC TGCCGAACGT 60 

GCTGCCAAAG GTGGCCTGAA AACCCTCCTA ATTGAGAAGA ATGCTCTCGG TGGT6TATGC 120 

CTCAACGAAG GATGTATACC GACCAAGACG CTACTCTACT CGGCCAAAGT GCTACATCAA 100 

ATTGCTACGG CATCTAAATA TGCAGTAAGT GGAACGGCCG ATGGACTTGA CCTCGGCAAG 24 0 

65 IfTGATTGCCA GAAAAGGTAA AATCATTCGC AAGCTGACTG CAGGCATCCG TTCACGOCTG 300 

ACAGAGGCCG GAGTAGAGAT CGTGACGGCA GAAGCTACCG TAACGGGATC CGATGCAGAC 360 

GGCATCATCG GCATTACTG-a GGGCGAAGCA CAGTACAAAG CTGCCAACCT GCTACTATGT 420 

ACCGGTTCGG AGACGTTTAT TCCACCCATC CCCGGAGTGG AGCAGACAGA GTATTGGACA 4 80 

AACCGTGAAG CTCTACAGAA CAAAGAGATT CCGACCTCTC TCGTCATCAT CGGTGGTGGA 54 0 

70 GTGATCGGAA TGGAGTTCGC TTCTrrcrTC AACGGTATCG GTACGCAAGT GCACGTGGTG 600 

GAGATGCTGC CGGAAATACT CAACGGTATC GATCCCQAAC ATGCAGCTAT GCTACGCSCT 660 

CACTATGAAA AAGAAGGAAT CAAATTCTAC CTCGGGCACA AAGTAACATC GGTTCGCAAC 720 

GGAGCTGTTA CGGTAGAATA CGAAGGAGAA AGCAAAGAGA TCGAAGGAGA ACGTATCCTG 780 

ATGAGTGTGG GACGTCGCCC CGTGCTGCAA GGATTCGAGT CGCTCGGATT GGTGCTTGCC 84 0 

75 GGCAAAGGTG TAAAGACTAA TGAGAGGATG CAAACTTCCC TGCCCAATGT CTATGCTGCA 900 



SUBSTTTUTE SHEET (Rale 26) (RCVAU) 



lu^ -^^ii .^ii o .Jt. 'in:; c3 m o hl.. lUi.iUJi 



wo 99/29870 
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GGTGATATTA 
GTAGATCAGA 
GTGTACACCA 
GGACGTGCCT 
AACGAACAAG 
GCACACCTCA 
ACCGGCATGA 
ATCCTAAAAG 



CAGGCrrCTC 
TTTTGGGCAA 
ATCCCGAGGT 
ACACTGTTCG 
GCAATGGAGA 
TT6GCAATCC 
CGGATCGACA 
AAACTCTCCC 



GCTTTTGGCA 
AACAGACGAA 
CGCCGGTGTG 

TCGCcrrccr 

GTGCAAACTA 
GGCCGGCGAA 
AATCGAACGA 
CGGAGGT 



CATACGGCTG 
ACGATGAC<rr 
GGAGAGACGG 
ATGGCCTTCT 
CTACTTGATG 
CTCATCGTAA 
ATCATATTCC 



TACGGGAAGC 
ACCGTGCCGT 
AAGAATCGCT 
CCGGTCGATT 
AAGAGAACCG 
CCGCTTGCCAT 
CTCATCCGAC 



AGAGGTAGCA 
ACCAGGTGTG 
TCGCAAAGCA 
TGTAGCAGAA 
CTTGATCGGA 
GGCCATCGAG 
TGTAGGCGAA 



960 
1020 
1080 

imo 

1200 
1260 
1320 
1241 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) IMFORMATIOrr fOR SEQ ID HO: 14 2 

li) SEQUESICE CHARACTERISTICS: 

tA) LENGl'H: 2823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(lil MOLECULE TYPE: DMA (genomic! 

Uiil HYPOTHETICAL: HO 

{iv} AIJTI-SEMSE: NO 

(vi) ORIGIUAL SOURCE: 

(Al ORGANISM: PORYeHYROHONAS GIMGIVALIS 

{ixj FEATURE: 

\A) tWE/KEY: misc_feature 
(B) LOCATION 1. , .2823 

(Jtl) SEQUENCE DESCRII>TIOt-r: SEQ ID NO: 142 



ATGGAATTGA 
CTTTCTGCCC 
GACAACGGAT 
TTTATCGCAC 

, TTCTTGGAAC 
TATCTCGAAA 

, AAGACGGAAT 
TTGCTTATCC 

' GAGCGCGGTG 
GAGGCt'ATAC 
CTGATGGACG 
TATCGTCCCG 
AACAAGATCA 
TATACGCCGG 
ACTACCACGC 
TCGATATTCG 
OJTCrGTCCG 
TCTAACTTCA 
GAGGGTGAAG 
TTCGGTATCA 
AATCAATACA 
TACTTCACCG 
TTTGCTCCTC 
GTGAAGAATG 
GAAGCAGACT 
GACGAAGTCT 
GAGAAGAAAG 
TACCTCAAGA 
GGTATCCTCT 
GGTGGCTTGG 
TCCGTATCTC 
GATATGGAAA 
G7VAGCGTTCA 
CCGATGGCTG 
AAACCCATGA 
AATGAGCGAT 
GCCAAGATGA 
GATAAGATGA 
GAGAAGGAAA 
TATACGCrCA 
ACGGCTACCG 
GAGCAATATC 



AAAGATTTTT 
AACAGGCTCA 
TGACTTATTT 
AAAAGGTAGG 
ACATGGCTTT 
CGATCGGTGT 
ATACGATAAT 
TGCATGATTG 
TGATCCAGGA 
TTGCCAAGGC 
TCGTGCTCAA 
ACCTGCAAGG 
7UVGAACTCTT 
TAGAGGACAA 
AGCTCTCCAT 
GACTTGTGGA 
AGATTACTCA 
TGTACATCAC 
CGGAGAAAGC 
CC7\AAGGCGA 
ACGAAAGAGA 
ATGGCGGCTA 
AGGTTCCTCT 
CTGTCGTTAC 
TCCTCGCTGC 
CCGACCAAAA 
ATCAGAAGTT 
AGACCGAT'rr 
CCGGAAAGCA 
GCAACTTCGA 
CCTCTTTGTC 
CrTTCTTCCA 
AGGCCACACA 
CGCTTATGGA. 
AAGCTGCTGA 
TCGCTGATGC 
AGCCATTGAT 
ATAAGGCTCA 
TGGATACTCC 
AGAACAGTCT 
TTCGCGAGAA 
CTCAGCCCAA 



ATCACTTGGT 
GCCACTCCCT 
CATCCGTCAC 
TTCTArrCTT 
CAACGGTACG 
ACGTTTCGCT 
GGATGTGCCG 
GAGTAACAAT 
AGACTGGCGT 
TATGCCGGGT 
CTTCAAGCAT 
TCTGGTGATC 
CAAGGACGTT 
CGATGAGCCT 
CAGCTTCAAG 
AGACTATATG 
CAAGCCTAAC 
CCAGACTAAG 
GATGAACGCA 
ATACGATCGT 
CAAGCGTAAG 
TATCCCGGGT 
GGAAGCATTC 
CCTCACCGGT 
TTTCAAAGCT 
ATTGATGGAG 
CGGTACCACG 
CAAATCAAAC 
TGCTCCCAAC 
TGCTATCCAG 
TCTGCTCAGT 
GTTGATCTAT 
GGAAAAGTTG 
CTCTATCCGT 
CGTGGAGAAA 
CGGCGACTTT 
CGAAACTTAT 
GCTACCGGCT 
TTCGACTACT 
CCTGCTGGAA 
GGAAGGCGGT 
GGCTCTGATG 



CTTCTGCTTG 
ACAGATCCGG 
AACGAGAACC 
GAAGAAGATA 
AAGAACTTCC 
CAGAACCTGA 
ACTACACGTC 
ATTACCCTCG 
GCTCGTCGCG 
AATAAATATG 
GATGAGCTGC 
GTGGGAGATA 
CCTGCTCCCG 
ATCGTAGCCA 
AGCGACCCCA 
AAACAGGTGA 
GCTCCTTTCC 
GACGCATTCA 
TTGGTGGCAG 
GCACGCACGA 
AACAATGCTT 
ATTCAGGTGG 
AATCAGGCTA 
CCTTCAAAGG 
GCTCGTCAGC 
AAAGCTCCTA 
GAACTTACCC 
GAAATCCTGA 
CAATCTGTGA 
CTGGATAAGG 
GAAGGTCTTT 
CTCCAAATGA 
TACAATAACT 
CATACCATGT 
GTAAA'ITACG 
ATGTTCTTCT 
CTTGCTTCAT 
GCCCGTTCGG 
ATATTCGATG 
GTCTTCTCAG 
GCATACAGTG 
CAGATCTATT 



TGGGATTCAT 
CTGTTCGTGT 
CGAAAGATCG 
GCCAGTCCGG 
CCGGTAAGAA 
ACGCTTCTAC 
AGGGAATCAT 
ACGGGCATGA 
ATGCCAACCT 
CAGAACGCAT 
GCAACTATTA 
TCGATGTGGA 
TGAATCCAGC 
TTGCTACCGA 
CTCCTCAAGA 
TCACTACAGC 
TCAGTGCAGG 
ATTTTGTTCC 
AGATAGAAAG 
ATGTGCTCAA 
ATGCCAATGA 
AATATCAGAC 
TTGCCCAAAT 
CTGAAGCCAA 
AGAAAGTAGA 
AGGCCGGAAA 
TTAGCAATGG 
TGAGTGCTCT 
TGAATTCGTT 
TGCTGACAGG 
CGGGCAAAAC 
CTGCTAACCG 
TGAAAAATCA 
AOGGCGATAA 
ATCAGGTAAT 
TTATCGGTAA 
TGCCCAACCT 
GAAAGATCGA 
TCGTGTCCGG 
CCGTAATGGA 
TGGCTGCATT 
TCCCCACGGA 



TCCGATGAAG 
CGGTAAGTTG 
TGCGGATTTC 
TTTGGCTCAC 
CTTGATCAAC 
CGGATTCGAC 
CGACTCCTGC 
GATCGACGAG 
TCGTATGTTC 
GCCCATCGGT 
TAAGAAATGG 
CTATGTGGAG 
AGAGCGTATC 
TGCl'GAGGCr 
AGTGCGAGGA 
CGTGAATGAG 
AGCI'TTCITC 
CACGGTTCGT 
CCTCCGTCAG 
GCGATACGAG 
ATACTCCACC 
GGTGAATGCT 
GATCGATCCG 
GATTCCGAGC 
AGCCAAGAAA 
GATCGTTTCC 
CATCAAAGTA 
CAGCCCGCCT 
CATGAACGTG 
TCGCTCTGCT 
GACl'GTAGAA 
CAAGGATCCC 
GGAAGCCAAC 
TCCGATGATG 
GGCTTTCTAC 
TCTGGATGAA 
CAAGCGTGGC 
TTGCAAGTTC 
AAATGTGGAA 
TCAGGTGTAC 
CGGCGGTCTC 
TCCTGCTCGT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
€60 
720 
780 
040 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1»60 
1920 
19€0 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 
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GCCGAGGAAA TGAATGCTAT CGTTTTTGCT 
AATGTGGAAT ACTTTAACAA GACTATCGAA 
CGTGAGAATC GTTTCTGGCT CGAAGCCATO 
ATCACAGACT ACGAATCCGT ACTGAACGGT 
GCAGACCTCT TGAAGCAGCA GAATCGGGTT 
CAA 



GAGTTGGAGA AGCTTGCCAA GGAGGGCCCC 
AACCTGAATA AGCAGCACAA AGAAAGTCTG 
AAGGCGTCTT TCTTCGAAGG AAATGACTTC 
CTTACTCCTG CTGAAITGCA AAAGTTTGCG 
GTTGTCATOA TGGCTCCTGT TGCAAAGGCT 



2500 
2640 
2700 
2T60 
2820 
2823 



10 



15 



(21 IMFORIIATIOH FOR SEQ ID HO: 143 

iij SEOUEMCE CHARACTERISTICS: 

(A) LENGTH; 2052 base pairs 
(BJ TYPE: nucleic acid 
(CI STRAI^DEDNESS: double 
(Dl TOPOLOGY: circular 



20 



(11) MOLECLrLE TYPE: DHA (genoioicl 
tiii) HYPOTHETICAL; NO 
(iv) AllTI- SENSE: NO 



25 



30 



35 



40 



45 



50 



55 



60 



65 



GIHGIVALXS 



(vi) ORIGINAL SOURCE; 

(Ai oRGA^^IS^'I: eoRYPHYRaionAa 

ax) FEATURE: 

(Al WWIE/KEY: misc^feature 
( B ) LOCATION 1 . . , 2052 



(Kl) SEQLiENCE DESCRIPTION: SEQ ID IIOzl-JB 



ATGAGTAAGA 
TTCCTGTACA 
ACGCAGAAGC 
CGCGTAACGG 
GGGATGACCG 
GAGTTTCTTG 
TTTTACTCGG 
GATGCTACGG 
<3ACAAGGCTG 
CTCAAAAAAG 
ATCATTTTCG 
AATCAGATCA 
GACTATAAGG 
CACCTCAATG 
AACAACTTGG 
GATGAAGTAC 
CCGGATATTC 
ATCTCGTCTC 
CGCCCCACAT 
ACGGATGAGA 
CACAAGTACA 
GGACAGGTAG 
GCATCCGACA 
AGCCTGCTGG 
AACAATCTGA 
ACTCTCGTGA 
GCTTTCGyiAT 
CGCCGTATGC 
GATTCGTACA 
GAGAAAGAAT 
GCGGAAGAGG 
GCCACGAAGG 
AACGATCAAC 
GCTCTGCT CG 
CAGCGTCTTC 



AAGGAACAAT 
GCGACCATGA 
TGAAAACGCT 
TCAGCGTGGA 
AAGAGGAGGT 
AAAAGTACAA 
CTTTCATGGT 
CGGTGAAATG 
ACCGTGGCAC 
AAAAGATAGA 
GCAAGAAGCA 
ACGACACACA 
AATTTTACCG 
TGGACTATCC 
ATCTGCAGCG 
AGCGTATCGT 
CCCTCAACGT 
ATATCACCAA 
TCGAGGAGAA 
AGTTCTATGA 
CGTTCGACGA 
TGTATCTCTA 
AAGGCTACAG 
AGCAAAAOTT 
TCCGCAAGGA 
AGCTGTTCGA 
CGCTCGGAGC 
GCGATATGGC 
ATCTGGTACT 
CGGTAGAGCC 
CCAAGCTGCT 
AA6CCAAGGA 
TGACCAAATA 
GAAGCCGATT 
TC 



CGGGGTAACG 
GATATTCCTG 
TACATCCGTC 
TGAAGTGGCA 
GGAGAAGTAC 
AGACGACAAG 
GTCCGAGCGA 
GAGCTGCGAC 
CGACATCGTG 
GGGGCTCCTC 
GGAATGGAAA 
TCCTGCCTGG 
TTCGCfCTAT 
GTTCAATCTG 
CAACAAGATT 
GCCGGACTTC 
ATCGCCfCTCC 
GAAGGTGGCA 
ATGGGATAGT 
6CGTGCAGCC 
ATACCGAACG 
TGCTACGGAC 
CGTGATGCTG 
GGAGAAGACA 
GGAAAGAGCC 
AGCACGCCTG 
CGAAGGTGAA 
ACAGCTGCAG 
TAATACCGAT 
TTCGCTCACA 
CGATGAGGAA 
GAACAACGCC 
TGCTCAGGAC 
GCTGACGGGA 



AGCGACAATA 
CGTGAGATCG 
GGCGAATTCA 
CGCACGATCA 
ATCAATCAGA 
GCCGCCA'ITA 
GTGGACCTGA 
GGATCGCCCG 
ATGCACATCG 
GGCAAATACT 
GACGGCAAGA 
ACCAAAAAGC 
CCCATGTCCG 
ACAGGTATCC 
CAGCTCTACT 
CTCACCCTCC 
TATCTGCAGA 
GACCGTCTGG 
CTGAAGCTCT 
AAATTCTTCC 
CTCGTCGAAG 
AAGCATGGAC 
TTGGATGGTC 
CACTTTGTCC 
GAAGTGAAAC 
CCACGGGACG 
GCCATCCTTA 
CCGCrAATGA 
CATCCGCTCA 
GAGCTTAGAG 
AAAGGGAAGA 
GTCGAACAGG 
AACGAGCTGA 
GAGGCTTTGG 



TATTCCCCGT 
TCTCCAATGC 
AAGGCGAGAC 
CGGTCAGCGA 
TTGCTTTCTC 
TCGGCCACTT 
TCACGCGCTC 
AATACACGCT 
ATGAGGAGAA 
GTAAGTTCCT 
TGCAAGATAC 
CTGCCGACCT 
AAGAGCCTCT 
TCTATTTCCC 
GCAATCAGGT 
TCCACGGGGT 
GCGATGCCAA 
AAGAAATTTT 
TCGTCGAATA 
TTTTCACCGA 
GTGTACAGAC 
AGTACAGCCA 
AGTTGGATCC 
GTCTCGATAG 
TGTCCGATAC 
AGAAGAAGCA 
TCACy\CAAGC 
GCTTCTACGG 
TCGACAGGGT 
CGAAAATCGC 
AACCGGAGGA 
CCAAAACCGA 
TAGGTCAGCT 
CCGAATTCAT 



CATCAAAAAA 
CGTGGATGCT 
GGGTGACCTC 
CCGCGGCGTA 
CACTGCGGAA 
CGGACTCGGA 
TTTCCGAGAA 
CGAACCl'GCG 
TAGCGAGTTC 
TACCGTGCCG 
GGACGAGGAC 
CAAGGACGAA 
CTTCTGGATC 
GAAGATCAAA 
TTACGTCACC 
CATCGATTCG 
TGTGAAGAAG 
CAAAAACGAC 
CGGTATGCTG 
TATGGACGGA 
GGATAAGGAC 
CGTGAAACGT 
GCATATCGTG 
CGATACGATC 
GGAGGGCCCC 
CTTCAATGTA 
CGAATTCATG 
CGAACTCCCC 
ACTCTCCGGT 
CGAGCTGAAA 
AATCCCTGTT 
AGGCAGTATC 
CATCGACTTG 
TCGTCGCAGC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
760 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2052 



70 



75 



(2) IDFORflATIOW EX)R SEQ ID NO: 1-1 4 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1392 base pales 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



tii) MOLECULE TYPE: DMA t genomic) 

(ill) HYPOTHEnCAL: NO 

(iv) Aim -SENSE; NO 

(vi) ORIGINAL SOURCE: 

tA) ORGAWISM: PORY PHYROMONAS GUIGIVALIS 

(ix) FEATURE: 

(A) NAHE/KEY: misc feature 
(BJ LOCATION 1...1392 

<xij SEQUENCE DESCRIPTIW: SEQ ID HO: 144 



ATGGAAAAAC 
CGTCTGGTGC 
TTGTCCATAA 
AATTTCTCCC 
TTCACTTCCT 
GGAATGAGGC 
CGTACCATAG 
CAGTCTGACA 
GCCCATATCA 
GCTCCCACCC 
GCCGAAGCTC 
GGTGGGATrC 
ACCAATGCTT 
CTGGACGAAA 
GAACAGACAT 
TCGGCTACGA 
TACTATCGAA 
ATACAGCTGC 
GCCGTTTGGA 
CGCGAACTGA 
GCCCGGGATQ 
GCTTTGACCG 
GTTAGTC6TG 
AAATACGGAC 



TGATCGATAT 
TCAAGCGAGC 
TGCGGAATCC 
TTTCGACCTC 
GCCCTGTCAT 
TTGGAGCTTT 
ATACGGCCTT 
GA'SATACAGC 
TAGGCCGGAG 
ATGCCTCTGT 
TGCACCGTGG 
CCGAAAGTTT 
TTTCCGACAG 
TACGCGAACT 
TCGAGCCGTT 
ATGCTTCCTT 
TCAACCTGAT 
TGGTGGAAGC 
GTGCGGAAGC 
AAAACGTAGT 
TGGCTGACTT 
ACATGGAGGA 
CTGCACGAGC 
TG 



TTTGGTCGTA 
GGGCTATAAT 
TGATGGCGGC 
CGGCAGGGAA 
ACTGATGACG 
CGACTTCATA 
GCATCTGGCT 
CCGTCAGCCG 
CGATGCCATC 
CCTGATCACG 
GAGCAAACGA 
GTTCGAAAGT 
GAAAGGACGG 
ACCGGTCGGC 
GGGCGAGAGC 
GGAGCGAATG 
ACATCTGCAT 
CTTCAGTGAA 
TATGCGACGT 
GGAGCGTACG 
CGGTTCGCAG 
AGCTGCTATC 
CTTGGGATTG 



GACGATGATG 
CCCGTTATAG 
TGTAAGCCGG 
GGATTGGAAC 
GCTTGGGCTT 
GGCAAGCCAT 
GCTCCCTCAG 
AAAGCTACAG 
TGTAAGATCA 
GGCGAGAGCG 
GCCTCAGCCC 
GAGCTGTTCG 
TTCGAGCTGG 
AACCAAGTAA 
GTCTCCCACC 
GTAGCCGAAG 
CTGCCTCCGC 
GCCTTTGCCC 
ATCTGTGCCA 
CTATTGCl'CT 
GTGACGGCAG 
CGAGAGACGC 
AGCCGGGGAG 



TGGCAGTCTG 
CCAACAGTCC 
CTGTGATTCT 
TACTGGAGAA 
CGATTCCACT 
GGGACAACGA 
CTGTGGCGAA 
TCCAAGAGAA 
AGG7U\CGGAT 
GTACGGGCAA 
CATTCGTCAA 
GACATAAGAA 
CTGATGGCGG 
AACTGCTGCG 
GAGTGGACAT 
GIACGTTTCAG 
TGCGTGAGCG 
AATCG7^ACGG 
TGCCCCTACC 
CGGGATCGAG 
CAGACCACTC 
TGACTAAATA 
CTCTTTACCG 



TGCCGCACTG 
CGACGAAGCT 
GATGGATATG 
GATGCAGATA 
GGCACTGGAG 
TCGGCTCCTT 
TCCATCGGAA 
TGACCCCTGT 
ACGCCGCATA 
AGAGTTGATA 
GGTCAATTTG 
AGGAGCTTTT 
CACGATCTTT 
AGTGCTACAG 
CCGTGTGGTA 
AGAGGACCTC 
TCAGGAGGAT 
ATTGCCCCAT 
GGGCAATGTA 
AGAAATCAGT 
CGACGAACGG 
CAACGGCAAC 
GCGAATGGAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
90O 
960 
1020 

loao 

1140 
1200 
1260 
1320 
1380 
1392 



45 



50 



55 



60 



65 



70 



75 



12) lUFORMATION FOR SEQ ID NO: 14 5 

(1) SEC3UENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 
(81 TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

tli) MOLECULE TYPE: PNA {genomic) 

(liil HYPOTHETICAL: NO 

(Iv) Aim -SENSE r NO 

(vij ORIGINAL SOURCE: 

(A) ORGA>aSM: PORYPHYROHONAS GIHGIVALIS 

Hk) FEATURE: 

(A) NAME/KEY: mlsc feature 
( b ) LOCATION 1 . . . 7S0 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID HO: 145 



ATGCTTAAGA 
AATCTGGAGA 
AGTACGCTCT 
ACATTCAATG 
TTTCTCAGTT 
GCAGCTGTCA 
CTCAAGATGA 
TCTGTGAACG 
ATGCTCGAAC 
CTCCGCATCG 



TAAAGAACCT 
TCAATGCCGG 
CTTCCGTTTT 
GAATCGACCT 
TCCAATATCC 
ATGAACATAG 
TGCGAGAGAA 
AAGGCTTCTC 
CCAACCTGGC 
TAuCAGGCGG 



CCACGCCACA 
AGAGATTCAT 
GGTGGGACAT 
GCTCGAACTC 
GGTCGAGATC 
GAAAGCGATC 
GCGTGCCATT 
CGGTGGAGAA 
TATriTGGAC 
GGTAAACCGA 



GTACAGGGCA 
GCTATCATCG 
CCCTCCTTTG 
GAACCGGAAG 
CCGGGCGTCA 
GGAGCAGAAC 
GTGGAGCTGG 
TWWVGAGGA 
GAAACCGATA 
CTCCGCTCTC 



AAGAGATATT 
GGCCGAACGG 
AAGTCACGGA 
AACGTGCACA 
GCATGGTGAA 
CCGTATCGGC 
ACAACAAATT 
ACGAAATCTT 
GCCGCC7CG}\ 
CGGAGAATGC 



GAAAGGAATC 
ATCGGGGAAA 
AGGAGAGGTG 
CCTCGGACTC 
TTTCATGAGG 
AAGCGACTTC 
GGCCAGCCGT 
CCAAATGGCT 
TATCGACGCT 
TGCTATTGTG 



60 
120 
180 
240 
300 
360 
420 
480 
&40 
600 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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ATCACACACT ATCACCGnT GCTCGAGTAC ATCAAGCCGG ACTTCGTACA CGTCCTTTAC 660 
AAGGGGCGCA TCGTCAAGTC GGGAGGAGCC GAGCTGGCTC TCACGCTCGA AGAAAAAGGC 720 
TACGACrGGA TCAAGGAAGA GATAGGAGAA 750 



10 



15 



Ul INrOW'lATIOrt FOR SEQ ID K0:1'16 

(1) SEQUEJICE CHARACTERISTICS: 

CA) LENGTH: 1383 base pairs 

(B) TYPE; nucleic acid 

(C) STRATJDEDNESS: double 

(D) TOPOLOGY: circular 

(iiJ MOLECULE TYPE: DHA tgenoiivic) 
tlii) HYPOTHETICAL: NO 



20 



25 



liv) AHTI-SEJ-ISE: HO 

(Vl) ORIGINAL SOURCE: 

<A) QRGAHISM: PORYPHYRWOWAS GIHGTVALIS 

UX) FEATURE: 

(A) NAl'lE/KEY: misc_feature 

(B) LOCATIOTJ 1 . . .1383 



(xi} SEQUENCE DESCRIPTIOM: SEQ ID HO:l<16 



30 



35 



40 



45 



50 



ATGGCTAAGG 
CAAGGCAACT 
CCGGCATCGG 
AGACTCTTAC 
TTCGACCGCG 
CCGGGAATCG 
ACGCTCTATG 
GGGCAAGCCA 
CGTGCAGAAla 
GAGGAAATGG 
CTCAAATACT 
GGTAGCATAG 
GGGGATAAGC 
TCCGAGCTGG 
GAACATCTCA 
GAGGGCATTC 
AATCCGCAGC 
CiGGAGAAAC 
GGTATCAAAA 
AGTCTGGACA 
GAGATACGTC 
AAAGAGATAT 
CGGCTCGTGC 
GAA 



AGAAAACGAT 
GCAATOCCTG 
GCAAGCATGC 
AGGATGTGGA 
TACi'GGGTGG 
GTAAGTCCAC 
TGTCGGGCGA 
TGAATGGGTG 
AACTCACACC 
AAAGCTCGGC 
GCAAGACTAC 
CCGGACCGAA 
ATCATCrCTA 
GGATATACGA 
TCACACGCAA 
GCCCGATACT 
GTTC6GCCAC 
GTGCCGGCTT 
TAGCCGATCC 
TCGTTATCCC 
CCGTGAGCCG 
TGGTACCGGC 
CGGTCAGAAA 



CTACGTCTGC 
TGtiAGAGTGG 
AGCCAAGAGT 
GTCCGGCGAT 
AGGAATTGTC 
GCTTATCCTC 
AGAAAGTGCC 
CTACGTATAC 
CGATCTCCTC 
CGGCAGCGTG 
GGGTATCCCC 
GGTGCTGGAG 
CCGGATACTC 
GATGCGGCAG 
TAGGGAAGAC 
CATCGAAGCG 
GGGCTTOGAT 
CAAGCTCATA 
GGCTACGGAT 
GCCGGCCGTA 
CATCGAGCAG 
CGATAATTTC 
GGTGGAG(aAA 



CGTTCGTGCG 
AACTGCATTG 
TTTATGCCTC 
GAAGAGCGTA 
AAAGGAGCAT 
CAGACGGTGC 
CGACAACTGA 
TGCGAAACGA 
GTGATAGACT 
GGGCAGATCC 
GTCATCGTCA 
CATATAGTGG 
CGAGGACAGA 
GACGGTCTGC 
CTCAGTGGCA 
CAGGCTTTGG 
ATTCGGOGGA 
CAGAAGGATG 
CTGGCCGTTA 
TGCATGACGG 
CGCATAACGG 
CGCCAGGAGG 
GCCTTCCCCC 



GAACCAAATA 
ATGAGGAGAA 
GGGAGCAGGA 
TTCGCCTCGG 
TTGTCCTGCT 
TGCGTCTGCC 
AGATGCGCGC 
ATATAGAGAG 
CTATACAGAC 
GCGAATGTGC 
TCGGACACAT 
ATACGGTGCT 
AGAACCGCTA 
GTGGCGTGGA 
TAGCCATAGC 
TCAGCTCGGC 
TGAACATGCT 
TGTTTCTGAA 
TCTCGGCAGT 
GCGAGGTCGG 
AAGCGCGTCG 
ATGCCGGCCG 
ATCTGTTCTC 



CGCCAAATGG 
GGTGCCGGCA 
•CAACCGGCCA 
CGATGAAGAG 
TGGCGGCGAG 
GCAGTTGCGC 
CGAACOACTG 
GATACTCL'CC 
GGTCTATACC 
CGCCTTACTG 
CACCAAAGAA 
TCTCTTCQAC 
TGGCAGTACT 
GAATCCGAGC 
CGTAGCGATG 
CATTTATGCC 
CTTAGCCGTA 
CATTGCCGGA 
GCTGGCGTCG 
ACTCTCCGGA 
CATAGGGTTC 
CTTCGGTATT 
GAAAGGAAGA 
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12) IMFOm-JATlOH FOR SEQ ID NO: 147 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 
(D) TYPE; nucleic acid 
[C\ STRANDEDNESS: double 
ID» TOPOLOGY: circular 

{ii) MOLECULE TiPE: DMA (qenomlc) 

(ili) HYPOTHETICAL: NO 

<lv) ANTI -SENSE: NO 

tvi) ORIGINAL SOURCE: 

(A) ORGAIIISII: PORYPHYRWONAS GIHGIVALIS 

(i>;) FEATURE: 

(A) NAME/KEY: misc_f eaturc 

(B) LOCATION 1...813 



STJBSnTUTE SHEET (Rule 26) (RO/AU) 



WO 99^9870 PCr/AU98/01023 

100/490 

txi) SEQUENCE DESCRIPTIOH: SEQ ID NO: 14 7 

ATGAATAGCT^ GACATCTGAC AATCACAATC ATTGCCGGCC TCTCCCTCTT T13TACTGACA €0 

TTGGGCGGCT GCTCCGTAGC CXAACAAGAT ACGCAGTGGA CTCTCGGCGG AAAGCTCTTT 120 

5 ACTTCX3GCGT GGATACAACG TTCGGCCGAA TATCAAGCGC TTl'GCATTCA GGCATAC7VAC 180 

ATCGCTACGG AAAGAGtGGA CGCTCTACCG GCAGAACGTA AACAAGGAGA TAGGCCTTAT 24 0 

GCCATCGTAA CGGACATAGA CGAAACCATT TTGGACAATA CGCCTAACTC CGTGTATCAG 300 

GCTCTCAGCG GCAAGGATTA TGATGAAGAG ACTTGGGGGA AATGGTGTGC ACAGGCCGAT 360 

GCCGACACAC TGGCAGGAGC TTTGTCTTTC TTCCTCCATG CAGCGAACAA GGGGATCGAG 4 20 

10 GTCrPTTACG TCACCAACCG CAGAGACAAT CT6CGCGAAG CAACTCTTCA GAACCTTCAG 4 80 

CGTTACGGAT TCCCCTTTGC CGATGAAGAA CAT'rrGCTTA CGACCCATGG GCCATCCGAC 540 

AAAGAACCCC GTCGGCTCAA AATACAAGAA CAGTATGAAA TAGTATTGCT CATAGGAGAC 600 

AACTTGGGCG ACTTCCACCA CTTCTTCAAT ACGAAAGAAG AGTCCGGACG CAAACAGGCT 660 

CTGGGCCTGA CAGCCGGGGA GTTTGGCCGG CACTTCATCA TGCTGCCCAA TCCCAACTAC 720 

15 GGATCTTGGG AACCGGCATG GTACGGCGGG AAGTATCCGC CACTGCCCGA AAGAGACAAA 780 

GCACTTAAAC AACTGCACTC ACAGAACAGC AGA 013 



2p: 



(2) I14t'OR;tfiTI0>>J FOR SEQ ID HO: 143 



(IJ SEQUENCE CHARACTERISTICS: 

(A> LENGTH: 1251 base pairs 

(B) TYPE; nucleic acid 

(C) STRAMDEDMESS: double 
25 (D) TOPOLOGY; circular 



30 



35 



(11) HOLEOJLE TYPE: DtiA (genomic) 

(tni HYPOTHETICAL: NO 

(iv) AMTI-SEHSE: NO 

[vi) ORIGINAL SOURCE: 

(A J ORGANISM: PORYPHYROMONAS GINGIVAL IS 



(iy.) FEATURE: 

(A) NAHE/KEY: misc_f cature 

(B) LOCATIOH 1. . .1251 

40 (xl) SEQUEHCE DESCRIPTION: SEQ ID WO: 148 

AT6AGCACCA ATATAGATGT ACAACAGATC AAACAGCGTT TCGGCATCAT CGGTAGCAGT 60 

CC6CTGATGG AACATGCCAT ACGAGTGGCA GCACAGGTGG CTCCTACCGA CATGTCCGTC 120 

CTCGTGACGG GGGAGAGCGG TTCGGGGAAA GAGTTCTTCC CACAGATAAT CCACTACTAC 160 

45 AGCGCCCGGA AACATCATAG CTACATTGCA GTCAATTGCG GAGCCATCCC CGAAGGAACC 240 

ATCGATTCCG AGCTGTTCGG ACACCGCAAA GGTTCCTTTA CC6GAGCCGT ATCGGATCGC 300 

AAGGGGTACT TCGAAGAAGC ATCCGGCGGC ACGATCTTTC TGCACGAAGT GGGCGAACTG 360 

CCTTTGCCCA CGCAGGCGAG GCTGCTGAGG GTGCTGGAGA CGGGC6AGTT CATCCCCGTA 420 

GGAGCCAGCC AGTCGCAGAA GACGGATGTC CGTATCGTAG CGGCGACGAA TGTGAACCTC 480 

50 AAGGAGGCGG TAGCGAACGG GAAGTTCCGG GAAGACCTCT TCTTCCGGCT CAATACGGTA 54 0 

CCGATCGAGG TGCCTGCGCT GCGTATGCGA CCGGACGACG TGCCCTTGCT TTTTCGOCGA 600 

TTCGCCGCCG ACAGCGCCGA GAAGTATCGG ATGCCTCCGC TGCGCCTATC GGACGAAGCC 660 

CGTACCATAT TAATGCGTTA CCGCTGGCCC GGCAATGTGC GAGAGCTGCG CAATATAACC 720 

GACAGGCTGA GCATCCl'GGA GGAGGAGCGG ACGGTATCGG CAGAGACCAT CACTCGCTAC 780 

55 CTGGACGCTG AGGGGATGCA AGACCTCCAC CCCGTCGTGA TCCGACGGAA CGAAACGACC 84 0 

GAAGCGGACA AACAAATCCC CCATTACGAG CGCGAAATCA TCTACCAGGT GCTATACGAT 900 

ATGAAGAAAG AGATAGCCGA TTTGAAGGGG ATGATGAACC GCCTGGCGCA CCACGAACAG 960 

CCCTCATGGC CTGTAGGGTC GGACGTCTGG GGCAACGACG ACAAGCGCAC CGCAGATCCG 1020 

AA6TGGGGCG TCAGCACGCA CAAGGCCCCC ATCGCGAACG CGGCAGAACC CGTGGAGCCG 1080 

60 ATACAGGAAG CCAGCGAATA CACCGAGGAT CCGGTTTCGC TGGAGGAGGT AGAGAAGAAA 1140 

ATGATT^'CCC TTGCATTGGA ACGCCACGGC GGAAGGCGCA AGCAGACAGC CGAGGAACTG 1200 

AAGATTTCGG AGCGGACACT ATACCGTAAA Al'CAAGGAGT ATGGACTGGA A 1251 



65 iZ) IHFORl'IATION FOR SEQ ID HO: 14 9 

(1) SEQUENCE CHARACTERISTICS: 

I A) LF^-IGTH: 1806 base palrfi 
(B) TYPE: nucleic acid 
70 (CJ STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

(ii> MOLECULE TYPE: DNA (genomic) 

75 (iil> HYPOTHETICAL: HO 



SUBSTTTUTE SHEET (Rule 26) (RCVAU) 



n'^h iF"" iP^i ^ 

:fUi ^rm «ii 




- a . n=ni .rtP .fn^, 

t Oi iu lUi o rtn:: u u«m 
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(iv) AHTI-SEMSE: MO 

tvi) ORIGIHAL SOURCE: 

(A) ORGAI^ISM: PORYPHYROMONAS GINGIVALIS 

{ixj FEATURF.: 

(A) HAME/KEY: mlsc_f eature 

(B) LOCATION 1...1806 

(Xl) SEQLFENCE DESCRIPTlOtJ: SEQ ID MO: 14 9 



ATGATCCrAC 

GCGCGAAGTC 

GGCCATCGTC 

6AGATCCGAC 

GCAGAAGCCT 

CGCGAAGAAG 

GATGGCGTGC 

TTCTGGAAAG 

GCAGAGAAGA 

GAGCTGATGC 

TACTTCCGAG 

CGTCCCCCTT 

GTAGAACGCC 

GCTACCGCTG 

GCTCCCG-rAG 

GAAG7VAGATG 

AATGGCATGG 

CTGCAGGGTG 

CCGGCTATCA 

CAGGGAGGTC 

TTCAATCCCC 

AAACCTTTTT 

GCCTTGGGAG 

GGAGCCTATC 

TCACTCAGTT 

GGCAAAGCAG 

AATCGGGATT 

ATAATAGACA 

GATCGACTGC 

TCGGCCGAAT 

GGAGAA 



TCTTCX3GCGG 

CGTTTTTCTA 

TGACAGGAGC 

TGATCGTGGA 

CTGAACAAAC 

GTATCGTCTG 

AGCGTCTGCT 

AGCGCACCAC 

ACGGCTTTCC 

AAGCCGTTCG 

AAAAGATAGA 

TGCCTCCTTC 

TCGTGCCGGG 

CAGTAGTAGC 

AATTGCCTTC 

CTGTCGTATC 

CGGTATGCGC 

AAGGGGTGGG 

ACCTCGTACC 

TGGATATTAC 

GACTCGGCAT 

CGGCCGAAGC 

CCAATCATAT 

CGGCACTCAT 

GTGTAGCTTC 

TGAAACTCGC 

TCCTGCACGA 

GCCTGAATTT 

TACGAAAGAT 

TAGAACTCCT 



TACTACGGAA 
CTCCACCAAA 
CATGAC6GTT 
CGCCGCTCAT 
AGGTATCCCC 
GTGTGCAAAC 
GATGCTCACA 
CTTITGCCGC 
TGCGGAGCGC 
CCCCGATGCC 
AGCTGCCCGA 
ATTCATTCCC 

ATTCT'rrrcA 

AGCCATGTAC 
GGGCGAAATA 
CGCAGTCCTG 
TACGATCAGG 
GGTAGTGACG 
TCGACGAATG 
GATTAGCGTA 
ACGGGACGGC 
GTTCGTTGGT 
CGTCCTCAAT 
TCCACAGGCC 
CTTCCCTTCT 
CGAAGGCTAT 
ACTGGCTCGT 
GGCTCGTGAG 
TGCCGAACGA 
GCTGATCGAT 



GGCCGTGCCG 
GGCAATCTGC 
GCCGACATGG 
CCTTTCGCCG 
GTAGTAAGAT 
TACGATACGG 
GGAGTGAATA 
ATATTGAAGC 
ATCGTTTTCT 
ATTATCACAA 
CGGATGGGCA 
GTAGGCGGGC 
CTCCGAAGCG 
CGATTGATGG 
GTCAGTCTGC 
AAAGATGCAG 
CTCAATCCCG 
CTCCCCGGCC 
ATGACACCAG 
CCCGAAGGCC 
ATCTCTATTA 
GCCATCCGTA 
TCGGGAGCCA 
TTTGTGCAGT 
GTCCGTTCGG 
CTGGATACGC 
CAGGCAGGTT 
CTATGGACTA 
TCTTGGGAAA 
GAGTCCGGAG 



CAGCTCGCGT 

AAGAGATCCA 

TTTCGTTTTG 

AAGAATTGCA 

ACGAGAGACA 

CTGCCGAGCG 

CGATCCCCAA 

GAGACGAATC 

TCGAACCGCA 

AAGAAAGCGG 

TCCGTATATA 

CTGTCGGTTT 

GATTCACTAC 

GGCTTGGCTC 

CCATAGCGGA 

GTGATGATCC 

AACATGAGGA 

TCGGTCrGGA 

AGGTACGCCG 

GAGAGGCTGC 

TCGGAACATC 

AGCAAGTGGG 

AGACTGAGCG 

ATGGCAATTT 

TAACGGTAGG 

ACAGTAAAAA 

GTTCGGAAGA 

TGCCGA6TGC 

CTTGCCGCCC 

CGA-n-CG-flT 



GCTGGATGAA 

6AGTAGCCAC 

TCGGAAAGAA 

CGCTTCAGTG 

ATACCCTCCA 

GATGCTTGGC 

GCTGGCTGCT 

GGTTGCTTTG 

TGCGGACGAG 

AGAGAGCGGT 

TGCCGTCGTA 

GAGACCGGCG 

CGGCACCACA 

TCTCGCCGAA 

AATTCGAGAG 

GGATGTGACC 

AGTCCGCTTC 

GGTCGGAGGT 

ACTCTATGCG 

TACCCAGACA 

GGGAGTCGTG 

TAT'l-GCCACC 

TTATGTAAAA 

CGTCGGCGAA 

AATCATGCTC 

GGTAGTGATG 

CATCCATGCC 

GGAGGACAGC 

ATCGGTACCA 

TCGTATCGGT 



(2) IWFORI^IATiai FOR SEQ ID HO: 150 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 
ID) TYPE: nucleic acid 

(C) STRAIIDEDNESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DtJA (genoiaiO 
(iii) HYPOTHETICAL: HO 
(iv) ANTI-SEIJSE: NO 

(vi) ORIGINAL SOURCE: 

(A) OKGAIIISI1: PORYPHYRaiOtJAS GTKGIVALIS 

(ix) FEATURE: 

I A) HAIIE/KEY: mlsc_Ceature 

(B) LOCATION 1...132 9 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO:150 

ATGTTGAGGA CTTTCCGAAT CGGTGGTATT CACCCCCCCG AAAACAAGTT 
AAGCCCGTAG AGGTGTTGCC TATCCCCTCA CAGGTAGTCA TCCCTCTTGG 
GGTGCACCGG CAACTGCCAC GGTCAAGAAA GGGGATGAAG TTAAGGTCGG 
GCTCAGGCCG GAGGATTCGT ATCAGCTAAT ATCCACrCAT CTGTGTCGGG 
AAGATCGATA ACGTATACGA CTCAAGCGCC TATCCCAAGC CCGCAGTr-FT 
GAAGGTGACG AATGGGAAGA GGGCATCGAT CGCTCACCAG CCATCGTCAA 
CTGGATGCAA AAGAAATCGT AGCCAAAATT TCTGCAGCCG GTATTGTGGG 
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GTCGGCAG3C 
TCAGCACATC 
GACTATCATT 
TAAGGTGCTG 
CATTAGCGTA 
AGAATGCAAT 
TCTTGGCGGT 
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SUBSTITUTE SHEET (Rule 26) (RO/AU) 



J L E 

# 



wo 99/29870 



PCT/AU98/01023 



102/490 



10 



15 



GCTACCTTCC 
ATCATCA^CG 
CACGGCGAAG 
GCCGTCATCG 
ACTGCATATC 
AAGCAGCTGA 
ACAGGTGCCG 
AACAAGCCTC 
AACCTCCTCC 
CCGGAGAATA 
CCGGATGTGC 
GTTCGCAAGC 
GGACTCAATC 
AAAGGCAACG 
CCrcCTCTGC 
CGTAAGCAA 



CTACCCATGT 
CCGTAGAGTG 
AGATCATGAT 
QAGTTGAGAA 
CGGGCATAGA 
TCGATGCAGT 
TAGTACAAAA 
TGQTCGAGCG 
TTCGTATAGG 
CGGGCAAGAT 
CTGTGACCAA 
CTATGCGCGA 
CGGCTTTCCT 
TGGTTGACTG 
TGGATTATAT 



GAAGCTGTCC 
CGAGCCTTAT 
CGGCGTGAGT 
TAATAAGAAA 
GGTAATGCCG 
GATCCGCAAG 
CGTGGGTACG 
CAT03TGACG 
TACTCCTATT 
CATCGGCGGA 
AGGCAGCTCC 
CTGTATCCGA 
TATGCGCGAC 
TATCGAATGC 
CCX3CCAAGCC 



CCTCCTCCGG 
CTGACGAGCG 
ATCCTGATGA 
GATGCTATTG 
TTGAAGGTGC 
CAGGTAAAAA 
GTATTCGUCG 
GTTACAGGAA 
GCGGCTTTGA 
GGTCCGATGA 
GGAGTATTGA 
TGCGCCAAGT 
ACCTTATATA 
GGTTCGTGCA 
AAGAAGACrC 



gcaacaaacc 
accatgtcct 
aagccattca 
ctcacctcac 
aatatcctca 
gcggtgcctt 
tgtacqaagc 
aaaaactgtc 
tcga;\gcagc 
tgggacgcgc 

rrCTCGATAG 
GCGTCXSGAGT 
AGAGCTGGGA 
GCTTCACCTG 
TGATGGGTAT 



TGAGATCCrG 
TATGCTGGAG 
GGTAAACAAG 
CAAACTGGCC 
AGGCGGTGAG 
GCCTATCAGC 
AGTACAGAAS 
rCGTCOTTCT 

agctggcttg 
tctgctgtca 
agaagaggca 
gtgtccgatg 
aacagcgga;>. 
tccggccaac 

CCAAAGAGCA 



460 
54 0 
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T80 
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(2) INFORMATION FOR SEQ ID NO: 151 

tl) SEQUENCE CHAKACTERISTICS: 

(A) 1£NGTH: 1431 base pairs 

(B) TYPE: micleic acid 
(CJ STRAMDEDHESS: double 
(Dl TOPOLOGY: circular 



30 



(ii) MOLECUr.E TYPE: DWA tgenomic) 
(iii) HYPOTHETICAL: NO 
(iv) A11TI-SF.NSE: NO 
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(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: tORYPHYROMONAS GIHGIV7VLIS 

( ix ) FEATURE : 

(A) NAME/KEY: misc_f mature 

(B) LOCATION 1...143'7 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151 



ATGAAAAGAA 
CAAAATGCTT 
GTTGTACTGA 
ATTGTTACGC 
GTCATTACAG 
TTGCCCCAAG 
TTTGCTTTTA 
CGTGAGGAGG 
TTTGATCCAC 
GAAGAAGTGG 
AAGGCAGATG 
TTTGTAAGCA 
TTTGCTTCTC 
AGACTCGCGG 
TTCGGCGGTG 
CGTGACCGCG 
CAGGCTCTGA 
CGTCGCAATA 
GAAATCATTA 
TCTTATCCTG 
CCTGAAAGTG 
CAACAGGCAA 
GGAGCTGCTT 
GTTGCAGAAG 



TACAACTAAC 
ACGAGfiGAGT 
ATGGTGCGGC 
CTATTCTTCG 
GCCGCAATAG 
CAAAACATGC 
CAGGAGAACA 
TACGAGGTTG 
TCTTCAATCC 
AAAAACAGCG 
TCCTTCCTGA 
CCGTTAAGGC 
CCGAGGCTTC 
AAGAATTGGT 
AAGATTGGAA 
TATTGGAGAT 
AGCAACTTTC 
CGATAACCAT 
AGACTGCTCC 
AGGGGCACCA 
TAACGGGCCG 
TTGCTCTGTT 
ATGCTCGTAC 
GAGATGCAAA 



TeiTATCGCT 
AATTTCATAT 
AGATATGAGT 
TTCAGAAGAT 
AACAAAAGCT 
AGCl'CAATAC 
TGCTTATGCA 
TGCTAAATCC 
GGCAGAGGCT 
AGAGTCCAGC 
GTATCGCAAC 
TAATCCAAAC 
AATAGCCCAC 
GCGTAAGTAT 
GGGGCTGAAA 
AATCAACTCC 
GTCTTATCGT 
GGGGTATATC 
GAAAGAACTT 
AGAGCGTTTG 
AATCAATTTG 
GAGTCCGATT 
GGGAGATTTT 
TGCGCAGCGC 



CTCTTCGCCG 
AAAATTTCGT 
AATTTAAAGC 
GGTACCAGCC 
CTCAAGCGTG 
ATTCGCCGTC 
TCATGGATGA 
CCTGTAGCTC 
CCTTATTTGT 
TTCGATGCTT 
AATAAGGCGG 
TATTCGGTCA 
AATAAGGCTT 
GGCAAAACAT 
CTQGCTATCG 
GATAAATATG 
TATATCTTGG 
GTTCGTGATT 
AGTGAGGCCG 
TTTGCTCTGA 
GCTGTAGCCG 
CAGACAGAAA 
GCTCGTGCCG 
AACCTCGATA 



CTGTTGCCGG 
TGGACAAAAC 
TCAAGAGCAC 
GGGTGGAATT 
AAATCGCATT 
ATAATGGGAA 
TGGATGCCAA 
TCTCGAGTAA 
TGGCACACAT 
ATATCAACTT 
AGTTAGAGAA 
ATAAAATGAT 
TGTCGGAGCG 
TGCCGAATAT 
AAAAGAGTGA 
CCGATGATGA 
ATCAGATCTA 
ATACCCTCGA 
AAATGTACCG 
ATACGACCCT 
CTTTTAATGG 
AGGGTGTAAG 
AAACCTTCTT 
TGCTGCTTGG 



TTTGGTCGCT 
CGGAAACAAG 
TCAGATGATC 
TCCTTCGGTA 
TAGTTCGGCT 
GAGCGAGCAG 
GTTTGTGGTT 
TATTGTTCCT 
TACTCCGGCA 
CAAAGTCAAT 
AATCAAAGAA 
CATCGAAGGG 
CCGTGCTAAA 
AACCACTGAA 
TATAGCCGAT 
TGCACGTGAA 
TCCGAATTTG 
AGAAGCTCGT 
TGTGGCAATG 
TAAGTATTTC 
TGGAGACGTT 
CAATATCCTT 
CCGTAAGGCC 
CAAAAAG 



60 
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(2) INFORMATION FOR SEQ ID NO; 152 

fil SEQUENCE CHARACTERT5JTICS: 

(A) LENGTH: 1149 base pairs 

(B) TYPE: r\ucletc acid 
(CI STRANDEDNESS: double 
(D) TOPOLOGY: circular 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



'fl"^! .f=Hl «J( 
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(li) IfOLECULE TYPE: DMA (genomic) 
(ill) HYPOTHETICAl-: NO 
liv) Aim-SEWSE: NO 

<vij ORIGINAL SOURCE: 

(A) ORGAHISH: PORYPHYROMONAS GIHGIVALIS 
(Ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATIOM 1. . .114 9 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 152 



ATGGCAGAAA 

GAACTGAAAA 

6ACAAGGAGG 

CCGCAGAAGC 

GGAGGTTTCA 

TTCGGTGGGT 

GTTCGCAGAG 

GGTGTGGAGA 

GGCACGGAAG 

ACACGTGTGA 

CACGGAGAGS 

ATCGGCGAAG 

TCCGTGAACG 

GTGATCGCCG 

CTGCTTATAT 

GGACGAGCCA 

AATAAGGGGT 

GTCTATATCC 

TCGGACAGCr 

ATGCTCGAT 



AAAGAGACTA 

AAGCATATCG 

CCGAAGAGCA 

GCAGTCAATA 

GCGGAGGCGG 

TCGGCGGTTT 

GGTCTGACCT 

AGAAAGTGAA 

AAGCCAATGG 

GCAACACTTT 

GTGAGATCAT 

AGGTGATCTC 

GCAA6GGAAA 

AGGAACCGGA 

CCGTTCCGTT 

AGATCCGCAT 

TGCCCAGCGT 

CCGAATCGAT 

TCAAACCTAC 



TTACGAAGTC 

CAAGAAGGCT 

CTTCAAAGAG 

TGACCAGTTC 

TATGTCCATG 

CCGCGGATTC 

GCGAGTACGA 

GGTAAAAAAG 

CAAGACTACC 

CCTTGGGGCC 

CACGAAGCCA 

ATTCCACATC 

TGCCGCGCCC 

TCCGAATCTG 

GGCTATAAAA 

CGAGGCGGGG 

AAACGGCTAT 

CGATGCCAAA 

CGATGCTGCT 



CTCGGTGTAT 
ATCCAATACC 
GTAGCTGAAG 
GGCCATGCCG 
GAGGATATTT 
TCCGATATGG 
GTC5AAGCTTT 
CAGGTAGTGT 
TGCCAGACCT 
ATGCAGACCC 
TGCTCCAAGT 
CCTGCCGCTG 
CGAGGAGGCG 
ATCCGCAATG 
GGAGGTAGTG 
ACACAACCCG 
GGCATGGGAG 
GATGAGCAGG 
CGTAAGGATA 



CGAAGAATGC 
ATCCTGATAA 
CCTACGACGT 
GATTGGGCGG 
TCAGTCGCTT 
GCGGTGGCAG 
CTTTGGCCGA 
GCAGCAAATG 
GCCATGGAAC 
AGAGCACTTG 
GTAAGGGCGA 
TAGCCGAAGG 
TGAATGGCGA 
GCAACGATCT 
TGGAAGTGCC 
GCAAGATGCT 
ACCAACTGGT 
CTATCGCAGC 
TAGACAAGAA 



CACCGACGAT 
GAACCCCGGT 
ATTGAGC6AT 
AGCTGCCGGT 
CGGTGATCTA 
TCGCAGACGT 
TATAAGTAAA 
TCGTGGCGAT 
CGGCGTGGTT 
TCCCACTTGC 
AGGTGTG6AG 
AATGCAAATG 
CTTGATAGTC 
GATATACAAT 
GACGATAGAC 
GCGTTTGCGC 
GAATGTCAAT 
GATGGAAAAC 
ATACAGAGAG 



(2) INFORMATION FOR SEQ ID NO:153 

(i) SEQUENCE CHARACl'ERISTICS: 

(A) LENGTH: 87 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii» MOLECULE TYPE: DNA (genomic) 

<lli) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 
(D) LOCATION 1. . .87 9 



(Xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 153 



A^i^^^^^Sn TGATTTTAGC GACTTTGGGA CTTATGGCCA TTGCCATGCT 
AGCAACAACA AGGATTTGGA GAACAAAGGG GAGGCTACTC TTTTGGTAAC 
TCCTATAAAG CTCCACGCGA AACCTATGCG AAGATTGAGA AGACtStgc 
^^ATAAGCTG GACATACACG V^Smta T^J^ 

cagggtattt atatcgatgc tccggatgag gctttggaga aattggctcg 

AAGAAGATCA ATGTACAGAG TCTTCATGI-G ATTCCCGGCC GAgJ^S 
GACTTTG-rCA ATAAGTTTAA ggcagcacat AGTGATATTA CTGTGAAGGT 
CTTTTCGATA CCGATGAAGA tatgcgcgac gtggcagaga tcttg^caa 
S^fSJ^ agaaaggtga agctattgta ttcatgggaS aS^^ 

ATGCCCGTAT CAATAAGATC ATGAAGAACT ATAGCAAGTT 
GOAACCGTCG AGTCCGATCC CTCTATCAAT GATGTTATTG CCGAACTGAA 

^S^^f^ IS^f^^^^ gccgctgatg agtgtggcag gcgacS?^ 
S^^^?^"^ atgaggacga tagctggaag acgttgctga ccaatgccgc 

IS-A^^S^C ?^rS^?^ <^CAATTTC TCAGCTCTTG GAGATATAGA 

AATATCIltGC tcaagcatat gaaagccacc tctgctcgc 



ctcatgttca 
gtitggtagc 

CGCACCTTAT 
ACTGGCTCAG 
TCTGGGTTAT 
TGAGATGATC 
AGGGGCTCCG 
GCGTTTTCAG 
GCATGCTGCC 
CATGATCGTC 
AGAAACCGGT 
TACGAATGAT 
CTACACAGTT 
AGAGATCCGG 



€0 
120 
180 
240 
300 
360 
420 
460 
540 
600 
660 
720 
780 
840 
900 
960 
1020 

loeo 

1140 
1149 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
879 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



10 



WO 99/29870 PCr/AU98/01023 

104/ 490 



(2) INFORllATIOH FOR SEQ ID NO: 154 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRAMOEDNESS: double 
<D» TOPOLOGY: circular 



(iii MOLECULE TYPE: DNA (genomic) 

fiii> HYPOTHETICAL: NO 

15 (iv) ANTI-SEIJSE: NO 

tvi) ORIGINAL SOURCE: 

fA) OR(3ANISM: PORYPHYRO^^O^^AS GINGIVALIS 

-20 (ixl FEATURE: 

(A) NAME/ KEY: it\i2c_fftaturs 
^ <BJ LOCATION 1...106B 



25 



45 



55 



(y.i) SEQUENCE DESCRIPTION: SKQ ID 110:154 



ATGACATCCG TCAGCCACTT ACCJTACAATT TCTGTCGCAG GTATCCTG«5C TGCGCTGGGA 60 

GGGGCTGTAC TCATTCTCTT CGGGGTTAAT CTCTTCCTCG GCTCGGTGGC TATTCCGATG 120 

AGCGAGATCT TCCGACATCT TTTTTCAGA'J' CGTCCCGAAG GAGGAGAAGC ACTCGTGCAC 18 0 

TACAATATCC TATGGAAATC CCGCCTCCCC GAAGCCCTCA CGGCTGCTTT TGCCGGCGCA 24 0 

30 GGTTTATCCG TTAGTGGCTT GCAGATGCAG ACCGTCTTTC GCAATCCTTT GGCCGGTCCG 300 

TCCGTTCTCG GCATCAGCTC CGGTGCCAGT TTGGGTGTTG CTTTGGTCGT TCTGCTGAGC 360 

GGCTCGCTGG GAGGAGTGGC ATTGAGTAGC CTGGGTTATA TGGGCGAGGT GGCCATGAAT 420 

ATAGCCGCTG CCGTAGGCTC GCTGGCAGTA ATGGGGCTGA TCGTTTTTGT CAGCACCAAG 4 80 

GTGCGCAGCC ACGTTACGCT GCTCATTATC GGCGTTATGA TCGGATATGT AGCCACTGCC 540 

35 GTCATCGGGG TATTCAAGTT TTTCAGTATC GAAGAAGATA TTCGGGCATA CGTAATTTGG 600 

GGGTTGGGCA GCTTTTCCCG TGCCACGGAT TCGCAACTGA GTTTCTTTGC CATTCTGATG 660 

TTGATCTTTA TTCCGGCCGG TATGCTCCTT GTCAAGCAGT TGAATCTCTT ATTGCTGGGA 720 

GAAAGCTACG CACGTAATCT GGGACTGAAT ACTCGTCGGG CACGGCTGCT CGTGATCTCT 780 

TCCGCCGGTT TGCTCATC6C TACCGTCACG GCCTATTGCG GTCCCATCGG CTl'irrrGGGG 84 0 

40 ATGGCTGTGC CACACTTGGC ACGGGTTATC TTTCACACAT CGGATCATCG GATCCTGATG 900 

CCTGCTACCT GTTTGATTGG AAGTGCTCTG GCTCTTTTCT GCAATATCAT TGCTCGTATG 960 

CCGGGGTTTG AGGGGGCTTT GCCCGTCAAT TCCGTAACGG CTTTGGTGGG AGCACCTATT 1020 

ATCGTCACCG TTTTGTTCCG GCGCAGACGC TTCAAGGAAG AAACCGAC 1068 



i?) IHFORflATION FOR SEQ ID NO: 155 



ti) SEQUENCE CHARACTLRISTICS : 

(A) LENGTH: 2271 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRAJ-fDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iiii HYPOTHETICAL: NO 
liv) ANT I -SENSE: HO 



60 Ivi) ORIGINAL SOURCE: 

(A) ORGAFJISM: PORYPHYRCMOMAS GINGIVALIS 

(Ixj FEATURE: 

(A) NAME/KEY: misc_feature 
65 (B) LOCATION 1...2271 

(Ki) SEQUENCE DESCRIPTION: SEQ ID NO: 155 

ATGCGGACAA AAACTATCTT TT1TGCGATT ATCTCTTTTA TTGCTCTATT Gl'CGTCTTCT 60 

70 CTGTCGGCTC AGAGCAAAGC CCTTTTAACC GGTAGTGTGT CGGATGCCGA AACCGGAGAG 120 

CCTCTTGCCG GTGCTCGAAT CGAAGTCAAA CACACCAACA TAGTAGCCGG TGCCGATGCC 180 

GGCGGACATT TCGAGATCAA GAACCTGCCG GCAGGGCAGC ATACTATTAT ATGTTCGTTG 24 0 

GGGGGGTATG GACAGAAAGA GGAGGTGGTT GCCATCGAAG CCGGACAGAC CAAAACGATC 300 

rCTTTTGCAT TGCGACTGCG AACGAACAAC TTGGAGGAAG TCGTCGTTAC CGGTACCGGT 350 

75 ACACGTTACC GCTTGGTCGA TGCTCCTGTG GCAACGGAAG TCCTTACCGC TAAGGACATA 4 20 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



tP^i ti^^^ ^( "^r ^fs«, 

lU^i 



\Ui O CJi*ft«S'^S 



WO99/29S70 



PCT/AU98AH023 



105/490 



10 



15 



20 



25 



30 



GCCTCTTTCT 
TTCGtX.*CCCA 
CTCATCCTTA 
CGTATTTCTC 
TACGGATCCG 
CTGAGTGCAT 
CTCGATATAA 
G6CTGGCAGA 
GAG6AAACGT 
TCCTATTATG 
ATCTTCACTC 
GCTTCACTCG 
GTCTACGATC 
AYCAACAACC 
AACGATCAGA 
TVAACTGACCG 
ACCGACAAGG 
GATTGGTTCA 
ATGACGCCTA 
GCTAACGGCT 
ATGGGTTCGC 
TATGCTTTGG 
AATCAACrrC 
CAGGGAATCA 
GATGTCCTAT 
GTGGAAGCTA 
AATGTCCACG 
GGCCGTATTC 
CGACTCGCCA 
GGTATAGACA 
GTAACGCCGG 



CGGCTCCTAC 
ATCTGATGGG 
TCGATGCTAA 
CTGATCAGAT 
ATGCCATCGC 
ATACGTCACA 
ACATCGGTAA 
ATAGTCCXiTT 
ATAAGAAAAC 
CAACTAACAA 
C6ACTTTTTC 
GTACGAACTA 
GCTTCCGTTT 
AAGGTCAGAC 
TCCGATACAC 
GCGGTTTGGA 
CACATGCTTC 
ATATGACAGC 
AGGTATCCAT 
ATAAGACTCC 
ACAATCTCTA 
GCTTGGAGTA 
GCAATCTGAT 
AGAAAACCAA 
GTGATGCCTC 
AGAATCTCCA 
CCGATTGGGT 
AGAGCOAGCXS 
CATCGC3\TCG 
ACCTGrrTGA 

GACGTAcrrr 



TTCCGAGGCC 
CTCTTTCATG 
GCGTGTGTAC 
CGAACGGATC 
CGGGGTAATC 
TCGCATATCG 
GTTCAGTAGC 
CGAAATAAAA 
TTTTCGTGCA 
TCTTAGCTTC 
CGAAAAGAAG 
TCITTTCCCC 
CGGATATTTG 
CGAGCAACCX; 
GGC7VGAGGCT 
GTATTTCCGT 
CACGCTCTCT 
CGGrTTCCOT 
ACTCGCCAAG 
CACGCTGAAA 
TCTCGGCAAT 
CAATCAAGGC 
CTCCTTTATG 
GCAGTATGCC 
TATCXSGTl'GG 
GACGGATGAG 
TCACTACTGG 
TTACTACAAA 
TTTCGCTCAT 
CTACGTGGAT 
CTTTGCTCAA 



TTATTGCAGG 
CAGCTGAACG 
GGCGATGTAG 
GAACTGGTGA 
AATGTGATCA 
AAGTACAACG 
AATACCAACT 
AAGAAAAAAG 
CAGOAAAATC 
AGCGGAAATG 
GCCTATGACA 
AATGGTCTGC 
TATCATGACA 
ACATTCTTTC 
CGCGGTGTAT 
GAGGAATTGA 
GCTTATGTAC 
CTGOTACACC 
TATGGGCCGC 
GAGCTTTTTG 
GCGGATCTTA 
CCTATCTCGT 
GATATACCGA 
AACATAGGAA 
GGTATCAAGT 
TGGCl'GGAAG 
GGTCAGTATA 
GACGGCAATG 
TTCCGCCACA 
GATCGl'CCTA 
ATAGCGATTC 



GGCTGAGTCX: 
GCCTTAGCAG 
GCGGTCACGC 
AAGGTGCTTC 
CAAAAAAGAA 
ATCGGCAAAC 
ATTTCTTCTA 
GATCCGGCGA 
AGGGTGTAAG 
TGCAGTACAA 
TGGATTATCG 
ATACGCTTTC 
AGGACAGCAG 
CGGGTCAGCT 
TTACACTGCC 
TCTCTCCCTA 
AAGATGAATG 
ATCAGGAGTT 
TGAACTTCCG 
CACGGAACGA 
AGCCACAGAT 
TCAGTGCAAC 
CCTCACCC6A 
AAGCTCGCAG 
TAGGAGCCGG 
GAGCTGGACG 
GACTTGGCGT 
CTCCCGACTA 
TCATCCTGOA 
TG-SGTGTCAA 
GATTCAACAA 



GTCTTITGAC 
TAAGTATATC 
CGATTTGAGT 
GAGTTCGCTC 
TACGAATOiA 
CAATACTTCG 
CCATACGGAT 
ACCGGTCTTG 
CCAATCGCTT 
TAAACGTCAG 
TGCTTTGACG 
TTTCGATGCC 
TGAGAGCCl'G 
ACGCT^TAAA 
TTATGCGCAG 
TAATTTGATT 
GAAACCGCTC 
CGGTACACGA 
CGCTACGTAT 
ACTCACCACT 
GTCGGATTAT 
GGTTTATGAC 
GCACGAAGCT 
CCGCGGCCTT 
ATACAGCCTC 
TCATCGTGCC 
GAGCCTTTTC 
TACCTTGTGG 
TGGAACGCl'C 
TTATGCTACC 
C 



480 
540 
600 
660 
720 
7 80 
840 
900 
960 
1020 
1080 
1X40 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1670 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
227 5 



35 



40 



(2) IHFORHATION FOR SEQ ID NO: 156 

(i) SEQULUCE CKARACTEFaSTICS: 

<A) LEIIGTH: 993 bass pairs 
(B> TYPE: nucleic acid 

(C) STRAl^DEDNESS: double 

(D) TOPOLCX3Y: circular 



45 



(ii» HOLECULE TYPE: DNA (genomic) 
tiii» HYPOTHETICAL: NO 
(iv) AllTI-SENSF.: NO 



50 



55 



(vi} ORIGINAL SOURCE: 

(A) ORGA/IISM: PORYPHYROMaJAS GINGIVALIS 

(ixl FEATURE: 

{AJ NA^iE/KEY: iidsc_f eatuc^ 
\B) LOCATION 1. . , 993 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 156 



60 



65 



70 



ATGACGGACA 
ATCGCAGTGG 
GGACAGATAG 
CTTAGGGTAT 
CCCGACGTAG 
AACGCCAAGG 
CAGAAAGCTC 
TATGACCAGG 
GCCATCGCTA 
CGCGAAGACA 
GTGGAGTCGT 
ATATTCCCCA 
ATGOGCGATA 
GGAGCCGTTC 
ACATTCATCA 
TACGACCTGA 
CTACGCCCGG 



ACAAACAACG 
TGACGATCGT 
AAGTGACCGA 
CCGAGGGACA 
CGGCTAAGAT 
CTCTCAAAGG 
AGGCCGGCGT 
GAGTGGTACC 
CGCAAAAAGC 
AGCTGGCAGC 
ACATCAACGA 
AAGCCGGCGA 
TGTGGGCCAG 
TGGAGACTGT 
AGAACATGGG 
AGACCTTCGA 
GTATGTCCGT 



TAATATCGTA 
TGGTTTTTTC 
ATACCGAGTG 
GCAGGTGCAC 
GGAGCAGGCA 
AGCACGCAGC 
AGCCATAGCG 
GGCTCAGAAG 
GGCCGAAGCC 
TTCTGCCCTC 
AACCTACCrC 
ACTCGTAGGT 
C7TTGCCGTT 
GGTGCCGGCT 
TACCTATGCT 
GGTAAAGGCC 
GATCATACGC 



TTCCCGGCGT 
ATGCTCAGAC 
TCCAGCAAAG 
GCCGGCGATA 
AAGGCTGCCG 
GAACAGATAC 
ACCAAGACAC 
TTGGACGAAG 
CAGTACAATA 
GTCGATAGAG 
ATCGCCCCAC 
ACCGGCGCAC 
CGTGAGGATT 
CTGAATGAAG 
GCCTGGAAAG 
ACCCTTGCGG 
AAG 



TTCTCCTCTT 
CGGCCGAGGA 
TGCCCGGGCG 
CCCTCGCTGT 
AAQCAGCTGC 
AGGCAGCCTA 
ACCAGCGCGT 
CCACTGCCCA 
TGGCTCGCAA 
CGAGAGGAGC 
GGGCAGGCGA 
CTATCATGAA 
TCCTCAGCAG 
AAAAAGTACG 
CGACCAAGAC 
ATAAAGACAA 



GCTGGGAGTC 
GATTATCCAA 
CATCAAGGAA 
CATCGAAGCC 
ACAGGCrCAG 
TGAGATGTGG 
GCAGAACCTC 
GCGCGATGCG 
CGGTGCCCAA 
CGTCGCCGAG 
AGTGTCGGAG 
TATCGCCGAU 
CATGACCATC 
CTTCAAGATC 
AACAGGGCAG 
GGCACAAAAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
9O0 
960 
993 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



tf»Hi iP^i W"' ar^i! ,Ji ~=ii i?=h .rtf .tfJ' -"^j «=n) 

^Ui ""rs? iCiJ .Ji„ ii::; ii3 -icsi ill ' US cli iin:: q u -ilj 



wo 99/29870 PCT/AU98A)1023 

106/490 

(2 J lUrORMATIOH FOR SEQ ID NO: 157 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 
{B) TYPE: nucleic acid 

(C) STRAIJOEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(Iv) ANT I -SENSE: NO 
15 tvl) ORIGINAL SOURCE: 



10 



20 



60 



(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAI^E/KEY: misc feature 

(B) LOCATION I. . . 801 



(Xi) SF.QUENCE DESCRIPTION: SEQ ID NO: 157 

ATGCGTATTG TCAGTAATTT TTTGTTCGTC TCTTTTTCGG TTTrGCTTTT TGCATCATGC 
S^J^???' GAGAAAAGGT CGTTTACCTG CAAGATATCC A^^^^ TC^^^ 
AGT^^ SrAr™ JJ^TTGAG AAGGACGATG TGCTGAACAT C^^Tc 
nnJit^r^ ftT^^^Z^ AACGCCCTAC AACCAAGTGT TGACCACTCG TGCACTGGCC 
CbCAACGGCT ATGGAACGAA CTCGAACGAA GGCTTCCTGG TCGATTCGAA AGGGTACATC 
30 ^Tirln^.: T^^'^«^^^ GATCTATGTA GAGC^cS^A CT^^^^ ACTGGA^^ 
^Snlnf^SS S^^^^^-^AT Tl'CCAGTGGA TTTATCAAGG ATCCTACGGT AACGGTGCAG 
^Ir?^^^ TCAAGGTGTC GGTTTTGGGA GAGGTGAATC ATCCGGGTTC GATGTC^A 
^C^S^ l^l^^^^ TTTGGAAGCG ATCGGAATGG CCGGAGACCT GACAATCTAT 
AC^A?^^ rlanJ^^.^ ^^^^'^^^'"'"^ GAAACCGATG GGCATCGCGA GGTTTTCCAG 
GT^^^^r. r^^n^ti ^L^S??^ ^^5eS?S?!?T ACTATCTGCA TCAOAACGAC 



60 
120 
180 
240 
300 
360 
420 
480 
540 

35 GTrlT^i^ ^rrCiCTCGCA AGCCCCCTGT ACTATCTGCA TCAGAACGAC ^60 

SStSa^p It^^^^^ CGACAAGAAA ACACAGATGA GCGAGATCAA CCAGAATAAT 720 
^^?^AG Irl^^cWl r^^''''^^^ ACTTTGGTAT CCATTTCCAC GCTGACGATT 780 



40 (2) INFORf-lATION FOR SEQ ID NO:lS8 

(i) SEQUENCE CHARACTERISTTCS: 

(A) LENGTH: 1707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNF.SS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

50 (iii) HYPOTHETICAL: NO 

(iv) A»4TI -SENSE: NO 

ORIGINAL SOURCE: 

ORGATUSM: PORYPHYROMONAS GINGIVAI.TS 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIOH 1, . .1707 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:158 



J^S^^^^^i CCAATCTGTT TTTATCTCTG CTGGTGATCT TTATCACCGG TAGTTTTATG 60 
65 ^^S^S^"^ CACAGAAGTC CAAGACGAAC AAACTCACCG AAGAAGATCG GA^CCGCAAT ill 
AGTCGATGGA TGTGCTTAGC AATATTATCG GTAACGTCAG GC^GTATT?? \ll 

A^r^^J^^ S^IT^'^^ GAGGAAATGG ATGAACTGAA ATTGATGACT 300 
^S^X^ ATGCCGGAGT CGGAGCTATC ATATCGCAGC GCCCGGATAG TGCTGTGATT 360 

70 A^CctSctS ™™ TATGCCCGCA GACGAAGCAG GATTGATAGC AgSS^^ 
CTG^I^SS^ l^lr?i^ CS^'''^^ AAATCCACCA CACCGAAAGT AAGCCAAGCA 
SS™?SS^ ^AGCCGGTAC TGTTGCAAAG GTGACAGTAA TGCGCTATGG CGAAACCAAA 
SS^^^ TTTCCGTG.AA ACGTCAAAAA GTGATTATGA ATTCCGTCAC TTACaIS^ 
SIgS™ ATATATCCGC TTGAACAACT TTACGGiSi IIS^S 

75 CCGCCn-GTT GGATCTTCGT GACAAACAAG GAGCGAAAGG TCTCATTTTG 720 

75 GATTTAAGAG GCAATGGTGG CGGACTGATG CAGGCTGCTA TCGAGATAGT ?S?ctg??c 78^ 



420 
480 

540 
600 
660 



SUBSTITUTIE SHEET (Rule 26) (RO/AU) 



™!i tT-H: tV^ii m-^n 



Q '^^ O JL £ O O n, O O ^ O ^Ui o 



wo 99/29870 



PCT/AU98/01023 



107/ 490 



10 



15 



GTCCCTAAGG 
TTTCGOVCAT 
CAATCGGCAT 
CTGATCGGAC 
AACCGCGTGA 
CGTTTGGACT 
CACAAAATCT 
(iACATCGAGG 
GACGTTTTCG 
6ATTTTTCCA 
TTTGACTATG 
GAAGGCTACC 
AACCTGTCGC 
ATTGTCACTC 
6TAGTCAAAG 
GCAGCTCCGA 



GCAAAGAGGT 
TGACTGAACC 
CTTCCTCGGA 
AAAAGAGCTA 
TCAAATTGAC 
ACAGCCGCyVC 
TTTACACTGC 
TCAAACAAGA 
ATTTCGTCAC 
TAACGAACGA 
ATCGCCAGAG 
TGCCGGAAGC 
GTGATCTGCT 
GCTATTATTA 
AAGCTATTAA 
AAGCAGAGAA 



GGTAACGACC 
GATCGACACG 
GATTGTAGCC 
T6GCAAAGGG 
TACGCCCAAG 
CAATCGGACA 
TGCCGGAAGA 
TACAGCrCCG 
ACGTTATGTG 
GGACTATGCA 
TGGCAAGATG 
CAACTCGGAG 
ACGATTCAAA 
TGAGCGAGGC 
GCTGCTGAAG 
TAAAGGG 



AAAGGTCGCA 
AAACTCCCGA 
GGAGCACTGC 
CTTGTACAAA 
TACTACATCC 
GGTATGGCAA 
CGTGTAGAAG 
ACATTACTTT 
CTCAAGCATA 

GcrrrcTGCA 

CTTGACAAAC 
CTTMAGCAC 
AAGGAGATAA 
AGTATCaSCC 
GACCATCCGG 



TTGCAGAGTC 
TAGTAGTCCT 
AGGATATGGA 
CGACTCGTCA 
CAAGCGGAC6 
CGGCCATTCC 
ATGCAG6AGG 
ATTATATGGC 
AAACGATTCC 
AGATGATGGA 
TGGAGGAACT 
TACGCGAAAA 
OWVCTATCT 
AGAGTTTGCC 
AACAAATTCG 



GGCGTCCGTA 
GATCGATGOA 
CAGGGCTGTA 
GCTACCATAC 
TTGTATTCA6 
TGACAGTCTG 
AATCCTGCCT 
CATCAAtAAT 
CAAGCCGGJIG 
AGAAAAGAAA 
GGCTAAGATA 
GCTAAAACCC 
CAACAATGAG 
GGAAGATAAG 
ACAGATCCTT 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1300 
1440 
1500 
1560 
1620 
1680 
1707 



20 



25 



(2) IHFORTIATION FOR SEQ ID HOtl59 

(1) SEQUENCE CHARACTERISTICS; 

(A) LEHGTH: 2943 base patrs 

(B) TYPF.: nucleic acid 

(C) STKATtDEONESS: double 
{□) TOPOLOGl': circular 



Ui) MOLECULE TYPE: DMA (genomic) 



30 



(111) HYPOl'HETICAL: NO 
(iv) AJ?TI-SRNSE: HO 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGWUSM: PCRYPHYRCMOilAS GXNGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATIOII I. . .2943 

(xi) SEQUENCE DESCRIPTIOH: SEO ID NO: 159 



ATGCAAAACA 
TACCTGTCAT 
GTTGCC6GAA 
CfGAAAGAAG 
GTTATCTTGA 
CCCAACTTCA 
GATATTTTCG 
GGTTCGGGTS 
CTGCfCAAAG 
ATCGATGCTT 
CTTGTCGAAC 
GCCAACCTAC 
GCCAATGATC 
CCTGCAACTA 
GATTCTGCTA 
TCTCTGCTTA 
ATGGCTCAOA 
GTGCTTTTCC 
CTCTACGAAC 
GTAGTGACTT 
TCGATGACGA 
GGACGCGCAA 
GAGATCACGG 
CTTGCCAACG 
GTGATTGGTC 
GCTTTGGTTA 
GCAAACGGCG 
GCCSTGCTGA 
GCCAACGTAC 
CGTGCCGTTA 
ACTATTATTA 
ACTACGTTGA 
GTCrrCGAGA 
ACTCGCAATC 



AAGGATTTGT 
TCTCTTTCGT 
TGGCCTATCT 
crCAAGCCCA 
AACTTAACGC 
ACAAAGCTCT 
TGAAGGAATA 
ACCTTCGCGA 
AAAAATATAA 
TCGGTGTGGT 
TCCCCGGAGT 
AGTTCTGGCG 
GTCTGAGCGA 
CTGACl'CTGT 
CTGTAGCACA 
CTCCCGTGAA 
TATCTGAAAT 
TCTGGGGTGC 
TCTATGCTAT 
CCGCCAAGAG 
TGAATGAAGA 
TCGCTATCGT 
GCGGTCGCTC 
TACTCAACTC 
CTACGCTGGG 
TCCTGATGTG 
CATTGATTGT 
CCCTCTCGGG 
TTATCTTCGA 
CGGATGGTTA 
CCGGTATCAT 
TTATCGGTCT 
AACTGGC6AA 
TCCTTGTCAA 



GATTGTTATC 
TACGAACCGT 
TGATTCCATG 
GCAAATTGGT 
AAGCGATCTG 
GGAGAATGCT 
TCGCAAGCTC 
CCAGATTACC 
TAGTGCTGTA 
T6CACCTAAT 
GAAAGACCCT 
TACATACAAA 
ATTGGCTATG 
AGCTGCAACA 
AAAAGAG6CC 
TCGTGGCGGT 
GCTCCAGCAA 
TAAAGCAATC 
TCGTACCAAT 
TGATATCCAA 
AGGTGCTCGT 
TTTGGATGGT 
TCAGATCTCC 
CGGTAAAATG 
TGCCGAfi-l'CC 
TTACATGTGT 
AAACAGCTTC 
TATCGCAGGT 
GOGTATCAAA 
TGGCAACGCT 
CCTATTCCTC 
TATCGCTTCT 
AAAAGGTCGT 
TCCCTCATAC 



ACATOGGCTC 
TACGAAAAGA 
TCGAATGAGA 
CTTGGCCTTG 
CrrCGTAACC 
GCCAAGAGCA 
GATCCCAACG 
GCAAAGTCTA 
GAAGCTTCGT 
TTGCAGCGAT 
GAGCGTGTTC 
TTCGAAGAGG 
AACAACACGQ 
GCCGATTCTG 
AAGGATGCTA 
GCAGTAGTGG 
GCTCACGATC 
GAAGACCCCG 
C6TACGGGAG 
AATGACTTCG 
AAATGGGCGC 
GTGGTTTATT 
CGGCACTTCA 
GATGCTACGG 
ATTAAAGCCG 
CTGGCTTACG 
TTCACATTGG 
TTG6TGCTQA 
GAAGACCTTC 
TTCTCTGCCA 
TACGGGACGG 
TTCATTACGG 
TTGGATAAGA 
AACATCTTGG 



TG6CCATCAT 
AGGCTAAGGC 
AGGTCTGGTT 
ACTTAAAGGG 
TCTCTAACAA 
CGGAGCAATC 
6TCGCTTGGC 
CGGATGCAGA 
TCAATGTGCT 
TGGAAGGGCA 
GTACCCTTTT 
TCAGCGGAGA 
ATGCTACCCC 
CTGCTCTACA 
CTCGTAAAOA 
GT6TGGCTC6 
TGAAGGTTAC 
AAACCAAAAA 
ATCCTGATTT 
GTCGTTCCGA 
GTATCACAAA 
CTGCTCC6AA 
CCGTGGAGGA 
TAAGCATCGA 
GATTCTTGTC 
GTTTCTTGCC 
GCGTATTGGC 
CGCTGGGTAT 
GTGCCGGTAA 
TCTTCGACTC 
GGCCGATTCG 
CT6TCTTCTT 
TTACATTCAC 
GTAAGCGCAA 



CT6TGCGTTT 
GATGGGCCAT 
CGGCTACACG 
GGGTATGAAC 
AAGTTTG6AT 
C6ACTTCATC 
CGTTATCTTC 
CGTAGTGCGT 
CCGTGCTCGT 
AGGGCGTATC 
GCAACGCAGT 
CTTGATCGCT 
GGAAACAGAG 
AGCTGTAGCr 
CGCACTCTTC 
TCGTGCTAAT 
ACGTGAAGAT 
G6AGACCGAC 
GGGAGGTGAT 
ACCGATCGTT 
GQATAACGTG 
CCTGAATGAT 
GGCCGGTGAC 
ACAGGAAAAC 
GTTCCTGCTC 
GCGTCTTATC 
TTCTTTCCAT 
GGCTCTGGAT 
GACTCC6ATT 
GAACGrrACG 
CGGTTTTGCC 
GACTCGTATC 
TACGAGCATT 
GACCGGCTTT 



60 
120 
IflO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
16B0 
1740 
1000 
1860 
1920 
1980 
2040 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



iL.P 3 Oi ™iL i£:^ iE:I> m IL-fi ii«fi li:"- 'iZZh %Jl^ 



wo 99/29870 



PCT/AU98/01023 



108/490 



10 



15 



ATCATTCCGG 
AGGGGTATTG 
TCCGAAGCCG 
ATCGGTACTG 
GAAGAAACTG 
CAGCAGCCTA 
AGTATGTCGA 
ATGGCCATTT 
TCTGTGGCCG 
CCCTTCACCA 
TCGCTCAATG 
AACAGAGATC 
AATACGTCTT 
ATGCGTAGTT 
TTTGTTGCTA 
AAA 



TGATTATCAT 
AATTCTCCGG 
TTCGTTCGGC 
AAGGGACAGA 
AACCAGAGAT 
CTGCTGATCA 
GTGACATCAC 
ATATTCTGAT 
CTACTACATT 
TGGAGATCGA 
ACACCGTGGT 
GCTATCAGGT 
TCACTACGTT 
TCACGTTCTC 
CACCCCTTGC 



CGTTTTGGGA 
AGGACGTAAC 
CTTGTCTTCT 
GGTGCGTATA 
TACTGACAAA 
GTTCTTGGAC 
GAGAGGTGCT 
TCGCTTCCGT 
CTGCATTATT 
TCAGAACTTC 
TGTATTTGAC 
GATCAACGAT 
TATCGTTATG 
GATCCTGCrC 
CTACGAGATC 



CTTATAGCTTT 
TACGTAGTTA 
CXCCTGCAGG 
TCTACGAACT 
TTGTATCAGA 
AATATCATTA 
ATTTGGGCTG 
GACATTTCTT 
GCTCTGTATG 
ATCGCTGCTA 
CGTATCCGAG 
GCCCTTAATT 
TTGGTAATCT 
GGTATCGTTA 
CAAAAGCGCA 



CATTTACAAT 
AATTCGACCA 
AAAAGGTATT 
ATAAGATCCA 
GCCTGAAAGG 
GCTCTCAGAA 
TGCTGTTATC 
TCTCTGCCGG 
CGTTGCTGTG 
TTCTGGCTAT 
AGACGATGAA 
CAACATTGGG 
TCATCTTTGG 
TCGGTACATA 
AGCTCAACAA 



CGGTCrCAAT 
GCCTGTATCT 
GGTTACCTCC 
GGAGGAAAGC 
TTTCTACACC 
AGTAAGTCCX: 
GATGATCTTC 
GGTATTCGTA 
GAAGATTCTG 
CATCGGTTAC 
ATTGTACCCC 
TCGAACATTA 
AGGTGCTACG 
CTCTACGCTC 
AGCAGCTAAG 



2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2S80 
2640 
2700 
2760 
2820 
2880 
2940 
2943 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) INFOWIATIOH FOR SEQ ID HO: 160 

<i) SEQUF.NCE CHARACTERISTICS: 

(A) LENGTH: 3051 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
<D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA {genomic! 

(iii) HYPOTHETICAL: MO 

(iv) AIJTI-SENSE: NO 

(vij ORIGINAL SOURCE: 

(A) ORGAIJISM: PORYPHYROMONAS GINGTVALIS 

(ix» FEATURE: 

(A) NAME/KEY: mi£c_f eature 

(B) LOCATIOTJ 1. . .3051 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160 



ATGAAAAGAA 
CAGAATAGAA 
GCGAATGTCG 
TTCACGCTTA 
ACCAAAGAGG 
CTGGAGCAGG 
TCTGTGGCCA 
GCCCTCCAAG 
GTCGCTTCTG 
ATCGTGGATG 
GAATCTATGT 
AACGGAGTCG 
TTTAATGCCA 
ACTGGAGATG 
GITCAGAAGG 
TCTTTGAAAG 
TGGCTCAAGG 
GGGTCTCAGG 
CGTGAACCGG 
GAATGGCTGA 
TATTTTGGAA 
TATAACCCTT 
AGACCTTCTA 
CAGGCCAATG 
CAGGCTGGTG 
TATGATTCTA 
TTTACAAATA 
ATGGGGCATG 
GAAAGTGATA 
GAACACAGAG 
GACAAATGGA 
AATAATAGAA 
ATTCAAGAAA 
AACTCGGAGA 



TGACGCTATT 
CCGTGAAGGG 
TGGTTGTCGG 
GCGTGCCl'GC 
TCGCCATCGC 
TAGTTGTATT 
AAGTGTCCAG 
GTCAGGTAGC 
TGGAGATCGA 
GTATGCAAAC 
CCGTTTTGAA 
TTTTCATTCA 
GTTACQGGAT 
AATTGCTGGA 
TTAAAGATAT 
ATGAGTATGG 
CTTTGTTTAA 
GAACTTCATA 
CAAATTTTAA 
AAGTTGGTGC 
AGTATTATAT 
TTGATGTGAA 
TGACAGAACC 
TAAATGGTTT 
TTGATATTAC 
CTCCTCTTGQ 
CGGCTGAATA 
AATATATTGA 
AGTTGATGTT 
TCGCTGAATA 
TGTATATAGA 
GCGCGTGGTT 
GTAATTGGCT 
TTGGTAATTA 



CTTCCTTTGC 
TACAGTTATC 
AAACACCACT 
CAATGCCAAA 
TAATGTGATG 
GGGTTACGGl' 
CGAAAAGCTC 
CGGTATGCAG 
TC5GTACAGGG 
TTCTTTGGAT 
AGATGCTTCT 
AACGAAGAAA 
TTCTCAAATC 
TTTTCAGGTG 
GATCCTTGCC 
TAAGACATTG 
AACAGCACCC 
TTATGCCTCT 
GCGCTATAGT 
AAATTTGTCT 
GGGGTCAGGT 
TGGGGATTTA 
GTACTTCGCA 
CGCCCAGATT 
TAATACTCGC 
GGAAAGAAGA 
TAAGTTTTCA 
ATATGAAGGG 
ACTGAGCCAG 
TGCCTATTTG 
TTTCTCTGTT 
CTATTCTGTC 
CAGTGATCTT 
CAACCACCAA 



TTGCTGACGA 
TCCTCCGAGG 
ATCGGTGCTG 
ATGTTGAGAG 
AAGATCGTAC 
ACGGGACAGA 
GCGGAAAAGC 
GTTATGACTA 
TCGTTGGGGG 
GTTGTGGCTA 
GCAACATCTA 
GGTAAAATGA 
CTGAATACTA 
AAGGCAGGTT 
GGAGCTGAAG 
TTCCCAGTGG 
ACCAGTCAAG 
ATAGGCTACT 
GGCCGGCTCA 
GGTGCGATAG 
ACTTTCCCTG 
GCAGATGTCT 
AAAATGAGAC 
ACTCCGATCA 
ACTTCTTCTA 
GAAAGAGCTT 
ATTGATGAAA 
GATGTTATTG 
GGAAAAACCG 
TCTTTCTTTA 
CGTAATGACC 
GGTGGAATGT 
CGACTGAAAA 
GCACTCCTTA 



GCATTGGGTG 
ATAATGAGCC 
CAACCGACTT 
TGTCCTATTC 
TGGATCCGGA 
AACTCAGCAC 
CCGTTGCCAA 
CATCCGGTGA 
CAAGCTCTGC 
CGATGAATCC 
TTTATGGAGC 
GCGAGAGAGG 
AGCCCCTTGA 
TTTGGGGGAA 
ATTTGTATGG 
ATTTTAATCA 
GTGATATTTC 
TCGATCAGGA 
ACTTCGAAAG 
CGAATAGACG 
TGTTAACGAT 
ATTACATGTA 
CGTTCAGTTC 
AAGGCCTTAC 
AGAGAATGCC 
ATCGAGATGT 
AACATGATCT 
GGGCATCTTC 
GAAATAGTTT 
GTCGTTTTAA 
AATCCTCTCG 
TTGACATATA 
TGAGTTATGG 
CTGTGAACAA 



GGCTATGGCC 
CCTCATCGGC 
GGATGGCAAC 
CGGTATGACT 
CTCTAAGGTT 
TGTTTCCGGT 
TATCATGGAT 
CCCTACTGCC 
ACCATTGTAT 
GAATGATTl'T 
TCGTGCTGCA 
TCGTATTACC 
TAATATGATG 
CAATCAAACC 
CAATTATGAT 
TGATGCAGAC 
TTTCTCCGGA 
AGGTATGGCT 
TCGTATCAAT 
ATCTGCCGAC 
GCCTCGTTAT 
TGGAGCTACC 
CGAATCACAT 
TTTAAAGGCA 
CAATAATCCG 
TAGCAAGTCT 
TACAGCATTG 
TAAAGGATTT 
GTCTTTGCCT 
TTACGGTTTT 
ATTCGGATCC 
TAATAAATTC 
TACAACGGGT 
TTATACTGAA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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15 



GATGCTATGG 
TCTCAGTTCA 
GATTTCTATG 
GGTTTCTTCT 
CTTAAGGG6A 
TACAATAGAC 
ACCGGTACTA 
ATCGACAAAA 
AATAAAGTTA 
ACTCCTCCTA 
GCTGATTTTG 
AATGCAGGTG 
GATAATAAAG 
TTGGAGAATG 
AGTCTTTTTG 
CTGTTAACTG 
AATCAATATC 



GGCTTAGCAT 
ACTTCGGTTT 
TCCGCACTAC 
CACAGTATCA 
CGATCTACCA 
AGGAAATAAC 
TATGGGAAAT 
AAACCGGTAA 
CAACAAGCCA 
TTACAGGTGG 
CCTACATCGT 
GATTGATGCA 
AAACAGATGT 
CTTCTTTCCT 
CTGGGGAGAA 
TTACGAAGTA 
CTAATTCTAA 



TTCTACAGCA 
GGCTGCAGGG 
GAATGATATG 
GAATGTAGGC 
AAATAAGGAC 
AAAGCTTTTC 
TGGGTACCCC 
GCAGTTGTGG 
GTACTCAGCT 
TTTCTCCTTA 
TGGTAAATGG 
ATTAAATAAA 
TCCAAAATTG 
GCGTTTGAAG 
TGTGATTGGT 
TAAAGGCTTT 
GCAGTACGTT 



GGCAATCCCG 
GCTTTCAATA 
TTGATTGATG 
TCTATGAAAA 
TGGAATGTAT 
TTCGGTCTCA 
AATTCGTTCT 
TATGTTCCTG 
GACTTGGAGA 
GGTGCTTCTT 
ATGATCAATA 
GATAAAATGC 
GGACAGTCTC 
AATCTCAAAC 
GGAGCTCGTG 
GACCCrGAAG 
GCGGGTATTC 



ACCTCTCGTG 
ATCGCTTATC 
TCCCGATGCX: 
ATACGGGTGT 
ATGCTTCTGC 
ATAAGTACAT 
ATATGGCTGA 
GTCAA6TCGA 
CACGAATTGA 
GGAAAGGACT 
ATGAIXGTTA 
TATTGAATGC 
CTCAGTTTGA 
TCACCTATGT 
TCTATTTGAT 
CAGGGGGGAA 
AGTTGTCTTT 



GGAAAAGCAG 
TGCA6AGGTA 
TTATATCAGT 
AGACCTTTCT 
GAATTTCAAC 
GTTGCCTAAT 
ATATGCTGGA 
TGCGGATGGT 
TAAGTCTGTT 
TTCTTTAQAT 
CTTTACAGAG 
CTGGACAGAG 
TACGCATTTG 
ACTCCCCAAT 
GGCGCGCAAT 
TGTGGGAAAA 
C 



2100 
2160 
2220 
2280 
234 0 
24 00 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3051 



20 (2) INFORMATION FOR SEQ ID HO: 161 

(i) SEQUENCE CHARACTERISTICS: 

(A) L£NGTH: 3042 base pairs 

(B) TYPE: nucleic acid 
25 (C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

{ill MOLECULE TYPE: DNA (genomic) 

30 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
35 (A) ORGANISM: PORYPHYROMONAS GINGIVA LIS 



40 



55 



70 



( ix } FEATURE : 

(A) NAME/ KEY: misc:_f eature 

(B) LOCATION 1...3042 

{xD SEQUENCE DESCRIPTION: SEQ ID NO: 161 



ATGACGCTAT TCTTCCTTTG CTTGCTGACG AGCATTGCGT GGGCTATGGC CCAGAATAGA 60 

ACC6TGAAGG GTACAGTTAT CTCCTCCGAG GATAATGAGC CCCTGATCGG CGCGAATGTC 120 

45 GTGGTTGTCG GAAACACCAC TATCGGTGCT GCAACCGACT TGGATGGCAA CTTCACGCTT 180 

AGCG-J'GCCTG CCAATGCCAA AATGTTGAGA GTGTCCTATT CCGGTATGAC TACCAAAGAG 24 0 

GTCGCCATCG CTAATGTGAT GAAGATCGTA CTGGATCCGG ACTCTAAGGT TCl'GGAGCAG 300 

GTAGTTGTAT TGGGTTACGG TACGGGACAG AAACTCAGCA CTGTTTCCGG TTCTGTGGCC 360 

AAAGTGTCCA GCGAAAAGCT CGCGGAAAAG CCCGTTGCCA ATATCATGGA TGCCCTCCAA 420 

50 GGTCAGGTAG CCGGTATGCA GGTTATGACT ACATCCGGT6 ACCCTACTGC CGTCGCTTCT 4 80 

GTGGAGATCC ATGGTACAGG GTCGTTGGGG GCAAGCTCTG CACCATTGTA TATCGTGGAT 540 

GGTATGCAAA CTTCTTTGGA TGTTGTGGCT ACGATGAATC CGAATGATTT TGAATCTATG 600 

TCCGTTTTGA AAGATGCTTC TGCAACATCT ATTTATGGAG CTCGTGCTGC AAACGGAGTC 660 

GTTTTCATTC AAACGAAGAA AGGTAAAATG AGCGAGAGAG GTCGTATTAC CTTTAATGCC 720 

AGTTACGGGA TTTCTCAAAT CCTGAATACT AAGCCCCTTG ATAATATGAT GACTGGAGAT 780 

GAATTGCTGG ATTTTCAGGT GAAGGCAGGT TTTTGGGGGA ACAATCAAAC CGTTCAGAAG 840 

GTTAAAGATA TGATCCTTGC CGGAGCTGAA GATTTGTATG GCAATTATGA TTCTTTGAAA 900 

GATGA GTAT G GTAAGACATT GTTCCCAGTG GATTTTAATC ATGATGCAGA CTGGCTCAAG 960 

GCTTTGTTTA AAACAGCACC CACCAGTCAA GGTGATATTT CTTTCTCCGG AGGGTCTCAG 1020 

60 GGAACTTCAT ATTATGCCTC TATAGGCTAC TTCGATCAGG AAGGTATGGC TCGTGAACCG lOBO 

GCAAATTTTA AGCGCTATAG TGGCCGGCTC AACTTCGAAA GTCGTATCAA TGAATGGCTG 114 0 

AAAGTTGGTG CAAATTTGTC TGGTGCGATA GCGAATAGAC GATCTGCCGA CTATTTTGGA 1200 

AAGTATTATA TGCGGTCAGG TACTTTCGGT GTGTTAACGA TGCCTCGTTA TTATAACCCT 1260 

TTTGATGTGA ATGGGGATTT AGCAGATGTC TATTACATGT A'i'GGAGCTAC CAGACCTTCT 1320 

DO ATGACAGAAC CGTACTTCGC AAAAATGAGA CCGTTCAGTT CCGAATCACA TCAGGCCAAT 1380 

GTAAATGGTT TCGCCCAGAT TACTCCGATC AAAGGCCTTA CTTTAAAGGC ACAGGCTGGT 14 40 

GITGATATTA CTAATACTCG CACTTCTTCT AAGAGAATGC CCAATAATCC GTATGATTCT 1500 

ACTCCTCTTG GGGAAAGAAG AGAAAGAGCT TATCGAGATG TTAGCAAGTC TTTTACAAAT 1560 

ACGGCTGAAT ATAAGTTTTC AATTGATGAA AAACATGATC TTACAGCATT GATGGGGCAT 1620 

GAATATATTG AATATGAAGG GGATGTTATT GGGGCATCTT CTAAAGGATT TGAAAGTGAT 1680 

AAGTTGATGT TACTGAGCCA GGGAAAAACC GGAAATAGTT TGTCTTTGCC TGAACACAGA 17 4 0 

GTCGCTGAAT ATGCCTATTT GTCTTTCTTT AGTCGTTTTA ATTACGGTTT TGACAAATGG 1800 

ATGTATATAG ATTTCTCTGT TCGTAATGAC CAATCCTCTC GATTCGGATC CAATAATAGA 1860 

AGCGCGTGGT TCTATTCTGT CGGTGGAATG TTTGACATAT ATAATAAATT CATTCAAGAA 1920 

75 AGTAATTGGC TCAGTGATCT TCGACTGAAA ATGAGTTATG GTACAACGGG TAACTCGGAG 1980 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



iF~« 4r=-M it^' «il -rr' 
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if4. ...tJ= ^""ij" ri=4r 



WO 99/29870 



PCT/AU98/01023 



110/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Al-l'GGTAATT 

g*3gcttagc:a 
aacttcggtt 
gtccgcacta 
tcacagtatc 
acgatctacc 
caggaaataa 
atatgggaaa 
aaaaccggta 
acaacaagcc 
attacaggtg 
gcctacatcg 
ccattgatgc 
gaaacagatg 
gcttctttcc 
gctgggcaga 
gttacgaagt 
cctaattcta 



ACAACCACCA 
TTTC-l'ACAGC 
TGGCTGCAGG 
CGAATGATAT 
AGAATGTAGG 
AAAATAAGGA 
CAAAGCTTTT 
TTGGGTACCC 
AGCAGTTGTG 
AGTACrCAGC 
GTTTCTCCTT 
TTGGTAAATG 
AATTAAATAA 
TTCCAAAATT 
TGCGTTTGAA 
ATGTGATTGG 
ATAAAGGCTT 
AGCAGTACGT 



AGCACTtJGTl' 
AGGCAATCCC 
GGCTTTCAAT 
GTTGATTGAT 
CTCTATGAAA 
CTGGAATGTA 
CTl'CGGTCTC 
CAATTCGTTC 
GTATGTTCCT 
TGACTTGGAG 
AGGTGCTTCT 
GATGATCAAT 
AGATAAAATG 
GGGACAGTCT 
GAATCTCAAA 
TGGAGCTCGl* 
TGACCCTCAA 
TGCGGGTATT 



ACTGTGAACA 
GACCTCTCGT 
AATCGCTTAT 
GTCCCGATGC 
AATACGGGl'G 
TATGCTTCTG 
AATAAGTACA 
TATATGGCTG 
GGl'CAftGTCG 
ACACGAATTG 
TGGAAAGGAC 
AATGACCGTT 
CTATTGAATG 
CCTCAGTrrTG 
CTCACCTATG 
GTCTATTTGA 
GCAGGGGGGA 
CAGTTGTCTT 



ATTATACTGA 
GGGAAAAGCA 
CTGCAGAGGT 
CrTATATCAG 
TAGACCTTTC 
03AATTTCAA 
TGTTGCCTAA 
AATAT6CTGG 
ATGCGGATGG 
ATAAGTCTGT 
TTTCTTTAGA 
ACTTTACAGA 
CCTGGACAGA 
ATACGCATTT 
TACTCCCCAA 
TG3CGCGCAA 
ATGTGGGAAA 
TC 



AGATGCTATG 
GTCTCAGTTC 
AGATITCTAT 
TGGTTTCTTC 
TCTTAAGGGG 
CTACAATAGA 
TACCGGTACT 
AATCGACAAA 
TAATAAAGTT 
TACTCCTCCT 
TGCTGATTTT 
GAATGCAGGT 
GGATAATAAA 
GTTGGAGAAT 
TAGTCITTTT 
TCTGTTAACT 
AAATCAATAT 



(2) INl'ORMATIOfI FOR SEQ ID NO: 162 

(1) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 7,«I63 base pairs 
(8) TYPE: nucleic acid 

IC) STRAW DEDIIESS: double 

ID) TO POLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) AMTI-SEIJSE: tlO 

(vi) ORIGIIIAL SOURCE: 

(Al ORGAIJISII: PORY PH\'R0t10KAS GIHGr/ALIS 

Ux) FEATURE: 

(A) MAIIE/KEY: misc_f cature 
(J3) LOCATION 1...24 63 



(yA) SEQUENCE DESCRIPTION: SEQ ID NO: 162 



ATGAAGAAAA 
AGCATGCAGG 
TTCTCATTCA 
GACGAGTTTG 
GCATTCCTGC 
GTCGAGATGT 
GAAGCCTATA 
TCCGTCATTG 
GCCAATACAG 
ACCGGATATG 
CCGAGGATTA 
GACGGAGCAT 
AATGTCGATT 
CCTTGGCACC 
GGTGCCAATA 
AGACATGCAG 
AGTATTTCCG 
TGGTATGTAA 
AITACCCTTG 
AATCTGAACA 
ACAGTGACTT 
GACAAGCGCA 
AAAGCCGGCA 
ATTACGATCA 
CTCCCTGTAC 
TTCCAAGATC 
CCTGCCATGA 
GTTACGCTCA 
ACTGTCAATG 
GGCCAGACGG 
GATGGCGGCA 
GGCCAATACG 



AGAATTTTTT 
CACAACAGGC 
AAGTGCCTGA 
ACCCCGTAAC 
GCTATGGGCT 
TCGACTACCG 
.TCAGCATGAT 
GCAAGTCCGT 
GG AAAAAGCC 
TGGTACTGCT 
AGAACATTCT 
ACAGAGCCGG 
TGAACCGTAA 
CGGAGGCAAC 
TACATGGAGG 
ACGATGAGTG 
CCAGCTACAT 
TTCGCGGAAG 
AAATCAGCAA 
AAGAATCTCT 
CCGCTGCGAA 
ACTCCGATGT 
CTTATACGGT 
AGGACAAAGA 
CCGATTTCAC 
AAACGACAAA 
GTACAGAGCA 
AAGTGTGGAA 
CCGTTATGCC 
TATCTTTCCA 
CTCCCGCTAC 
ATGTCACGCT 



GCTTCTTGGC 
CAAAGATTAT 
TAGGCCCGTT 
CAATGAAGCC 
CAAGCCTACA 
CTCAGGAGAA 
GGAAGACTTC 
AAAGGATCGT 
TCGCGTGCTC 
CCGACTCATA 
GGATAAAACG 
AAACCACACC 
CrTCAAGGAT 
TGCATTCATG 
AACAGAGGTG 
GTACAAACTG 
GACCTCCGAA 
TCGTCAGGAC 
CACGAAGTTG 
GCTTGCTCTG 
CGGACAGCCT 
TTACTCCGAT 
GAAATACAAA 
AACCGTCATC 
AGCTTCTCCT 
TAACCCCACG 
GAATCCGCTC 
TGCAAGT6GT 
TGTAGCTGAA 
AAACCAATCC 
CAGTGAAGAC 
CAAGGCGATC 



ATTTTCGTTG 
TTCAACTTTG 
CTACAAGAGC 
ATTGCCTATG 
TTCTTGACTC 
AAATAC6AAT 
CAAACAAAGT 
AAACTGATGA 
TATACTTCTA 
GACCATCTGC 
GAAGTATGGA 
GTACAAGGAG 
GATGTAGCCG 
GATTTGGAAG 
C5TGAACTATC 
ATCAGTCGCA 
ACCAATTCGG 
AATGCAAATT 
GTGCCGGCCr 
ATCGAAGAAT 
CTCAAATGCC 
GCTACCACAG 
GCCGAGGGTT 
ATUGACATTG 
ATGACCATCT 
AATTGGGAGT 
GTATCCTATA 
TCCAACACGA 
TTCGTCGGTA 
ACCAATGCCA 
GAAAACCCGA 
AGTGCTTCCC 



CTTTGCTGAC 
ACGAACGGGG 
TGGCTCTGAT 
CCAGCGAAGA 
CTCCATCCAT 
GGAATGCTTA 
ATCCATCACT 
TTTGCAAGCT 
CCATGCACGG 
TGTCGAACTA 
TCTGCCCTTT 
CTACfCGCTA 
GTGATCACCC 
G?\AACACCTC 
CATGGGATAA 
ACTACGCAGC 
GAATCATCAA 
ATTTCCATCG 
CTCAACTTCC 
CCTTATACGG 
AGATCTTGAT 
GCTACTACGT 
ATCCTGAGGC 
CATTGGGCAA 
CAGTAGGCGA 
GGACGTTCGA 
GTCATCCCGG 
TTACGAAAGA 
CCCCGACGGA 
CCAACTACGT 
CTGTGCTTTA 
GTGAAACGGT 



TTTCATCGGC 
CGAGGCCTAC 
CATGTCCATC 
GGAGTTCGAG 
GCAGCGCGCT 
CCCCACCTAT 
TTGTACTACT 
GACGTCCTCT 
AGACGAAACG 
CGAATCC6AT 
GACCAATCCG 
CAATGCCAAC 
CGATGGAAAA 
TTTCGTGCTC 
TAAAAAAGAA 
CGCTTGTCAG 
CGGTTCAGAC 
TCTGCGAGAA 
AAAGTATTGG 
CATCCATGGT 
AGAAAACCAT 
ACGTCCTATC 
AACTCGTACC 
CTCGGTTCCT 
AAGCGTCCAA 
AGGCGGACAG 
TCAGTACGAC 
AAAATTCATC 
AATAGAAGAG 
ATGGATATTC 
CAGCAAAGCC 
GAAGACGAAA 



204 0 
2100 
2160 
2220 
2260 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3042 



60 
120 

leo 

240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1S60 
1620 
1680 
1740 
1800 
1860 
1970 
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GAAAAATACA 
ACACCTCGAA 
CCGACrrCAT 
CCGGTGGTCA 
GGCGGAAGCA 
GAGGACATAG 
CTCATAGAAG 
GTACTCAAAA 
GCAATCTACA 
GGG 



TCACTGTCAA 
AftGTAAAGAA 
GGCTTTGGGT 
CCTACAATGA 
ATGTGAAGAA 
TGGCACAGAC 
CCAACGCTGC 
CTACTCCGAA 
CCATCAATAT 



GAAAGCTCCG 
AGGCGAGACA 
GTTCGAAGGC 
AACAGGCAAG 
AGCAGAAGAC 
GGGTATCGTC 
TATCAAAGCG 
TCAGCTCCGC 
CAAAACGGAA 



Gl'CCCl'GCTC 
GTTACTTTCA 
GGCTCTCCTG 
TACGATGTCC 
TACATTGAGG 
ATTCGTCCGC 
ATCGTTCTCT 
TCGACCGTAG 
AAATCCGCTC 



CGGTAGCCGA 
AAGACTTGTC 
CCACCAGCAC 
AGCTGACTGC 
TTATCCTCGA 
AAAACGGAAC 
ATGACATCRA 
ATCTTTCCAT 
GCACGGAAAA 



CTTCGAACGA 
TACGAACAAT 
GGAGCAAAAC 
C3\CCAACGAG 
TGACAGTGTC 
GAAGCAGATC 
TGGACGGGTC 
CCTGCCCGAA 
GATCCATATC 



1980 
2040 
2100 
2160 
2220 
22B0 
234 0 
2400 
2460 
2463 



(2 J IHFORMATIOH FOR SEQ ID NO: 163 

(1) SEQUEMCE CHARACTERISTICS: 

<A) LENGTH! 228 base paixs 
(BJ TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(iv) AIITI-SE^^SE: HO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 

lAt NAME/KEY: misc feature 

(B) LOCATION 1...22fl 

(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 163 

ATGCGACTGA TCAAGGCTTT TCTCGTGCAA CTCTTACTGC TCCCCATTTT CFTCTACAAG 
CGGTTTATAT CGCCGCTTAC ACCGCCTTCA TGCCGGTTTA CCCCCTCATG TTCGTCCTAT 
GCCATCGAAG CCTTACGTAA ATATGGCCCG GGCAAAGGAC TATTGCTGAG CATCAAGCGT 
ATTCTCCGCT GTCACCCGTG GGGTGGAAGT GGCTATGACC CCGTTCCG 



60 
120 
180 
228 



(2) INtORMATIOH FOR SEQ ID NO: 164 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2577 base pairs 

(B) TYPE: nuclftlc acid 

(C) STRANDEDt>IESS: double 
(0) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA < genomic J 

(ill) HYPOa*HETICAL: NO 

(iv) AIITI -SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: mlsc_feature 

(B) LOCATION 1...2577 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164 



ATGGCATACG 
ACCCGTCTCG 
GGAGACAATG 
CGGTTGATCG 
ATCTTCACCC 
GAGGACGAGG 
AGCTTAGTAC 
TACTTCGGAC 
GGGTACCAAG 
AGCGGGACAG 
ACCACCACGA 



ACTTTAGACA 
GCGTCGTAGC 
GCGCGATCGA 
AGTTGGAAGC 
CTTCGGCTCG 
CCGTCAGCCC 
AA7\AGATATT 
AGCGCAACCC 
ACAACGACTT 
GCGGAGGCTC 
CACGGAGTGG 



AACATTCCGC 
CGTGACGCAA 
CATCATGCGG 
CATCGCCGAG 
GGAGGCTATC 
CXiTCCATCTG 
TATGAAACAA 
CTCCGAAGGO 
CGACGACGAA 
CGGCGACGCC 
AGGCGACACG 



AACAGCCTGG 
GATATGCTCG 
CACTATGGGA 
AGTTTGCCTG 
GATGATGCCA 
TTGCTGAGTA 
GGTATAAAAT 
AAGTCTCCCT 
GAGGACGAAT 
CCCGAACAGA 
CCTGCACTGG 



AGTACAGCTA 
TACTCGGTAT 
TCAACTTCTA 
CTTCGCCTGA 
CAGACATCTG 
TCCTCAACTC 
ACGACACCAT 
CCGAAATGGA 
CCTCTCOGCC 
ATACCGGCGG 
ACACCTTCGG 



TCAGGAAGCA 
CATTCGCGAC 
CGAACTCAAA 
GGGATCGCCC 
TGCCGACATG 
GACACAGGAG 
CCTGTCGGAT 
GATCCTCGAC 
TTCCGGGAAT 
AGGCGATACT 
CTvCCGACATC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



ACTGCCATCG 
AGGGTGATAC 
GGTGTAGGCA 
AGCCGTATTC 
ACCAAATATC 
AATCCGCAGA 
GCCGGATCGA 
TGCATCGGAG 
GAACGCCGCT 
CTGCAAAACA 
ATCAAAGCGG 
GCGATAGATG 
AAAGAAATCG 
GTAAAGGCTC 
CAGCAGATAG 
GTGGACGAGA 
CTGAGCACGG 
GTAGGTCAGG 
CTTCGCAATG 
AAGACCTATT 
AGGGTGGATA 
CCGGGATATG 
TATTCCGTGG 
TTACAGCjI'GA 

aacaccgtga 
ggtatcgg-:;? 

ATCCAAAAAG 

atcctcttcg 
gccctcttgg 
gatgtgatag 

CTCCAGAACG 

gggcagacgc 



CGGCAGCAGG 
AGATACTCAG 
AGAGTGCCAT 
TTTTCGACAA 
GCGCACAGTT 
TCATCCTCTT 
TGGATACGGC 
CCACTACGCT 
TCCAGAAGGT 
TCAAAGAGAA 
CAGTGGAACT 
CCATGGACGA 
AGATACTGGA 
AGAACTACGA 
CGGAAGAGAA 
ATGTAGl'GGC 
GCGAAGGCGA 
ACACAGCCAT 
AAAAGAAACC 
TGGCCAAGAA 
TGAGCGAGTA 
TGGGCTATGA 
TTCTCTTGGA 
TGGACGAAGG 
TCATCATCAC 
TCCCT rCGGA 
CTCTGAACAA 
ACCAACTGGG 
CGCGCATCCA 
CGACGAAGGG 
AAGTGGAGGA 
TTACGCTCTC 



CAAGCTCGAC 
CCGGCGCAAA 
CGTGGAAGGA 
GCGGATCATC 
CGAAGAGCGG 
CATCGACGAG 
CAATATGCTC 
GGATGAGTAT 
GCCGATAGCC 
ATACGAGGAC 
GACCGATCGC 
GGCCGGCGCG 
GGCCGAATTG 
ACTGGCTGCC 
GAAAAAATGG 
GCATGTAGTG 
ACGTCTGCGC 
CGAAAAGATG 
GATCGGTTCT 
GCTCGCCGAA 
TATGGAGAAG 
AGAAGGCGGC 
TGAGATCGAA 
TCAGCTGACC 
CTCCAACGTG 
AAAAGACGAG 
GACGTTCAGC 
CAAGACGGAG 
TCGTGCCGGA 
ATACGACXrrC 
TCGCCTCACG 
TGCTCGCGAT 



CCGGTAGTGG 
AAGAACAATC 
CTGGCCGAAC 
AGCCTCGATT 
TTGAAAGCCG 
ATACATACCA 
AAACCXTGCTC 
CGTAAGAACA 
CCCTCGACTG 
TATCACGGTG 
TATGTATCCG 
AGCGTCCATA 
GCATCGCTGC 
TCCTTCCGCG 
GAAGAGCAGA 
GCGTTGATGA 
ACGATGGCAG 
GTGCATGCCA 
TTCCTTTTCC 
TACCTGTTCG 
TTCTCCGTTT 
CAACTGACGG 
AAGGCGCATG 
GACAGTCl'GG 
GGTACACGCC 
GAAGOSAACA 
CCCGAATTTC 
AITCGCCGGA 
TACGACCTCG 
CaATACGGAG 
GATCTTATCC 
GGCGAGATCA 



GTCGGGAGCA 
CGGTGCTCAT 
GCATCGTGAA 
TGGCTCAGAT 
TGCTCGATGA 
TCGTGGGAGC 
TTGCCCGTGG 
TAGAAAAGGA 
CAGAAGAAAC 
TACGCTATAC 
ATCGTTTCTT 
TCAOCAATGT 
GAGAGAACAA 
ATCAGGAOCG 
TGTCCAAGCA 
CAGGCGTTCC 
ATGATCTCAA 
TCCAGCGCAA 
TCGGCCCCAC 
AGGATGAGAA 
CGCGTCTCGT 
AGCGCGTAAG 
CCGATGTCTT 
GACGGCGCGT 
AGCTCAAAGA 
AGGAGCATAG 
TCAACCCTTT 
TGGTGGACAT 
TCCTTACCGA 
CACGACCGCT 
TCTCCGGACA 
TCGTACAAGA 



GGAGATCGAA 
CGGCGAACCC 
CAGGAAGGTG 
GGTAGCXrGGC 
GCTGAAGAAG 
AGGCTCTGCA 
ACAGGTACAG 
CGGAGCACTC 
GCTGACXIATC 
GGACGAAGCG 
CCCAGATAAG 
GGTGGCTCCG 
GCTCTCGGCC 
GCGCACTCAG 
CCGCGAGACG 
GGCTGAGCGG 
GACCAAA6TA 
TCGTCTGGGA 
GGGGGTAGGC 
TGCCATGATC 
GGGTGCCCCT 
ACGCAAACCC 
CAATCTCCTC 
GAATTTC7UVG 
CTTCGGGCAG 
CCGTTCCGTG 
GGACGATATC 
AGAGCTTAAA 
TGAAGCCAAG 
CAAGCGCACA 
GAT CG AGAAA 
ACAAGCA 



(2) IHFOWIATIOt^f FOR SZQ ID HO: 165 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) Lr^GTH: 1251 base pairs 

(B) TYPE: nuclsic acid 

(C) STRAIJDEDtJESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: Dt4A (genomic) 

(111) HYPOTHETICAL; HO 

(Iv) ANTI- SENSE: HQ 

Ivi) ORIGINAL SOURCE: 

lA) ORGANISM: PORYPHYROMONAS GINGIVALIS 

( ix ) FEATURE : 

(A) HAJ1E/KEY: mlsc feature 
(B| LOCATION 1...1251 

(Xi) SEQUENCE DESCRIPTIO^^: SEQ ID MO: 165 



ATGAATTACC 
ATTGCTTTCC 
ACAGGCAGAG 
GTGGGTAATA 
CCTTTGGCGC 
ACTATCATCA 
AATATGATGA 
CTGTCTAAAC 
GTGCCTACAA 
TCCGGAGAAA 
GATTTTTCCG 
GAGTTGCAAA 
ATCATTATCT 
TTTCGTGGGC 
TATGCCAATA 
GATGAACTTG 
GGTGACATTG 
ACCTATCTGG 



TGTACATACT 
TTTCTTCGGA 
CGTTAAACTT 
ATATCGTTTT 
AATGGATTGA 
TACTGTTTAC 
TGAGGGTATT 
TCTTCACCGG 
CAGTAGGGTT 
ACGAACAGAA 
GTATTCACGT 
CGGATATTGA 
ACAGACAGAA 
AAGACTGGCA 
AACTGATGCG 
GAGGTACGGC 
AGGACGAACA 
TCAGTGGTCG 



GATTACACTT 
CAAACTGCGT 
GCTGTATCGA 
GGTAGTCTAT 
TAACGATGCT 
CGGGGAATTT 
OjCCCTCCCT 
TTTATCTCGC 
GGGGCGCGTA 
CGACTTGACT 
GCGAGACTGC 
AGTACTCAAA 
CATAGATGAC 
AAAACGTATC 
ACTACTCATG 
CGGAATGGTC 
CGACACTCGC 
TATGGAAATA 



TTACrCTCCG 
CTTGAGTTGG 
CATCCGGACC 
GGTCTGCTGA 
ATGATCGTCG 
CTACCCAAAG 
ATCGTAGCGA 
TCTTTTATTC 
GATCTCGATC 
ACCGAACTGA 
ATGATCCCAC 
ACGACTTTTA 
GTAGTAGGAT 
AATACTACTG 
CAGCGCAAGA 
ACATTAGAGG 
AAGATCATAG 
GATGATGTGA 



GCTTTTTTTC 
ACAGGAATAG 
AACTGGTGAC 
TGGCGGGATT 
TTCTCCAATC 
CCATTTTCAA 
TCTATTATCT 
GTCTGGTGGA 
ATTATTTGGC 
AAATCATCCA 
GCAATGAGAT 
TCGATACCGG 
ATATCCATTC 
TArrCGTACC 
AAAGCATTGC 
ATTTGGTAGA 
CCAAACAGCT 
ACGAACGTTT 



CGGTGCTGAG 
AGGCGATCTC 
TACTCl'CCTT 
GCTGGCCGCA 
TGTCTTATCC 
GACCAATGCC 
GCTTTATCCT 
CAAGAATTAT 
AGAAAATATG 
6AATGCGCTG 
GATAGCATGT 
TTTGTCCAAG 
GAGCGAAATG 
CGAAAGCATG 
GATCGTCATC 
AGAGATTTTC 
CGGCCCTCA'J* 
TGGGTTGTCC 



720 
780 
840 
900 
960 
1020 

ioao 

1140 

1200 

1260 

132C 

1380 

14«I0 

1500 

1560 

1620 

1680 

1740 

1800 

1B60 

1920 

1980 

2040 

210C 

2160 

2220 

22 8 G 

2340 

2400 

2460 

2520 

2577 



60 
120 
180 
240 
300 
360 
420 
400 
540 
600 
660 
7 2C 
780 
840 
900 
960 
1020 
1080 
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TTGCCTGAGT CTGACGACTA CCTTACCGTG GCCGGATTTA TCCTGAATAG CCATCAAAAT 1140 

ATCCCACAGG CCAATGAGGT CGTGGAGATT GCTCCTTATA CTTTTACCAT TCTCAGATCr 1200 

TCTTCCACCA AGATCGAACT GGTGAAAATG TCCATCGACG ACCAATCGAA C 12S1 

(2) INFORIIATION TOR SEQ ID NO: 166 



(i) SEQUENCE CHARACTERISTICS: 

(A) LDJGTH: 87 9 base pairs 

(B) TYPE: nucleic acid 
IC) STRANDEDHESS: double 
(D) TOPOLOGY: circular 

(il) HOLECULF, TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 



(iv) ANTI-SENSE; NO 

20 {vij ORIGINAL SOURCE: 

(A) ORGANISH: PORYPHYROMONAS GINGIVALIS 

(i.x) FEATURE: 
- (A) NAUE/KEY: rolac_feature 

(B) LOCATION 1...879 



(>:1) SEQUENCE DESCRIPTION: SEQ ID HO: 166 

'kCi ACTACTTCAA AAGAGTCTGC TCACTGOTT GGCTGGTTTT ACCCATGCTT 60 

ATTATGCCAT TGGAAGTAGC AGCTCAAGAG ATTATTCCGA ACGAAGAGGT GTTGGAATCA l?n 

TTGACTTTCG TTGCACCGGT TGAGGAGACA GACGCAATAG AGGCAGAGGT AGAAGCTCTG 180 

CAGGAGATAG TCGCTACTGA GGAGATTGCG GAGCAGGCTG TTCGTTCTTA TACCTACACG 24 0 

^I^IS^*^*^ ATGGCQTGAA GATTGCTTCA GGATTGACTG AGCCCACTTT TCTCGATGAA 300 

GATGTTCCTG CCGCCGAACA TACCTACTGC GTAGAAGTAC AGTATCAGGG AGGCGTATCC 360 

JO GACAAAGTAT GCGTGGACGT ACAGGTGAAG GACTTCAAAC CGGTTACCAA TCTCACOGGA 4 20 

ACTGCTTCCA ATGACGAAGT TTCTTTGGAC TGGGACGGTG TGGAAGAGAA AGCTGAAGAG 4 00 

CCGGCAAGTG ATAAAGCAGT CAGCTACAAC GTCTACAAGA ATGGAACCTT GATCGGTAAT 54 0 

ACAGCTGAAA CTCATTATGT GGAGACCGGT GTAGCCAATG GTACATACAT CTACGAAGTG 600 

GAAGTAAAGT ATCCTGACGG TGTATCTCCG AAGGTGGCTG TAACCGTGAC CGTGACCAAC 660 

^'^^^'^^^ GCAATGTAGA TGGACAGGCT CCTTACACAT TGCGAGTAGA AGGCAAG?AG 720 

ATTATTGCGG AAGCCCATGG TATGATCACG CTCTACGACA TCAACGGACG TACCGTGGCC 7 BO 

GTAGCCCCGA ATCGATTGGA ATACATGGCG CAAACCGGTT TCTATGCAGT GCGCTTCGAT 84 0 
GTGGGGAATA AACACCATGT ATCGAAAATA CAAGTAAGA 



45 



{2) INFORIIATION FOR SEQ ID HO: 167 



87 9 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 936 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 



(Iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRO(-!ONAS GINGIVALIS 

(ix» FEATURE: 

(A) NAIIE/KEY: mi*c_f eature 

(B) LOCATIOli 1 . . .936 



(Xl) SEQUENCE DESCRIPTION: SEQ 10 NO: 167 

7n ^l^l"^'"^^ TAAGCGAGAG TTTTGAATCA GGTATTCCAG CTATATGGAA GACCA'iTGAC 60 

SSJ^^i^^"^ ATGGCTATAA TTGGATGCAT TTGACCAATT TCACGGGACA GAGTGGTCTr^ 120 

CGGCITCATA CATAGGCGGC GTCGGAGen* TGACTCCGGA CAATTATCTG 180 

^J^?^"^^^ AATTAAAACT ACCCACAGAC GCGTTGCTGG AAATAATCTA TTGGGTATGT 24 0 

^^—^^^ TCACTCCTCC ATCGGAGCAC TATGCCGTTT ATTCCTCTTC TACAGGCAAT 300 

7*5 ^^J^^""^"^ ACTTTGTTAA TCTCTTATAT GAAGAGACTT T6ACTGCCAA ACGGATACAA 360 

75 TCCCCCGAGT TGATCCGCGG AAATCGGACA CAAGGTGTTT GGTATCAAAG AAAGGTGGTA 420 
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CTCCCl'AACG 
CTCAATTTGG 
CATCCGG(5TG 
TCGGCATTGG 
GTCAATTATC 
GCAAACATCT 
GCGAGTGCTT 
GGCTGCGATA 
AACGGAACTG 



ATACTAAATA 
ATGAAGTATC 
GTTACACTTA 
CATATATCGA 
TGCAAGGAGA 
ATGGCGCGGA 
TCAAAGGAGA 
CGCTTAGGTA 
TCTATACTGA 



TGTTGCTTTC 
TATCCTGTAT 
TTCTGTATTC 
TACGGATG-l-A 
CTCGTATAAA 
TAAGCCTTTT 
GATCACTCTT 
CAAAGCGGAA 
GAAAATCCAA 



CGCCATTTTA 
ACCCCTCTTC 
CGTGATGGAC 
CCGTATGGGA 
GTCTGCAAAA 
GCGTTGACCG 
TATGACATTC 
AATGGl'lTTT 
ATCCAA 



ATTCCACGGA 
CCCGAAGAGC 
AAAAGATAGC 
CTCAAGACTA 
ATATAGTGGT 
TGGTTGGCAA 
GTGGCCGGCT 
ACCTCATTAA 



TA'\TTTCTGG 
TCCGTGTCCG 
GAGTGGATTG 
TTGTGTCCAA 
GGCAAATTCT 
CyVCCATTGTA 
GATAGCTTCC 
AATACAGGTA 



(2) INFOW-IATIOH FOR SEQ ID NO: 168 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 9 base pairs 

(B) TYPE: nucleic acid 
IC) STRAflDEDNESS: double 
(DJ TOPOLOGY: circular 

(ii) MOLECULE TYPE: DKA (genomic) 

(iii) HYPOTHETICAL: NO 

(iV) AI4TT- SENSE: NO 

(vi) ORIGIMAL SOURCE: 

(A) ORGANISM: PORYPHYROhlOTJAS GINGIVALIS 
( ix ) FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATION 1...2529 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168 



ATGAAAAAAA 
TCTGCTCCGG 
TCCTTGCGAA 
CAAGGAGATG 
GTAGCTAATC 
"ACAATTTTAG 
CGCATGTGGC 
CTCAATGAAG 
AACGGTCATT 
GAACCACTGC 
CAAATCATGC 
GGTTCATl'AG 
CCCGGAAATC 
ATGCGTGATG 
AGCGTTTATG 
CATGTACGC6 
TCCGGAAGCC 
GATGGTTATG 
GGCTTCTACA 
GGTTTTACCA 
GCCGGTACAG 
GAATCCGGAT 
GACCTCGGAT 
ATCAATATCT 
CAGTTGTCAC 
TGGGAGrCGG 
GACCCGAACA 
AACAGCTTTG 
AGCGACAGCC 
GCGCACGATG 
TATCCGGTGG 
TATAGATTTT 
AAAACACCGA 
TATACGCTGG 
TTGCCTTTCA 
TTATGGGCAG 
GTTGTCGAAG 
GGACAACAAC 
ACAGCGGTAG 
GATTATGTGG 
CTGTCAGGCA 



GTTTTCTTTT 
TTACGAAAGA 
TGGGACAAAC 
CTGAGAGGGG 
GTG6AAATAA 
CCTATTCACC 
TACAAATTTA 
AGATATTACG 
TTGCCAACGA 
TTCCTAATGG 
GCTACCATAG 
TTGGCAACTG 
CCGACCTTGA 
TGAGTGCATC 
TAGTAGGAGC 
CCTTATATAC 
CGGTCTATTA 
CTTCGGATGG 
AACTAACACT 
TTTATCAAGA 
ATGCCTTGCC 
TGAACGTAGG 
ACAGATTGAA 
CTTGGTACGG 
AAGGAATCAA 
TACGGCATGC 
ATGTCGTAGT 
TCGCAGATCT 
CTGATGAGAT 
TAATATCTTT 
TTTGGTCTAA 
CCATCAACAA 
CAGAGTATAC 
ATATGGCACA 
ATGGTGAGTT 
CTCAAGAAAC 
GCCCTATACC 
AGTTGTACCT 
AAGCAATAGA 
AAATATCGGC 
AGATTGTCAT 



AGCCATAGTA 
GCGAGCTTTG 
AGCAGTATCT 
TATCACATCA 
TGAGGGCTAT 
CATTGGCCGT 
CGATCAGGAA 
TACCGAGGCC 
TCCCATGCGA 
CAATCATGCC 
CTGGCCGCTT 
GTCCGGCACA 
TAATCTGACT 
TGTTTCGATG 
CTTGCGAAAC 
CTCACAG6AG 
TGCAGGGAAT 
TACTTTCCAT 
CCTCTCGCCG 
GATCATCACC 
GATCTTGGCA 
GTATTCGATA 
CAAGGCTGAC 
ATACGGAGAG 
CACCATCACC 
ACAGGGAGGA 
CACGGTAGAT 
GAATTCII'AT 
CCGTACACCC 
GGGCTGGGTA 
AGACGTTCTC 
CCAAAAGGAT 
GCACCCCTTA 
CAACAGAGTA 
GGTTG'ITGTT 
AGTACATATC 
TGATGGATCC 
CAAGGGGAAA 
ATCGTCAGAA 
ACCTTGCATT 
GAAGAATAGT 



ATGCTCTTTG 
AGTCTGGCTC 
GACAAGATTT 
CAAGAGGAAG 
GCTCTTGTAG 
TTCGACATGG 
ATAGGCCTGA 
GTACCGGCTG 
TGGAATCAAG 
TATACCGGCT 
CAAGGTGAAG 
rrrGGTGAAA 
CAATCTCAAG 
AGTTTTTATG 
AACTTTCGCT 
TGGCACGATA 
AACCAGAGCA 
TTCAACTGGG 
ACTTCGTTGG 
GGTATCGAAC 
CTGAAAGACA 
TATAATACAG 
GGAGAAGTCA 
CATCCCGAGA 
Cl'ACTTTATC 
TATGTCAATA 
AATAACGAAG 
GAACATAGTA 
GTAGCCTTTG 
ATGGCTGAAG 
ACTCTCTCGG 
QAATGGAAAA 
TTCGAAGTGG 
TTGCCCGACT 
TTCCGCCAAA 
AAGCAAGGAG 
TATCGTGCGA 
AGGAACl'ACA 
GAGATCAGAG 
CCCCAAGAAA 
TTATCAGCGG 



GCATTGCCAT 
GGCTGGCTTT 
CCATCGATTA 
GCTCTCCTGC 
CAGCAGATGA 
ACAGTATGCC 
TACTTTCCGG 
AAGTACATGC 
GTTACCCATG 
GTGTTGCTAC 
GCTCTTTCGA 
TGTACGACTG 
TGGATGCCTA 
AAAATGGAAG 
ACAAGCGTTC 
TGATCCGCGG 
TAGGACATGC 
GTTGGGGAGG 
GTATCGGAGG 
CGGCTAAGAC 
TAGAAGCCGA 
GTGAAGAGCA 
TAGAGGTGAA 
GTTTCTCATT 
GTCGCACAGG 
GCATTAAA6T 
GCAAGCl'CAG 
CGATTACAGT 
CTCTATCTAC 
TTCCGGGCGG 
AAGGCGACTA 
AGATCGGAAG 
GCCATAATCA 
TTACACTCAA 
CACAATCCTC 
AAACTTTCGT 
CCCTCCATGC 
CGGTGAAGAT 
TAITCCCTAA 
CATCTATCAT 
GGCATGGCAG 



GCAGGGACAT 
GCGACAGGTA 
CGTTTATCGG 
ATATTTTTAT 
CAGAATACCG 
GGACAATCTT 
AAAAGCTCAG 
TCTGATGGAT 
GAACAATAAG 
TGCTGCAGCA 
TTATCATGCA 
GATCAATATG 
CGCCACACTG 
TGGTACGTAC 
ACTGCAGCXA 
GGAACTTGCC 
TTTCGTTTGC 
TGTTTCCAAC 
TGAGGGAATA 
TCCCGCTGAA 
GTATAAAAGT 
ATCAAATCTT 
AACTTCATCT 
GGCACCTAAT 
CACCCAACAG 
AAATAC6ACA 
TATCGTCCCC 
ACAGTTCAAT 
ACCAGCTACT 
TAGCAGCAAC 
TACATTGTGG 
CGTGTCAGTA 
AACTTCTACC 
AAATCTCGGA 
ATCGGGGTCT 
ATATAAACCT 
ATTCGTAAAC 
CGTCAATGGT 
TCCGGCACGC 
TCTTTTCGAT 
AATGGATGTC 



480 
540 
600 
660 
720 
780 
840 
900 
936 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1060 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



1 rP*"' ^ -ir^i ™^ M-Ht »f»'ii 

3 -^^i! ™ii -iQi iiz; c:h o m. u-ji in::: a iUiXJ 



WO 99/29870 PCT/AU98rtll023 

115/490 



ATAGrGCAc"^ ^^'^''^^^^ CTACATCCTT AACGTGGATG GATATACGAC GAAAATAAAT 

5 (2) INFORIIATICM FOR SEQ ID NO: 169 

(i) SliQUENCE CHARACTERISTICS: 

(A) LENGTH: 870 base pairs: 
<BI TYPE: nucleic acid 
iC) STRAW DEWJESS: double 
(D) TOPOLOGY; circular 

(ii) MOLECULE TYPE: DtlA (genomic) 

15 (lii) HYPOTHETICAL: MO 

(1v) AMTI -SENSE: NO 



20 



25 



60 



65 



(vij ORIGINAL SOURCE: 

I A) ORGAIIISM: PORYPHYROllOfJAS GIUGIVALIS 

(Ix) FEATURE: 

(A) NAI'!E/KFY: misc feature 

(B) LOCATION 1...870 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169 



^IS?^^^^ TATTTCTCTC GCTCACGAGT CTTGTAATGG TCTTCGCTGT TCCAAGTTGC 
ACAAGGATCA AACCCTCTTG CCGGCTCCGA CCAATGTGAC ACCCGATAAT 
OU CCGGAiGACA ATCCTTCGGA GATCGACATT ACGCAGACGC ACACAGAftAA ATATGTTTTG 
GCTGAAGAAT TTACCGGCCA AAAATGTCTC AACTGTCCGA AaSg^S^^S S^^J^G 
S^HIS^i??? AGCAATACGG TAAGAGATTG ACT6TTGTCG GTATACATGC CGCCCCTGGA 
IJ^^fl;;?™ CACCTCT-ITT CCGTACAGAA GCCGGAGACG CATATTATAG CAAGTTCGCC 
i^^^^I^?^ CTCTCCCTGC GCTGATGGTT TCGCGCAAAA AGTTCGGCTC TTCCTACCTT 
InJ^^^^S^ GCTACAAAAC GTGGGACGTG CCTATTGCCG AGCAGATGGA CCAAAAG^G 
TCTTTGCCGT GGCCGAATAC ACCGATACCC AAAAGATCAA GGTGACTGTA 
TACTGGAGGG GAATACACTC CCGAAGTCCA T.3GT1'CAGGT GTATCTGTTG 
^S^^^.'^S TGATCGCTCC GCAGGTGGAT GGCAATACGA CAGTCGAGAA ITACGAGCAC 
An ^^^V'Z TGCGTGGAGC CGTTAATGGT ATTTGGGGCG AAGAATTTGT GAATCTCAAA 
ATACTTACGC CGTTGAACCG CTCTCGGGTA TGTCCTTCGT ACCCGAGAAT 
TATTCGATTG TGGCTl'TTGT ATACGATGTG CAGACGTTCG AAGTGTATGA CGTTGTGCAT 
GTAAAGATCA ATCCGCAATC CGATGGCAAA 



45 (2) TIIFORI-IATIOM FOR SEQ ID NO:170 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 669 base pairs 
-p, (B) TYPE: nucleic acid 

(C) STRAMDEDNESS: double 

(D) TOPOLOGY: circular 

(lit MOLECULE TYPE: DNA (genomic) 
55 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 



Ivi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRCtlONAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: misc featur- 
fB) LOCATIOJ.I 1,..669 



2520 
2529 



<xl) SEQUEtlCE DESCRIPTION: SEQ ID NO:no 

nV^^Z JSi^GTCTAGT AGCCTCAGTT TTGGCCGTGG CTCTCGTGTT CGCCGGTTGC 60 

70 n^n^'^n^ ATATGGCAAA AGGCGGCCTT ATCGGCGCCG GAGTAGGAGG TGCCATTGGT 120 

70 GCCGGACTAG GTAACGTAGC CGGAAATACG GCTGTCGGTG CCATCGTCGG TAC^GCAGTC 180 

GcS^^C rr^S^^r AAGATGGACA AGcI^^ IgZ^cT^c L o 

TT^Sfr^S 5^^^^"^*=^^^ GATl'CAGACA GTAAATGACG GAGAGGCTAT TCTGGTTACT 30n 

gc^gJ^a tgcgacgaac tccagcactc tgaotcccaa CTCACGCAcr Is'o 
75 G^^A^f^ n^^^SZ'''' aaacatgaac aaaaaccccg acacggatat tcgtatcgta 
/3 ggcci<taccu acaataccgg ctccgacaag atcaacgatc ctctgtctga gagacgtgca 



420 
480 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



lUii r^i „iL li::-; o o lu JL.iS Oi ip:: io* ^lU'-W 



WO 99/29870 

116/490 

.^^^^^^r ?cS? I^^^^-? -0 

5 

(21 INFORMATION FOR SEQ ID NO:17l 

lO iJtQtiEHCE: CHARACTERISTICS- 

(A) LEHGTH: 1011 base pairs 
(B» TYPE: nucleic acid 
iC) 5TRA/4DEDNESS : doubl- 
ID> TOPOLOGY: circular 

15 (ii) MOLECULR TYPE: DIM (genomic) 

(Hi) HYPOTHETICAL: NO 

(iv) AIITI-SEIISE: HO 



20 



(Vi) ORIGINAL SOURCE: 

(A) ORGANISIi: PORYPHYROf^ONAS GINGIVALIS 



2c FEATURE: 

tA) NAIIE/KEV: mlsc feature 
( B ) LOCATIOI I 1 . . . 1 01 1 



50 



(2) INFOWIATION FOR SEQ ID NO: 172 



(i) SEQUE1^ICE CHARACTERISTICS: 

<A) LENGTH: 453 base pairs 
(BJ TYPE: nucleic acid 
55 STRAf^DEDTiESS : double 

<D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DtUK (genomic) 

gQ HYPOTHETICAL: NO 

(iv) AIJTT-SENSE: tiO 

(vl ) ORIGINAL SOC^RCE: 
65 ORGANIStl: PORYPHYRd-IOflAS GINGIVALIS 

(ix) FEATURE: 



70 



(A) NAHE/KEY: misc featur- 
(D) LOCATION 1. . .i53 



PCT/AU98A)I023 

540 
600 
66U 
669 



(xij SEWENCE DESCRIPTION: SEQ ID 110:171 

CGTACAAAGC CCTCGGACTC TGT^Sgtac aSt^^IS^ ACAGCAAAGC AAAGACCGAA 120 

TTCCGCAACA AGAl-lTcSc O^S^^g ^I^^^? 12^^^"^*^ TCCCCTGGAG 180 

^. GCGATGGAAG CACTCAATTA ^CTGcSSf Si^S^S^ CCGAATATGA AAATCGGCTG 24 0 

35 , TATGTAAACC CTTTCGTGGG ^S^^CC ^?^ci?In SI^'^"'^ TTGGAGCGAG 300 

GATTGCTCTT CGTTCGTGA? GCrcGT^AA TTCCGAACTC CTATGACAIT 360 

CGTCGGCGTT TCGGACGGAT rrnr?!^^^ GATAAGCAGG TCACCTCTCA ATTTGGCTAC 420 

- ATACGAGCAG 'c^^^ G^^?^? G^JcG^^^^^ ^^^If^^^" TGGCGATACG tlo 

TACTACATAG TCTTGCGCCA TCCGAAr^A S^^nf??^?^ ATGAAGCGCG TGGCTATGGC 54 0 

40 -CAAITGGTAG ACGAGAa5^A GA?^S?5^^ 600 

ACGGGTCGAA GCACCGgJcS ^^5^^ ^G^^ ^2^1^^^^^ AGGAGGCAGC tic 

-AATCCGAGTA CCATTATAGA C^TC^S^ J^^^^^^^ GCTTCATGGG TATTCCCATC 720 

AAACGAGGC^ G^TC^^ S^T^S^ G^^r^ l^^'^'' ^^^^ACATTC 78^ 

^. AAAGGGAAGA AAGGCAGACA AgStcttPt rSlS^^™ CTTCTTCTCG CTATGCAAAA 940 

45 AC^ITTGGAAA CAATAGCC^ J^^Sc aS^Jc AG^^J^ AAAAGGCGAT 900 

GGCATTGGCA AGAGTAAAAT TTTGAC^^CG ^^^^ ^G^S^ ^^^ACCAAT ^960 



60 



(xi) SEQUENCE DESCRIPTIOtJ: SEQ ID HO: 172 

I^S^^^ S^f?^-^ CTCGGATCAG AAaAC«..TC 
TATCTTTTCr GTGATGACGA AM^I^^r AAGCGGTGGG A6AAATCTCC 120 

75 rACTACACCO AC..CA?^^^ ^^^^^ ^AATC AO.AA„^ COA^CA,^,, .SO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



■ii^'h -iF»ti iP-'h «n -n^'h -rtJ' iF'^) ^1 fi^Hi' -ri^ni 

U n,'3i Os 'Cai >a' U O ^ HJ •iL„ii 



WO 99/29870 PCT/AU98/0ia23 

117/490 



CTTATCAGTC TCGATACCGT ACGCTCGAAT GCXCGTGCTC T1*GATCTTCG ATACGAAGAC 300 

GAACTGCATC GTGTCATTAT CCACGGCATA CTGCATCTTT GCGGATTGAA AGACAAGAGC 360 

AAAAAGGATG AAGCCCAAAT GCGTGCAGCC GAAGAGAAAG CCCTTGTCAT GCTGCGAGAA 420 

ACCATCGGAT CGiSAGCTTTC CCTATTGCAT ACA 453 



{2} IHFORIHATIOIt FOR SZQ ID HO: 173 

(i) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 1173 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDIIESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

(iil) HYPOTHETICAL: HO 

(Iv) Arm -SENSE: MO 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAHISH: PORYPHYRmOMAS GlNGrVALIS 

(ix) E*EATURE; 

(A) NAJIE/KEY: misc_featur9 

(B) LOCATION 1...1173 

(xi) SEQUENCE DESCRIPTIOH: SEQ ID MO: 173 

ATGAACGTAA AGTACi'TAAT GCTCACATTG GTTGGAGCAA TTGCACTGAA CGCAAGTGCA 60 

CAGGAGAATA CTCTACCGGC AACGGGTCAG TTACCCGCTA AGAATGTTGC TTTTGCTCGC 120 

AATAAAGCAG GCAGCAATTG GTTTGTAACA CTGCAAGGCG GTGTTGCAGC GCAGTTCCTC 180 

AATGACAACA ACAACAAAGA CCTCATGGAC CGCTTAGGAG CCATAGGTTC TCTTTCTGTC 2 40 

GGAAAGTATC ACAGCCCTTT CTTTGCAACT CGTTTGCAAA TTAACGGAGG TCAAGCCCAC 300 

ACTTTCCTCG GAAAAAATGG CGAACAAGAA ATCAACACCA ATTTTGGTGC AGCTCACTTC 360 

GACTTTATGT TTGATGTGGT TAACTACTTT GCACCATATC GCGAAAATCG TTTCTTCCAT 420 

TTAATTCCAT GGGTAGGTGT TGGCTACCAA CACAAATTCA TCGGTAGCGA ATGGAGCAAA 4 80 

GACAATGTGG AATCACTGAC GGCGAATGTA GGAGTTATGA TGGCTTTCAG ATTAGGAAAC. 54 0 

CGAGTAGACT TTGTGATCGA AGCACAAGCA GCTCACTCCA ATCTCAATCT AAGTCGCGCA 600 

TACAATGCCA AGAAAACTCG CGTATTCGAA GATCCCGCAG GACCTTATTA CAATGGATTC 660 

CAGGGGATGG CTACAGCAGG TCTTAATTTC CGCCTGGGAG CCGTAGGCTT CAATGCCATT 720 

GAirCCAATGG ACTACGCACT TATCAATGAT CTGAATGGTC AGATTAACCG TTTGCGCAGC 780 

GAGGTCGAAG AACTCTCAAA ACGTCCTGTA TCATGCCCCG AATGTCCTGA AGTAACTCCT 84 0 

GTTACTAAGA CAGAAAATAT ACTGACGGAA AAAGCTGTAC TGTTCCGTTT CGACAGCCAC 900 

GTTGTGGACA AAGATCAATT CATCAACCTG TATGACGTAG CTCAGTTTGT AAAAGAAACT 960 

AACGAGCCGA TTACCGTTGT TGGTTATGCl' GATCCTACGG GTAATACTCA ATACAACGAG 1020 

AAATTGTCTG AGCGTCGGGC TAAAGCCGTT GTTGATGTTC TGACAGGTAA ATATGGTCTG 1080 

CCTTCCGAAT TAATCTCTGT AGAATGGAAG GGCGACTCTA CGCAACCGTT CAGCAAGAAA 1140 

GCTTGGAATC '3TGTTGTAAT CGTTCGCTCC AAG 1173 



(2) IHFORMATIOH FOR SEQ ID NO: 174 

(1) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 1155 base pairs 

(B) TYPE: nucleic acid 
(CJ STRAJIDEDNESS: double 
(Dl TOPOLOGY: circular 

(11) MOLECULE TYPE: Dt4A (genomic) 

(iii) HYPOTHETICAL: MO 

(iv) AMTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIJAS GINGIVALIS 

tlx) FEATURE: 

(A) NAIIE/KEY: misc_faatura 

(B) LOCATIOM 1. . .1155 

(;:i) SEQUENCE DESCRIPTION: SEQ ID HO: 174 
ATGACATACA GAATTATGAA AGCTAAATCT TTATTATTAG CACTTGCGCG TCTCGCAT*^- 60 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



iP^h lP-"3l if™ «il J,*: qKI}. ..t** -^il tPiq, .tnj}, 
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118/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



ACATTCAGTG 
TTCCAACGTG 
GCTCTCTCGG 
TTCGGTATCG 
GACRTCTATG 
CACCTTGACT 
TTCCATATCA 
AATCGTGAAA 
CTGAAATTCC 
AAGATGAACT 
GGTCTAACGT 
TTGGTCAATG 
CGTCGTCCTG 
GTCGTTGACA 
ATCAATGTTT 
G-STTACGCTG 
GCAAAAGCGG 
GAATGCAAGG 
ATGACTGCAG 



CAACAGCCCA 
ATAAGCCCTC 
GATGGAATAA 
GTAAATGGCA 
GATTCCCGCA 
TCATGTTCGA 
TCCCATGGGC 
AAGTAGGAAG 
GCCTATCAAG 
TTATCGGGAC 
TCAACCTTGG 
ACCTGAACAA 
TTTCATGCCC 
ATGTGGTTTA 
ACAATACAGC 
ACGAAAAAAC 
TAGCCAAGAT 
GCTCATCAGA 
CGGAA 



AGAAGCTACT 
CGATCATTGG 
TGATGTAGAC 
TGAGCCTTAT 
AGGGAGC7\AG 
TCTGACGAAC 
AGGTATAGGA 
TAAAGATGAT 
AGTCGTAGAC 
AAAGAGAGGA 
CAAGACAGAG 
CCAAATCAAC 
TGAATGCCCT 
CTTCCGTATC 
TGAATATGCG 
CGGTACTGCG 
GCTTGAAAAG 
GCAAATCTAT 



AGACAGAACA 

TTCATTGACA 

TTTGTAGATC 

TTCGGTACTC 

GAGCGTAACC 

TATTTCGGTG 

TTTGGTTATA 

ATGACCGGAA 

TTCAATATTG 

AAAGCAGACT 

TGGACA6AAA 

TCACTTCGCG 

GAGCCTACAC 

AATAGTGCAA 

AAGACCAACA 

GCCTATAACA 

TATGGTGTTT 

GAAGAGAACG 



AAGCAGGGAT 
TTGCAGGTGG 
GTCTAAGTAT 
GTCTCCAATT 
ACAATTACTT 
TATACCGTCC 
AATTCGATAG 
CAGTTAATGT 
AAGtiACAAGC 
TCCCl'GTAAT 
TTGTTCCTAT 
GTCAAGTGGA 
AGCCTACAGT 
AGATTGATCG 
ACGCACCGAT 
TGAAGCTTTC 
CTGCGGATCG 
CTTGGAATCG 



GCACACCGCA 
AGCAGGTATG 
CGTl^CCTACT 

cacaggattc 
tggaaacgcc 
caatcgtgtc 
cx;aaaacgcc 
cggtttgatg 

TTTTGCCGGA 
GGCTACAGCA 
GGACTATGCT 
AGAGTTGAGC 
TACTCGrOTA 
TAATCAAGAA 
CAAGGTAGTA 
AGAGCGTCGT 
CATTACAATT 
TATTGTAGTA 



(2) INFORMATION FOR SEQ ID NO:17S 

(i) SEQlfENCE CHARACTERISTICS: 

<A) LENGTH: 57 0 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDMEi'S : double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

(iiij HYPOTHETICAL: NO 

(Iv) Aim -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAiaSH: PORYPHYRa-lONAS GINGIVALIS 

iix) FEATURE: 

(A) NAME/KEY: mlsc feature 

(B) LOCATION 1...S70 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 175 

CATAGCGGCG GTTTTCGTTA ATAACGTCGT 
^T^^^^ TATGCCCATT CTTAGGCGTA TCGAAGAAGG TAGACACCTC 

TGACATTCGT ATTGGCACTG GCTACCTTGG TTACCTTCCT 
IJ^^JZl^ ATCGTTTCGG ATTGGGCTTT ATGCAGACCA TTGCATTTAT 

IJSS^^^^-*^ GGAGATCATA CTCAAGAAAG TATCTCCTCC 
rrrJ^^^ TATTCTTGCC CTTGATTACG ACGAACTGCT GTGTGCTCGG 
II55IIJ^IE? AGAAGGATTA TACCCTGCTC CAGAGCTTCG TCTATGCAAT 
^ISf-^IS^ CCTTGGCAAT GGTTACTTTC GCAGGTATTC GAGAGCAACT 
J?^-^?^^ AAGCTATGAA GGGAATACCT TC6GCACTCT TGGCTGCC^G 
ATGGCTTTCA TGGGCTTCAG CGGTATCGCC ^^^^^'^ 



GCTGTCGCAG 
AATCGGTATG 
GATTCAGAAG 
TTTGGTCATT 
CCTCTATCAG 
TGTGGCTATT 
ATCCACGGCT 
CGATATGACC 
TATATTGGCT 



120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
11S5 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
570 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID NO: 17 6 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS : double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: UNA (genomic) 

(liij HYPOTHETICAL: NO 

(iv) AliTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANIS!^: PORYPHYRaiOIIAS GINGIVALIS 

(ix) FEATURE: 

(A) IIAl IE/KEY: roi-c feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



^r:?^i £yi lir; iCji U3 m . ij^i o iCh tCs ujii 



WO 99/29S70 



PCT/AU9SA>1023 



119/490 



10 



15 



(XI ) 



(B) LOCATION 1. . .558 
SEQUENCE DESCRIPTION: SEQ ID NO: 17 6 



ATGTTATTCA TAGCGGCGGT TTTCGTTAAT AACGTCGTGC TGTCGCAGTT 
12^221'^*^^* TAGGCGTATC GAAGAAGGTA GACACCTCAA TCGGTATGGG 
ACATTCGTAT TGGCACTGGC TACCTTGGTT ACCTTCCTGA TTCAGAAGTT 
CGTTTCGGAT TGGGCTTTAT GCAGACCATT GCATTTATTT TGGTCATrGC 
CAGATGGTGG AGATCATACT CAAGAAAGTA TCTCCTCCCC TCTATCAGGC 
IISI™'^ TGATTACGAC GAACTGCTGT GTGCTCGGTG TGGCTATTTT 
AAGGATTATA CCCTGCTCCA GAGCTTCGTC TATGCAATAT CCACGGCTAT 
TTGGCAATGG TTACTTTCGC AGGTATTCGA GAGCAACTCG ATATGACCAA 
GCTATGAAGG GAATACCTTC GGCACTCTTG GCTGCCGGTA TATTGGCTAT 
GGCTTCAGCG GTATCGCC 



CCTCGGTATA 
TGCAGCCGTG 
CGTTTTGGAT 
CGCCTTGGTG 
ACTGGGTGTA 
GGTTATCCAG 
CGGTTTCACC 
TCTCCCCAAA 
GGCTTTCATG 



60 
X20 
180 
240 
300 
360 
420 
480 
540 
558 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(2) IHFORHATTOH FOR SEQ ID NO: 177 

U) SaOLiEHCE CHARACTERISTICS: 

(A) LENGTH; 24 99 base pairs 

(B) TYPE: nucleic acid 

(C) StRAT^DEDNESS : double 
(DJ TOPOLOGY: circular 

(ii) HOLECULE TYPE: DNA (genomic) 

(lit) HYPOTHETICAL: HO 

(iv) Al IT I -SENSE: HO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISIl: PORY PHYROMON AS GINGIVALIS 

{ix) Ir-EATURE: 

(A) NAUE/KEY: mlsc_feature 

(B) LOCATION 1...2499 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 177 



ATGAAACAAC 
TCGGCTCAGC 
CCTATCGCAT 
AACGGAAACT 
ATGGGTTACT 
GACCTCTATC 
AACGAGACTT 
CTCAAAACCA 
GAGGACAACT 
TATTCGCAAA 
TTGGAGCAGA 
GCTCTGTTCG 
CGCAATTCGG 
AGCTTCCAAA 
GTCATGGTAT 
ACCGAACTAC 
CTCTACAGCA 
AGACTGGACA 
GGCGGAAGTT 
GCTTCAGCAC 
CTGCTGAACG 
ACGGCTCTTA 
CGTCATACCT 
CTCGATGACA 
CAGTACGACC 
GACTATGTTC 
CCTAGAGCCA 
GAAGGATTCC 
ACTCCTATCA 
GCTTCTTTCG 
TTCTCCACCT 
GGCAAAGAAT 
GGTGTGAATC 
ACATGGCAQA 
GGACAAGCCG 
GTTGCTACGG 



TAAACATTAT 
AATCGGGCGG 
ACGTACAAGT 
ACTCGATCAA 
CCACTTGCGA 
TGACCGAAGA 
TCCGCCGTCA 
ACTCTACCAA 
Gl'CAGAACTG 
TTCTTATCGA 
TGCCTGCCAA 
GCTCTAATGC 
CCGAGATCAG 
ATACGACCCA 
TCGGCCAACA 
CCAATCTGCG 
AAGCAACCCT 
ATCCTCCTTT 
TCAAATTCGA 
AAGACGTTCA 
GAGCAGTTCA 
CTTCCTACGG 
TCGGAGAAAA 
GAAGCGGCTA 
AGCTCGAATA 
TCCTCAATCA 
ACGTACGATA 
GCGCTCCTCA 
GCCGTGTCCT 
ATTATTACCA 
TTATCAGCAA 
GGATCATTCG 
TGGAGGGAAG 
GAAGCCGCTA 
AGATCTCTGT 
TACGTCCTAC 



CAGCTTCATC 
ATCCGTTACA 
ATTCGTCAAA 
GGGAATCCCT 
AGAAAAAGTA 
GATTCTCTCT 
AGCACCCTCG 
CCTGAGTCAG 
CGGTTTCAAC 
CAGCCATCCC 
TATGATCGAA 
TGTGGGAGGC 
CCATTCTACG 
GTTCAACGGT 
CAACTACCGT 
CAACCGCTCG 
CGAATATCAC 
CGAAGCCCAG 
TCAGGGCTTC 
GCGTCGTAGC 
GAGTGGAAfiC 
GACTACCAAG 
CTGGGACTTT 
CAGACCGAGC 
TAAGACGGAG 
GGATGGCAAA 
CAATCCCAAT 
GTATTTCGAT 
TTOCCCCAAT 
CAGAGCCGAC 
TCAGTTCAAA 
TACCATCTAC 
AATCGCCTAC 
CGGAAGCATC 
GAAAGACTAT 
CGAGCACTTC 



MTGCTTTCC 
GGTACCGTAG 
GGAACCACTC 
TCGGGTAATC 
CATATAGAAA 
CTCGATGGGG 
TTGGTAACGG 
GGACTTAAGT 
CAAGTTCGTA 
ATCTTCAG'IT 
CGTGTAGAAG 
GTTATCAACG 
ATGACCTTCG 
TCTATGCTGA 
CCCGGACAGG 
CTCGGTTTCC 
AGCATGCAGG 
ATAGCGGAAT 
AGCGGTGGCA 
TACTACGGGG 
ACCGAATCGG 
GGAl-rCGATT 
ACCGGCGGAC 
AAAATAGATC 
AAGTTAAGTG 
CGCTATATCG 
AAGAATCl'CA 
GAAGATCTGC 
CTGAAAGAAG 
GAATGGCAAT 
CCATCCGATA 
AACGACAAGG 
AACAT^TCGT 
TATACCGCTG 
GTACGCACTC 
GCCATCAATC 



TATTCTTAGG 
TGGACAAAAG 
TCGGAACTTC 
AAACTATCGT 
AGGGTGGTTC 
TAGTGGTATC 
TACTGTCGCC 
TCCAGCCCGG 
TCAATGGACT 
CGCTTGCCGG 
TAATTCGCGG 
TAATTACGAA 
ACCACGCGAA 
CGGAAGACCG 
ATATAGACGG 
GCTCATACTA 
AGTACCGTCG 
ATCTCCAGCA 
AGGATTTCTT 
GTGGCGACTA 
ACGAATACAA 
TGCAAGGAGG 
TCGAATATAT 
AGAATACCTC 
CCCTTATCGG 
ATCCGCTCTT 
GCTTCCGACT 
ACGTAGAGTT 
AACGTTCACG 
TCAATATCAT 
AGGTCGAAAC 
ATGGAGTATC 
TCGACCTCCA 
TGGAAGCGGA 
CGAATCTGTA 
TCTCCGGTAC 



AACGAGCGCA 
CTCAAAAGAA 
CACGGATGCA 
AGCCCGACTC 
CCGCCACGTA 
TGCCAATAGA 
GGAACTTTTC 
TCTGCGCGTG 
CGAAGGAGCC 
TGTCTATGGC 
TGGAGGTTCG 
AGAACCGCTT 
AGGGTGGGGG 
CAAAGCCGGT 
CGACAACTTT 
TAAGACCGGT 
TGGTGGCGAC 
CTATATCAAT 
CAGTCTGTAT 
TACCGAAAAT 
CGATGCTTTC 
AGGTATGTAC 
CTACGGCCAA 
TACTTTTAGT 
AGCACGTATC 
CATTTTCAGT 
CTCATACAGC 
GGCCGGTGGT 
AAGCATCAGT 
GGGAGAAGCC 
CACGAGCGAT 
GAAGGTATAT 
GCTCGGCGG7 
CAAAACAACG 
CGGCTATTTC 
ATTCACGGCC 



60 
120 
180 
240 
300 
360 
4;>0 
480 
64 0 
SOO 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
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wo 99/29870 



PCT/AU98/01023 



120/490 



AAAATGGATG 
GACGGATCGT 
GCCAACiCTCG 
CACCTTGCl'T 
TATCAGAAAG 
CAGCCCAGAA 



TAGTACACGA 
TCGACTTTGA 
TCAAGACTCX: 
CCACTATGAC 
ACACGGACAA 
GGATTTTCGT 



AGCCTATGAA 
AATGAATGGT 
GGCCTTCGCC 
CTTGGAATTG 
GGGACCGGGT 
CGGTACAAAS 



GGCOATATTC 
CAGCAATTCA 
GATATAGACC 
AATGCCGGAA 
AGAGCTTCTA 
ATCAATTTC 



CCCCAGAACA CATAGCTCCG 
AAGGTTTGGC CGAAGGTCAT 
TCAAGCTGAG CCACGACTTC 
TACAGAACAT ATTCAACAGC 
CTTACGTATA CGGTCCTATG 



2220 
2280 
2340 
2400 
2460 
2499 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



12) INFORMATION FOR SEQ ID NO: 178 

(1) 5EQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 2673 baae pairs 
(B! TYPE: nucleic acid 
(C) STRAHDEOdESS : double 
ID) TOPOTOGV: circular 

|ii) MOLECULE TYPE: DMA {genomic) 

(iii) HYPOTHETICAL: NO 

tivl ANTI-SENS'E: UO 

tvi) ORIGIMAL SOtJRCE: 

(A) ORGAHISll: P0RYPHYRa4011AS GIMGIVALIS 

(ix) FEATURE: 

(A) nrnK/KEY: misc_feacure 

(B) LOCATION 1. . .2673 

(Xi) SEQUENCE DESCRIPTIOtl: SEQ ID NO: 17(1 



ATGTACAAAA 
GGACAGTCCT 
AAGCCTATTC 
TTTGTCTTGC 
GCCATGTCTG 
ATCGCGGATA 
CGCATCAGTA 
AAAATCGGTC 
ATCGTACAGA 
GAACCGGATC 
AAAACCAAGG 
AGAATGGCGA 
TTCTTGAAAC 
CGATTGATAG 
GTCGTGAAGG 
AAGTATTATA 
GAACGAG'J'GC 
CTCAATGAAG 
TGGGTCGATC 
GCGGAGGGGA 
CAAGACGTAG 
ATCATTAACT 
CCCCGTCCGA 
CGTAGCAGTG 
GGAGCCATTA 
AAAGGGATCA 
AAGTACTATC 
GATATGTTCA 
AATAGCAATG 
AGTTATTACA 
GATCGTTCGC 
GACAATTGGT 
AGCTACAATA 
CGTCTCrCTC 
GTTTCTGTTG 
AACCTCAGCG 
CGAGTTTTTA 
CGAGTGGAAG 
TTCTATATGG 
GGTTTGCGTG 
TATATGCGGC 
TGGCTCTTAG 
TTTAACCTGA 
CTCGGTATCG 



AGATTATTGC 
CGGATCTGAC 
ATAAGACCAT 
GCAATCTTTC 
CTGCCGT6AA 
AATATGTCGG 
AGGTTACTTT 
TTCGCGAGGG 
AGTATTTTAC 
TTTCCAAAGA 
TGAATGAAAT 
TG/UVGAACAC 
TTTTTAGTAC 
AGAAGTATCA 
CTCCTGACGG 
TTAAGGATGT 
TCGGAATAAA 
ATGAAGATGC 
CCGTGGAAAC 
AGCAGGCCAA 
TACGCCGAGA 
CTATTCGTCT 
TTCCCAATCC 
ACCAATTGGA 
AGTTCACGAA 
TTCCGCAAGG 
AGCAGTATAG 
GCTTCAGTGC 
CCGGCAACTA 
ACGGTATGTC 
T'l'CAGATGTT 
TCCAGATTTA 
CCTTCCAAAA 
GTACTTCCAT 
CTGCTACTCT 
TAAGCGATCG 
CTCCAITGCT 
GAGCAGTTCT 
GAGGTGATGC 
GTTATAAGAA 
TTACGATGGA 
CTTTTGCCGA 
AGCGATCGGC 
ATTG<3GGATA 



CGTAGCAGCT 
CCCTCAGGAT 
AGCATCTATT 
AGGCTTGGCT 
TAGAATTATG 
CAATAAA6TC 
TAGC6GGGTA 
GATTCAGATG 
TGAGAAAGGT 
TGGCTTTGTC 
TTATTTTTCC 
CAATGCCAAA 
TCATAAGTTT 
GGAATATGGA 
CAAAAGAGTG 
CAACTTTGTG 
ATCCGGAGAT 
TGTGGGGAAC 
AAATGTAGTG 
TATCAATAAG 
GC^-rTACACA 
CATCAATCAG 
CGAAACAGGA 
GCTTTCTGTC 
crrCTCTGTC 
GGA7GGGCAA 
TGTCACATTT 
ATTCTATTCC 
CTATAATGCC 
GAACTATACC 
AGGTACTTCG 
TACTTCTCTG 
TTTCCATCAT 
CGATAATCCT 
TCCTTATTCT 
TTACAGATTT 
CAATCCTCCT 
TGGTTCGTAT 
TATGTCCAGC 
CGGATCTArr 
ACTACGTTTC 
AGGAGGCAAT 
CGGTGTAGGA 
TGGCTTTGAC 



CTCTTCTGCG 
ACTATATATA 
GAGATCGAGG 
GTAGGTGATG 
CGTCAGGGCT 
TATCTGAAAA 
AAGAAGTCTG 
ACCAGAAATA 
TATCGCGATG 
AATGTGCTTA 
GGCAACAAGG 
TTCAGTCTTA 
GTGGAAGAGA 
TATCGTGATG 
GATATTTATC 
CGCAATTCAC 
GTGTACAATC 
CTGTACTATA 
GGGGATTCTG 
CTGATCATCA 
AAGCCCGGCC 
CTTGGGCATT 
ACAGTGGATA 
GGTTGGAGTC 
GGCAACTTGC 
ACACTATCAC 
ATGGATCCAT 
AAGACTACGG 
TACTATAATA 
GGCGACCTCT 
ATCGGTTAOG 
AACTACACCT 
GGCTCGGCTA 
ATTTATACCA 
TTGTGGGACA 
ATCGAGTATC 
ACGCATAAAT 
AATTCCAATA 
TATTATGGTG 
GCCGGTAATA 
CCGATTCTGT 
GCGTGGCGCA 
TTGCGTGTAA 
CGTCCGGACA 



CCAGCATAGG 
GCCCTGAAAT 
GAATGACGTC 
AAGTCCTGAT 
ACTTCTCAAA 
TCATTGTCAC 
AGAGAGAAGA 
ATGAAGACAA 
CCAGCATAGG 
TCTCGATTGA 
CCCTTAGCAA 
GAAAGCATAT 
GCTACCGTGA 
CTGAAAT7\CT 
TCAACATCGA 
AATATCCATC 
AGAGACGATT 
ACAATGGCTA 
TTTCGCTTGA 
AAGGAAATAC 
AGCTCTTTAG 
TCGATGCCGA 
TAGAGTATGA 
AGTCCGGACT 
TCCATCCCTC 
TGAGTGCTCA 
GGTTTGGGGG 
CGATTGACTC 
GCTACTACAA 
ATACTCAGGC 
GTAAGCGriT 
ACTATAGACT 
ATGATCTCAA 
GAAGCGGATC 
ATCA'l'GACTA 
ACAAGTGGAA 
ATACACCCGT 
AGAAATCTCC 
GCT'ACATGAA 
ACTACGACTA 
TTGAAAACTC 
GTATCGACAA 
CGTTACCGAT 
ATTCTCTACA 



GATCCTGAAA 
ATCCTATGCC 
TTTCGATGAC 
TCCTGGAGAT 
TGTGCGAATC 
TGAACGTCCT 
TCTTGAAATG 
GGTCAGGCAA 
GATAACGCAG 
GAAGAAAAGC 
TCATAAGCTA 
TCGCTCATCT 
AGATTTGGTC 
GACCGACAGT 
AGASGGGCAG 
GGAGTATrrG 
GGCTAAGCGT 
TATTTTTGCG 
TATTCGTATA 
TGTCGTGTAC 
TCGC6AGGAT 
AAAATCTATT 
ITTGCTGCCG 
TCTGTTCCGA 
GATGTATAAG 
GACCAATGGA 
CAAGC6GCCG 
CAAGTTCTAC 
CAACTATAAT 
CAGCGATCCG 
6ACTTGGCCG 
GCGAAATTGG 
CrTGGAGCTG 
GGATTTCATG 
TGCCAGCCAG 
GTTTAGAGGA 
GCTCATGAGT 
TTTCGGTACT 
TGAGACTATA 
TGCATATGCT 
ATTCAATGCG 
TTATAATCCC 
GGTCGGAATG 
GCGAGGAGGA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
€60 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
264 0 
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121/490 



(2) lUFORI-lATIOlJ FOR SEQ ID NO: 181 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 baye pairs 



2673 



AGCAATGTCC ACTTTGTGCT CGGACAGGAG TTC 

(2) INFORMATION TOR SEQ ID NO: 17 9 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucl-lc acid 

(C) STRAJJDEDTiESS: double 
(U) TOPOLOGTt-: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iJl) Hi POTHETICAL: NO 

tiv} AllTI-SENSE: NO 

(V1) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROf-IONAS GIHGI'/AF^IS 

(ix) fEATURE: 

(A) NAI IE/KEY; raisc_f eature 

(B) LOCATION 1. . .522 

(Xil SEQUENCE DESCRIPTTaM: SEQ ID NO:179 

ATGAATGGCG ATATGAAACG GmTTGATT TTGATCGGCT TTGCACTGGC GGTAGCTTTC 60 

TCCGGTTTTT CCCAAAAGTT CGCTTTGGTA GATATGGAAT ATATCCTCAG GAATATTCCT 120 

GACTATGAGA TGATGAACGA ACAGCTGGAA CAGGTGTCCA AGAAATGGCA AAATGAAATC 180 

GAAGCTCTCG AAAATGAAGC CCAATCTATC TATAAGAAGT ATCAGAGCGA TCTCGTATTC 240 

TTGTCTGCTG CACAGAAGAA AACCCAAGAA GAGGCTATCG TAAAGAAAGA GCAGCAAGCA 300 

TCCGAGCTCA AGCGGAAGTA TTTCGGCCCG GAGGGGGAGC TGTATAAGAA ACGCTCCGAT 360 

CTGATGAAGC CTATTCACGA TGAGATTTGG AATGCTATCA AAGAGATTGC CAAGCGTAAC 420 
AACTATCAGA TGGTGCTTGA TAGAGGTACG TCCGGAATTA TCTTTGCCAG TCCGTCTATT 
GACATTAGCG ACCTTGTACT GAGCAAGATG GCCTTTAGCA AG 



460 

522 



(2) INEORJIATION FOR SEQ ID NO: 180 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SIO base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa^OHAS GINGIVTM.TS 

(Ix) FEATURE: 

(A) NAl-IE/KEY: miec feature 

(B) LOCATION 1...5i0 

<xi) SEQUENCE DESCRIPTION: SEQ ID WO: 180 

^TGATl'lT GATCGGCTTT GCACTGGCGG TAGCTTTCTC CGGTTTTTCC 60 

^S^^fS CTTTGGTAGA TATGGAATAT ATCCTCAGGA ATATTCCTGA CTATGAGATG 120 

AGCTGGAACA G-JTGTCCAAG AAATGGCAAA ATGAAATCGA AGCTCTCGAA 180 

AATCTATGTA TAAGAAGTAT CAGAGCGATC TCGTATTCTT GTCTGCTGCA 240 

S^S^ CCCAAGAAGA GGCTATCGTA AAGAAAGAGC AGCAAGCATC C6AGCTCAAG ^00 

CGGAAGTATT TCGGCCCGGA GGGGGAGCTG TATAAGAAAC GCTCCGATCT GATGAAGCCT 360 

^11?^^'^'^ AGATTTGGAA TGCTATCAAA GAGATTGCCA AGCGTAACAA CTATCAGATG 420 

^^SS"^^ GAGGTACGTC CGGAATTATC TTTGCCAGTC CGTCTATTGA CATTAGCGAC 480 

CTTGTACTGA GCAAGATGGG CTTTAGCAAG ^ou 



510 



SUBSnrUTE sheet (Rule 26) (RO/AU) 
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122/490 

(B) TYPE; nucielc acid 
fC) STRAI^IDEDNESS: double 
(D) TOPOLOGY: circular 

(ii) r^LECUl£ TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

llv) ANTI-SEMSE: HO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROtlONAS GIUGIVT^LIS 

(ix) FEATURE: 

(A) HAJIE/KEY: misc feature 

(B) LOCATIOlj 1, . .4 89 

(Xi| SEQUENCE DESCRIPTION: SEQ ID 110:181 

ATGAAGAAAT TTTTTCTCAT GCTTCTGATG GCTCTTCCTT TGAGCCTCTT GGCACAAAAG 60 

GTGGCAGTGG TAAACACTGA GGAGATCATT TCCAAAATGC CGGAACAAGT AGCTGCTACC X20 

AAACAGCTCA ACGAATTGGC CGAAAAGTAT CGCCTTGATC TCAAGAGTAT GGACGATGAG IflO 

TTTGCCAAAA AGACAGAAGA ATTTGTAAAG GAAAAAGJKCT CTCTACTGGA GAACATCCGC 24 0 

^l^fl?:'^^ ACCA&3AACT TCAGGATATT CAAACTCGTT ATCAGCAGTC ATACCAAACG 300 

rtTGCAGGAGG ATTTGCAAAA GCGCCAACAA CAGCTTTTTG CTCCTATCCA ACAAAAGGTG 360 

GCTGATGCCA TCAAGAAAGT GGGTGACGAA GAAAACTGTG CCTACATCAT GGAGGCCGGT 4'0 

ATGATGCn-r ACACCGGAGC TACTGCTATT GACTTGACCG CAAAGGTAAA AGCGAAACTC 4 80 
GGAATCAAG 

(2) INFORf>IATION FOR SEQ ID NO: 162 

(1) SEQtlENCE CHARACTERISTICS: 

(A) LENGTH: 2481 base pairs 
IB) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(lii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

<Yl) ORIGINAL SOURCE: 

(A) ORGANISl-l: PORYPIIYROMONAS GINGIVAL IS 

iix) FEATURE : 

(A) NATiE/KEY: misc_feature 

(B) LOCATION l.,,24 01 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182 

CTATTCCCCG AAAGAACAAG TATATAAAGC TCAACGGTAT ATACAGATTG 60 

^IIS^P*^ TGCTATGCTG CCTGCTATGC TCTCAGGCAG CTATGGCACA AGGCGTCAGG 120 

S^J^ISS?*^^ ATGTGCTCGA CCGTGGGGAA AAGCCGATCC CGTTCGCCGG AGTCAAAGTG 180 

GGACAGGCGC AACGACGAAT CTGAAAGGAT ACTACGAGTT TCGGATGAAG 24 0 

^.^^ ACAGCATCAC GATCGAGTTC AGCTCCATGG GGTACCAAGG GGTAACTCGC 300 

^'^GACCAA GGACACTCGG CTGAATGTTC GTTTGGCAGA GGCCGAGATG 360 

f^^^iT?^ GCGTGACGGT ACAGGCCACA AAACGCAGAC TCAACACGAT GGAGCGCGTC 420 

AATACCCbAG ACCTTCGTGT CAATGCAGGG CCTACGCGAG GGGTGGAATC GCTCATCAGT 4 CO 

GAGTAACGCA GAACAATGAA CTAAGCTCGC AATACTCGGT TCGCGGAGGA 54 O 

i™™^T*^ AGAATATGGT CTATGTAAAC GGAGTGGAGG TTTATCGCCC GCTGCTGGTT 600 

Sl^SS*^ AGCAGGAAGG TCTGAGCTTC GTCAATCCGG ATCTGACACA ATCCGTACAG 660 

GAGGGTTCAC GGCCGACTAT GGCGACAAGA TGTCCTCCGT ACTGGATATT 720 

AACCGGAGGA GAAGGAAGGA GCGGTACTCC TCGGGATGCT ACAATCGA'3T 780 

^S^fl^Ji SSSi?^'^*^ CGGAGCCTtC AGCCAAATCA CGGGTCTACG CTACAAGAGT 840 

J?:?S^?TS?S TCTTGGC^C TACGGACACG AAAGCCGAAT ACGATCCGAT CTATGCGGAC 900 

TCATGACGTA CCGTTTCAGC CCCAAGCTGT CGGlTAGrTT CCTCGGCAAT <^€0 

inj^^^^^ JI?^'^^^ GTTTGTCCCT CAGACCCGTG AGACGAGCTT CGGTACACTG 1^20 

ip^^ASSr^ AAAAGTTGAA GATCITTTTC GACGGTCAGG AACAAGAT03 TTTCCTGACC lOBO 

^^^^^ ^Sn*^^^'^ GAACTTCGTG CCGGACGACA AACAGCGGCA TACGGTTACG 1140 

TCAACAGTAA CGAACGGGAG ACCTACGATA TTCAGGGA-^A ATACTTTCTG 
AACGATGTGC AGCTGGGGGC GGACGGAACT GCTTCGATGG CTTCGGGCTC AGAGAACTCC 



1200 
1260 



AACGGCTTGG GCATCGGGCG CAATCACGAG CATGCGCGCA ACAGGCTGAG CTACCGCGTG 1320 



SUBSTmJTE SHEET (Rule 26) (RO/AU) 



^P^H, ip^JJ „n '-"^ ■iF'^i ^ftf- .,1-- ^-1! -tpaj,- ,piiij 

:r3 o it:: o «, u lo c:; iSSi iui li^n 



WO 99/29870 
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CTGAACATGG 
GTATCGGCAC 
GTAGGATACA 
6ATACGCAGA 
ATGGGAGGAG 
AAGGAGTTGC 
TTGGTACTGC 
CAGACGCATA 
CAGGGAC5CTT 
TACAAGTTTA 
GAGAACGTGA 
GATCTCAAGC 
ATAAAAGCCC 
TACAATTTCT 
CTGCGGGCTG 
CCGGCCTTTA 
CACCCGGATG 
ATAGGGGCTG 
TCGGATGCCT 
CTGCGTCTCC 



GTTACAGAGG 
AGATGGAGAA 
ACCTACCTCA 
TGAGGGGAAC 
GTACATTTTC 
TCGTCAGCCC 
GTGCAGCCGC 
AGGATGCCGA 
TTCACATTCT 
CGGCAGAGGC 
AGATCCGCTA 
TCTTCGGCGA 
GTCAGAAACT 
CCl'TCTTCCT 
CACTAAGCGG 
CCGCACCGGC 
ACTCCTTTGC 
ACCTCTTCAA 
ACCAACAGCA 
TCGTCGAATT 



GGAGATGAAG 
AATAGCCGAC 
CTCGGAGACC 
GCGCTTGTCG 
TCTCATTCCG 
ACGTATCAGC 
CGGACTTTAT 
AGGCAATAAC 
CGCAGGAGCA 
TTACTACAAG 
TCTGGGCGAA 
ACTGGTACCC 
GGATGGCTAC 
TCAGGAGTAC 
AGGATTGCCC 
CTATAAGCGT 
CGGCCGAAGC 
TCTGTTCCAC 
ATACGCCGTA 
C 



CTGAACGAGA 
CATATCAGCG 
GTATTGCTGA 
GCATTCGTAC 
GGTATCAGAG 
GTGGGTTATT 
TATCAGGCAC 
GTGGTTGTCC 
GACTATACCT 
AGCCTGTTCA 
AACATCGGTT 
GGAGTGGATT 
GGTTCTTTAC 
GTGCCGGGCA 
CAGCTCAATC 
GTCGATCTGG 
AAATGGCTAA 
ATGACCAACG 
CCGAACTACC 



AGCATCGCCT 
AATGGGAACG 
rCTACAATAA 
AGGATCGATT 
CTTCXSTGGTG 
CTCCCGAAAG 
CGTTTTACAA 
TCAACGAGAA 
TCGAAATGGG 
ACATCAACCC 
CGGGTTATGC 
CGTGGCTGAC 
CACTGATGAA 
ATAAACGCAT 
CGAGCAAAGG 
GGGTAATGTA 
TGGGAGTAAA 
TCAATTCTTA 
TGACACGCCG 



GCAAGCCGGC 
GAGGGATTCG 
CCTATATGCC 
CAACTTCAGC 
GAGCTTCAAC 
CAACXXGGCT 
AGAGCTAAGG 
GATCCGCTCT 
GGGGCGAAAA 
GTATATAATA 
TGCGGGTATC 
GGCTTCCATT 
CGCACCCACT 
CACAGCCACC 
GCTTAGCTCG 
CAAATGGCTC 
AGGGGCCTAC 
CTACTGGGTG 
CXTAATTCAAC 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
23-10 
24 00 
2460 
2481 



(2» THFORMATIW FOR SEQ ID NO: 183 

tl) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2016 bas« pairs 
(Q) TYPE: nucleic acid 

(C) STRAMDEDHESS: double 

(D) TOPOLOGY: circular 

(in MOLECULE TYPE: DNA (genomic) 
(iiil HYPOTHETICAL: NO 



(ivj AJITI -SENSE: NO 

Ivi) ORIGINAL SOURCE: 

(A) ORGANTSIt: PORYPHYROtHONAS GIHGTV7U>IS 

(ix) FEATURE: 

(A) NAIIE/KEY: iftisc_f eatur** 

(B) LOCATION 1. . .2016 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 183 



ATGTACAGCG 
TTTGCTGCCT 
TATACCAAGG 
GAGATGAAAG 
GCCAAAGCTT 
CTGCTCCGTT 
TTCCGTGGAT 
TC^TCTCTTTfi 
GAGTGGAATT 
TATTTCACAT 
AAACCCAACG 
AGCGTGTCCG 
GGTAGTACCA 
ATCTATTCCG 
ATGCGCGATT 
TATTTCGTCA 
TCGGATCGTT 
GAAATGTTTC 
GGTCTGGGAG 
GTCAATATGG 
AAAGGCAAAA 
AACCGAGGCG 
ACCGAGATTC 
AGGATCGTAG 
TCCTATAAGA 
TATTTGAATC 
GTGGACTTTT 
TTCGATAAftG 
CTCACCGACA 
GATGCTTACA 



GACATCATAA 
GCAAGAGCGT 
CTGCCGATAT 
CTTATACGGC 
TGCGTGGCTA 
TGGCACAGAC 
ATCTGGAAGC 
CCCGCCAGCA 
CGGCACGGGG 
CGAGCAGAAG 
ACATTTATAT 
GAGGAATCAA 
TATATl'ATAC 
CCGCTCGGAG 
CGCTCCGTAT 
GCAATATAGG 
CTTATGGTTC 
CCTTCATAGA 
GACTGGATAT 
GACAACCGGT 
ACAAAGAAGA 
ATGCACGCGG 
AAGGTTATGT 
GCGAACGCGG 
TGAGCGTGCA 
AGTACGTAGA 
TCCTTGCATC 
CTACTCTTCG 
ATCCGGATAT 
ACCTCGGACT 



AATCCATTAT 
GAAGTTGAAA 
GTACAATACA 
rrTCCGATCC 
TCTGAATGCC 
TTATCAGCAA 
TTATCCGAAA 
AAAGGAATAT 
CGACTTCGGC 
CAAAGACGAC 
CATCAAACGA 
CACTCCATGG 
GTTGGCGCAG 
CGGAGAAGGC 
GGCTGCl'CAT 
CGGTAGCTAT 
ACCGGAGAAT 
TGGGGATAGT 
TTTCAAAGCC 
CAATTCCTCT 
AGCTTTGCCG 
ATGGCCGCAC 
GATGGACAGA 
CCCCGTAGGA 
GGGCGATACT 
ACTCAAGACC 
GCGTGAGAAA 
CCCCGAAAGC 
TCGGATCGAA 
ATCTGACCGC 



CCTTTTCTTA 
GATGCGGAGA 
TTATACAGGC 
GGTGAAAACT 
AGACGCTACG 
GGAGGTAACT 
AGTTATTTTG 
CCTACACGTT 
CCGGCCTATG 
GGTTTGGATA 
GATGCACAAG 
GATGAAGGCG 
CAAGGAGCCG 
GGTTGGAGCA 
CCCTCTATGT 
GGCGGCAAGG 
TTGGGGCCl'G 
ACCCTTTTCT 
ACGCTGGACT 
GCCGATGATT 
GACAACGGAG 
CTCTTCCATT 
GAAGAAAATC 
CAGGGATTCG 
CGCTATGTAA 
GATACCGCCA 
GCCGAGGGCT 
ATGAAGAGCT 
TTGGGTTCGC 
AGAGCCAA^i.T 



TCCTGTTGGT 
AGGCACATGA 
GTACCCGACG 
ATCGTGCCGC 
GGTATCCGGA 
ATAAGGAAGC 
CAGCTATCGG 
ACCGGATACG 
CACCCGATGC 
ATAGCAGCAT 
GAC6ATGGGG 
TGCCAACGAT 
ATTACGACCG 
ACGGTTCGCT 
CGGCATCCGG 
ATATTTATCG 
ATATCAATAC 
TCGCTTCGGA 
CTACCGGCCA 
TCGGCTTGGC 
TCAAAGGTGT 
TCGAACTGCC 
CCATAGCCGG 
TGACTACTCG 
TGCTTGCCGG 
AGCAGAGTGA 
TGCAAAATAT 
TGGACGAACT 
ATGCCGACAG 
CCGTGGTGGA 



ATGCCTTGCT 

TCGCCAAGAG 

AAAGCAGGTG 

CGGGAGACAA 

TTCTGTGGTA 

CGAGGTACTC 

TTTGGAGGGG 

GCGAGCTGCC 

TTCGGCTCTC 

AACGGGACTG 

ACGTCCCGAT 

CACGCCCGAT 

TACGGTACAG 

CGTGGACATT 

CGATTACCTG 

TGTCAAGGTG 

GCCGGGGGAC 

CGGACACGCC 

ATGGCATGTA 

TGTGGAGCCT 

ATTTTGTTCC 

GGCTATCTAC 

AGCCACTGTC 

TGACGATGGC 

AGCATCGGGT 

GACCTACTAT 

TTTCTATGAT 

GATTCGTATC 

6AAAGGCCCC 

TTACCTCACG 
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180 
240 
300 
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AGTCGTGGCA TAGCGGCCGA CAGGCTTACG TGGAAAGCCT ACGGTAAGTC TGTCCCCAAG 
ACGGTGACAG CCAAAATTGC CGAACGGCAC GATTTCCTGA AGGAAGGCGA TGTGCTCACC 
GAGGAATTCG TAGCACCTTT GACCGAGGAG CAGCAGTCAG TCTGCX3ACCA ACTGAACCGT 
CGTACCGAGT TCCGTGTGAT CGAAGAAGAG TTGCGT 



1B60 
1920 
1960 
2016 



10 



15 



(2) rMFORMATTOH FOR SEQ ID NO: 184 

(1} SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2124 base pairs 

(B) TYPE: nucleic acid 

(C) STRATI DEDNESS: double 

(D) TOPOLOGY; circular 

(li) riOLECULE TYPE: DHA (genomic) 
(lii) HYPOTHETICAL: NO 
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(iv) AIITI-SEHSE: NO 

ivi) ORIGINAL SOURCE: 

tA) ORGAMISII: PORYPH-i Ra-IOHAS GIWGF/ALIS 

Ux) FEATURE: 

(A) NAI^IE/KEY: ml.-?c_f eature 

(B) LOCATIOM 1. . .2124 

(Xi) SEQUENCE DESCRIPTIOtvI: SEQ ID NO: 184 



ATGAAAAAGT 
ACGGACAACG 
TCTACACGCA 
CGCAACATCA 
CTCGATGTCA 
TCCGGCAAGT 
ACGCTCAACA 
TACGGCACCA 
ATCCATGGCA 
TTGGGTGGCC 
AACAAGGACT 
ATAGTAGATG 
ACGGGACGTC 
AACGTATTCC 
TCCAAAAAAA 
CACACGCTTC 
GATCCCACCG 
GACAAGATTT 
ATGGAGTCAG 
GCCACGAACA 
TCGATATCTG 
d'CAACAATG 
TATGAGTTTG 
GATGCTTTCC 
GACCTGAAAC 
TGCGGGATCC 
TTGTCCAGCC 
ATGAGCGGTA 
TCTCTCCGCG 
ACCGATGCCC 
GAATATCGTG 
ATCGAGCAAA 
GCAGCAGAAG 
TTCAATGCCT 
AACATCTTGG 
ATGGGTAAGG 



TTTTCTTCGC 
TCCCGACAGA 
CGGCCGTACC 
AGCAGTCCGG 
TACAATACCC 
ATGTAACCGT 
CGAGCAACAT 
ATGCCATGGG 
ACGTTTCTCT 
GCTTCGAGGA 
ATAAGACCGG 
TAAATGCTAC 
TGCGTTTCGG 
TCGGAGATGC 
ATCTGAATCG 
AATTCTCCCC 
GTTTCATCAA 
CCTTTGGAGG 
AAAGATTCGA 
ATATCGGTTT 
CCGGTGCACG 
AAGCCAAACA 
TGAAAGGCCT 
AAAAAGCAGG 
CCGAAAAGTC 
AAGCCGACGT 
CTGACTATGC 
TCGAGGCCCT 
CATTTGCGAA 
CTTGGAGCGA 
GCAAAGAAGG 
ACJ'GGTATGC 
ACCCTGAATT 
CTGCTTACTA 
ATGAGCTTTA 
TTATGGTCAA 



GCTACTATCG 
TTCGCTACGA 
TCTGAAAAAG 
CTTTAACAAC 
GGGCTTTAGT 
ATTGGTAAAC 
CGAACAAATC 
CGGTGTGGTG 
CTTCGGCGGT 
TATTTTCTCA 
ATCAAACAAT 
CAAAAACAAG 
TATCGACTTC 
GATCCCCGTA 
TTCTTCGACC 
CTACTTCAAC 
CTACAAAAGC 
ACAAAATATC 
GCAGGCAGGA 
GTTCGGACAG 
TGCCGACTTC 
GGAAACTCAT 
TACAGCTCAT 
CCAATACGTA 
CATGACCTGG 
AACCTTAACC 
TAATAATATC 
TTTGTCTTAT 
TGCCACGATC 
AATGTACTAC 
ACTTGAAGTG 
TTACTACCCC 
GGCTGCTCAG 
CCACATGAAC 
TACGGAGAAA 
CTTC 



ATTGGTATTT 
GTACACAATC 
ATACCGGCCA 
ATGACCGACA 
TCGAACATCG 
GGCATCCCTG 
GAGATCCTCA 
AACATCATCA 
AGCTACCAGA 
TTCGATCTTA 
TTCCTATCCC 
AAAATGAAGG 
ACGCCCGAAT 
GGAGGATCTA 
TCTTTCGAGC 
ATAGAGAAAT 
GACTACTACA 
GTACTCGGTG 
GTGAATACAA 
GCCAATTTCT 
ATGTTCTTTG 
AACGTAATCA 
GGTACATTCG 
GGCCCG-rrCG 
GACTTCGGTA 
TATTTCCACA 
ACCACATACA 
GACTTCGGCA 
ATGCTCAATT 
GTTCGCAAGC 
ATGCTCAACG 
GAAGTTCGCC 
GGACTGCTCC 
AAGTATCTCA 
GACGGCTACC 



CAGCGCAGGC 
TTCAGACCGT 
AGATGGAACT 
TCCTCAAGAC 
GTATCCGCGG 
CGGGAACGGA 
AAGGCCCGTT 
CCCACPJ\ATC 
CCATGGCCGG 
GTCTGGGCTT 
TGAGCAAACT 
GGAGCGACTA 
GGTCCCTGAA 
TATGGGGCGT 
TGCTCGGCAA 
CGGAGAACTA 
CCTATGCTGC 
TCGACAGCCG 
AGCCATACAA 
ACCTGei'GAA 
ACCTGAAAGC 
ATCCGAATGT 
GTAGTGCATT 
GCACGACCAT 
TCGGATACAG 
CCGACCACAA 
TCAATGCCGA 
GCCTCTTTGC 
CCGAGATGAA 
AGAACATCAC 
GTCGCTTCAT 
CCGAACTCCA 
GTCATCCGCA 
CCTTCGGTGT 
ACATGCCCGG 



TTTTGCCAAG 

CACGGTCTAT 

CATCTCATCG 

GCAAAGTTCG 

TTTCAAGCCC 

CAATATCTC? 

CTCTTCCATC 

CAAGGACAAG 

ATCATTCAAC 

GGACAAGCAG 

GGAAGAAGCT 

TACTGTAGCA 

TCTGTATCAA 

TTACGGAGAA 

ACATGGCTGC 

TAACAATGCC 

CCTACTCCAG 

AAACATGACG 

CCCCGGATAT 

CGATGCTCrA 

GAACGAGTAT 

CGGAATCAAA 

CAGTGCTCCC 

AGGCAATCCT 

CAATGCACGC 

AGATCTGATC 

CAAGGCTCGT 

CAACAAGTTC 

GAAAAGCCAG 

CTTCGGTATC 

GGGACGCAGG 

GCAACTGCTT 

AGCAATGGTG 

GAACrTGAAC 

ACGTAACATC 



60 
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1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 
174 0 
1800 
1860 
1920 
1980 
204 0 
2100 
2124 



(2) IHFORIIATION FOR SEQ ID NO: 185 

(1) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 1386 base pairs 
IB) TYPE: nucleic acid 

(C) STRA/JDEDNESS: double 

(D) TOPOLOGY: circular 
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(ii) MOLECULE TYPE: DHA (genomic) 
(Hi) HYPOTHETICAL: HO 
(iv) Aim -SENSE: NO 

Ivl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYRa-IOHAS GINGIVALIS 

lix) FEATURE: 

(AJ NAME/KEY: nasc_fcature 

(B) LOCATION 1...1386 

(xl) SEQUENCE DESCRIPTION: SEQ ID riO:185 

ATGAACAGGT TTTCAAATCA TTGGCCCTGC ATCCTCGTGG G-oTTrGTACT CTCGTTTGTA 
TCGGCGftGTC GGACTGTGGC ACAAAACGCC TCCGAAACGA CGGTATCGTA CGATACGGAT 
TCTCCGAAGC CGATGTGCTT CGGATCGCTC TTAGTGAGAA TGCCACAGTG 
AAAGTGUCCG ATATGGATGT GCGCAAACAG GAATATGCAC GTAGGGCAGC ACGTGCCGAT 
CTCTTCCCGA AAGTA'3ACCT CAATGGCGTT TACAGCCATA CCCTAAAGAA GCAGGT^A 
TATATAGATA TGCCCGGTTT CAGCAGTAGC GAAGGTATCG AAATGGGGCG TACACACAAT 
ACGCAAGGAG GGGTGAACGT CTCCATGCCA TTGGTGTCGG CACAGCTTTG GAAAAGCATT 
SSSI^^^^*^ GAGAACAGCT CGATCTGGCT CTGGAGAAAG CTCGCAGCTC CCGAATCGAT 
AGGTGAAGAA GGCTTACCTC AGTGTATTGT TGGCCGAGGA CTCTTATGGC 
^li^™^^^^ GCAGCtATGA CAATGCTCTG GCCAATTATA AGAACATATC CGACAAGTTC 
^^T™??^" TTGTGGCCGA GTATGATAAG ATTCGAGCCA ATGTACAC-GT ACGCAACATC 
TCTTGCAAGC GCAGAACTCC GTAGCCCTTG CTCTCTGGCA GCTCAAGGTC 
CTGATGAGCA TltGAAGTGGA AACTCCGATC AGACTCTCCG GTTCATTGTC CGACTATAAA 
ATACCGGCTA TTTTGCCGCC GATACGCTTA TTTCCAACAA CTCCTCCCTG 
SS^.'t^^S^? ATATACAGCG TCGTCTGGCT GTCAGTGCAG ACAAGCTGAA CAAGTACAGC 
CACTCAATCT GGGAGGGCAG TACACCTATT CGCTCAACAG CAACGACATC 
AAATTCTGGG GCGAGGGACA ACGCTGGACG CCTTTCTCCA CCATATCGCT CAGCCTGTAC 
ATTCCTATAT TCAATGGAGG CAAACGTCTG TACAACGTGA AGCAAAGTGC TTTATCGATC 
ATCTGCAACG ACGCCACATA GAGCAATCCA TCCGAATGGG AATCAAGAAC 
CAAAAPGACC GTCTGCGTAC CTGTATGCAG AGATTTGTGG CCTCGGAAGA GGCTGTCCGA 
r^Z?.?^^^ AGGGCTATCA GATAGCAGAG AAACGCTATC AGACAGGCGA AGGCACTCTC 
fl^SSSS^S; ACGATGCCGA TGTGGCTCTT TTGCAGGCTC GACTCAATTA TAATCAGGCC 
ATATTCGACT TTATGACCGC AAAGGCCGAA TTGGACAAGA TGAACGGCAT GGGGATTCCC 
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(2) INFORMATION FOR SEQ ID NO: 186 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 147 6 base pairs 

(B) TYPE: nucleic acid 
(CI STRANDEDNESS: double 
(DJ TOPOLOGY: circular 

(il) MOLECULE TYPE: DT^A (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGATJISM: PORYPHYRaiOf^AS GINGIVALIS 

(Ix) FEATURE: 

(A) NAI-1E/KEY: misc_featuxe 

(B) LOCATION 1. . .1476 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1^6 
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ATGTGGGGGG 
GCCTTAAGTG 
AAAGAAACGG 
CGCGACTTGC 
GATCGTACCA 
GCGGATATAG 
GAGGTGGCTG 
CTGTATAGGG 
ATCCCGGCCA 
TATAATCAQA 
GGGAAATCGT 



ACAGCCATG«3 
AATCCCTTCC 
AGGAAAAAGT 
CGGACTACAA 
CAGCCCTTTC 
GGCCTGTTGC 
TCGATGGCCG 
GGTATGAGGC 
TTTTGAAAAC 
CGGATCTGCC 
GGAGCGATCC 



AGTGGCGCCG 
TCCGGGTGCA 
CACCGCCCTA 
AGGGCGAGTC 
TGCGGATTCG 
TTCTTTTTCC 
CCCTTTGCCG 
CCrCTTTGTA 
GGCTAATGGA 
GGAGGATATA 
CAGGATTATC 



AACCAAGTGC 
AAACAGATTC 
TATCTCCTTG 
TCTTACGATA 
GTAGCCGGAC 
CGATCCGATA 
TTGAAACAGC 
CCCG'3TGATG 
ACACTCATAG 
GATATAGTCA 
CTACAGGGAG 



GCCGAACGCT 
GTATCGGATT 
TGACTGATTC 
GCTTCCCGAT 
GCCGCTTCTT 
CGCTGACTGC 
TGTCGCCTGC 
GCGGATCGCG 
CGATGGCCGA 
TGCGGCGCAG 
AGGGGCGCAA 



GGTGAAGGTA 
CTCTCTTCCG 
TTTAGCGGTG 
CTCAAAGGAA 
TTATTTGGCT 
CCGTGTGGAA 
CTCCCGTCGT 
GAACTATCGT 
CAGACGAAAA 
TACGGACGGA 
TCATGGCTTT 
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(il) UOLECULe TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) AUTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 
(ix) FEATURE: 

(A) NAME/ KEY: mlac feature 

(B) LOCATION 1. . .735 



(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 828 base pairs 
IB} TYPE: nucleic acid 
tC) STRANOEDUESS: double 
(D) TOPOLOGY: circular 

(iil MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(ivj AMTI -SENSE: HO 

(Vi) ORIGINAL SOURCE: 

(AJ ORGANISH: PORY PHYRCflONAS GINGIVALIS 
(Ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATIOtI 1 . . .828 



720 



^a^JS^ CCCTGGTGCA AACCCAAGCA GGAAAGCTCC TGATGATCTT TGTCGGTGGA 

^^^^C SS^il^^^ CCTCAGCGCA CTTATA^Sc l^l^o 780 

TGT^^^C ^^S^I^ S^'^'''' GATATAACCC ATTTCATCTT CGGCAAGGAT 840 

5 OTOT^^? J^^"^*" GCCTCCTTTT GTGCTTCGGG ACAAGGGCTT 900 

GTrSr^nPl CACGTrrCTG GCTGCCATCC GCGAATCAGG GCAGGAGTAC 960 

^^^^ GATGAGGGCG ATACATGGCA GCTTTC^C 102^ 

AtS^?^ Irri^^"" CrrrCATTGA TGCCCGATGG CAGGGTACTG 1080 

GAOG^^C ^f^f^SS^*^ AGCCGACAGC GTTTCTTCGC TCTCTCCTCC 1140 

10 AAtcSI^A ^^^S^T CAGTTCGAGG GCATCCATGA CCCCGGCl-GT 1200 

CTCGG^rrr^ lSS^lS?f^5^ GAAAAGGAAC GGAAGGGATC AAGTGCTGCA CTCCCTGCCT 1260 

EiE ---- --c?^ 

(2) INFORIIATION FOR SEQ ID HO: 187 

^f. ti) SEQUENCE CHARACTERISTICS: 

tA) LtNGTH: 735 base pairs 
(R) TYPE: nucleic acid 

(C) STR/VNOEDNESS: double 

(D) TOPOLOGY: circular 



(3:1) SEQUENCE DESCRIPTION: SEQ ID NO: 187 
Jl^^^^SiJ^^ AAAAACTTTG GATTGCGATC GTCGCCGGTT TGGCTTTCGT ATTGGGCCTT 

J^^nSS^^"" gccgcagtgt cgctcagcta cgccgctctc agccttcggt ^otgSc 

S^T^ A?^C^ ^SS^^ CTGATCGTTT GGAC.rGCT?^ ^Z^^^c 
GACTA???^ ^SJ^CA ^SS^S °^TTGAAGG AAAAACAGAT ATTGGTAGCA 
4c: AAAACAAGCA GCTGCCCGAT TCGTCTTATA TCTTCTCAAG CGTARrr-aTf 

^i^fi^l ACAACTACTA TTACGATCCT CGGCAGGAAC AAA^S5^G ^^^Ic 

^SctS^g I^^?^^^ agttacggtg acctcacagg acatcgI^^ ?^t^^ 

^CT^?^ A?^cS^^ ^^^l^l ^^<^TAG AGATTACCTC CCACCGTcS 
)^X~^i™I, A^CCAAGCT CAATGATCTG AAGGTGGAGA TGCTGCGCAA TGCCTCCGAA 
50 SS^SHI^ ATCGTGCTTC GGTC^TTGCG GAGGGGAGCG GTTCCTCCGT GGOTA^^T^ 

735 

55 (2) INFORMATION FOR SEQ ID NO: 188 
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(Xij SEQUENCE DESCRIPTION: SKQ ID NOrXBB 



ATGAAAAAAA 
AAGAAAAATG 
ATTTTCTACG 
ATTGATTCGG 
TACAATCTGG 
GATATOGACA 
GCCATGAAGC 
GAAAACAATA 
TTTAAAAAAG 
GAAGGTACGG 
CrGGTCGATG 
GTGATTAAAG 
GTAATCCCAC 
ACCCTGACGT 



CAATTGCAAT 
CTGACACTAC 
GACAGGATTT 
TAGCTTTCTT 
GAGCCATCTA 
AGTTCTATGC 
CTGCAGATGC 
TGAAGCAGTT 
AAGATGGTGT 
GAGCTACTCC 
GTAAAGAGTT 
GCTGGACGGA 
AGGAGCTGGC 
TCGAGATGGA 



TATCGCCTCA 
CGCTGTCAGT 
TGCCAATCAG 
GGACGGTTTC 
TGCTTCCAAT 
AGCCATGCGT 
ACAGGCTTTC 
TGGCCAGAAC 
AACTGTTACG 
CTCTTTGGCC 
CGACAAAAAC 
GATGCTCCAA 
TTATGGGGAG 
ACTTATCGGG 



GCCCTCTTGG 
GAAAAGGATA 
TTCGAAATGT 
AAATATGGTA 
ATAGCTCGTC 
GCGGCTCTTC 
ATGCAACGAA 
ATCGAAAAGG 
ACAACTOGTC 
GATACTGTAC 
GAAGAAGGAA 
CTCATGAAGG 
ACCGGCAACT 
ATCAAGCCCG 



CTTTAGGAGC 
GCATAGCCTT 
CCCGCTTGCA 
TCGATACGAC 
AGCTGGCTCA 
TTAAAGACAC 
TCCAAGCCAA 
GTAATGAATA 
TGGCATACAA 
GTGTCAAGTA 
TCGAATTTGC 
TCGGTCAGAA 
ATACCATCGA 
GGA7J\AAG 



CGTCGGCTGl* 
GTCCATGGGT 
AGGCCAGCCG 
GCGCTTCTCG 
TGATTCCATC 
CGTATCTATC 
AAAGCAGCGA 
CATCGATACC 
GACTCTTCAG 
TGTGGGTACT 
CGTTACCGGT 
AGTTCGCGTG 
ACCGTTCTCT 
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{2» TNFXDRHATIOH FOR SEQ ID NO: 18 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 2325 base pairs 

(B) TYPE: nucleic acid 
iO STRAI^DEDNESS: double 
(D) TOPOLOGY: circular 

{111 MOLECULE TYPE: DlIA (genomic) 

(iii) HYPOTHETICAL: HO 

(Iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: P0RYPHYRa40NAS GIHGIVALIS 



(ix) FEATURE: 

(A) HAME/KEY: iiasc_feature 
<B) LOCATION 1...232S 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 189 



ATGAAAGTAT 
TCCACCACAA 

atactccg'Ik; 

CTGAAAGTTA 
CCATl'CGGGC 
ATATTCGACA 
GCTAAGGTGG 
AGTGTGACCA 
GCCTCTCCTT 
ATTCTGGCTT 
AAGCTGCACG 
TTCCGCCCAC 
CTGCGAGCCA 
AAACGGACAG 
GAGGATATGA 
CGCTTTCCTT 
TCCTTGGCTC 
AriTCCGGCC 
TTAGAGACCT 
CTTGCTCGGC 
TATGAGTGGG 
TCGTACAACA 
CTGGATAAAT 
AACAGGGCAC 
CCCTCCAAGG 
CATCAGACAG 
CAGAGTCAGT 
AAAAGTCCTC 
CTGATCTATC 
TTCTCTCAGT 
CAGTCACTGG 
GCACCCTATA 
CGTAGCATCG 
CAGGTGGGCG 



TACGGCAAGT 
AGAATCTGCC 
AGGACAAGAG 
CACCCAATGG 
TATGGCTATA 
AGTTTGCAGC 
CGACGAACAT 
CTCTGAAAAA 
ATCATTACGA 
ACAGGCAGAC 
AAGAGCGTCA 
AGGATATTAT 
AGCTCTCGGA 
CAGTCCTGCT 
AAGTCCTTTA 
TCTTCTCCGG 
GTTTGGGAGC 
TTTTGGATGT 
TGTTCACGAG 
GCAATGTATT 
AGACCGGCAA 
TGAATACGGC 
ACTATTACTA 
ACTACTTTAG 
AACACCGGCA 
AAACTTTCCA 
TCCTTGCTCA 
ATCATCTTTG 
TGGCAGCCGG 
TCATCAAAGC 
CAACCCGTTT 
GCGAGCAGTT 
GCCCCGGACG 
AAITCAAACT 



ATTCCTCCCC 
GGAAGGCGAA 
CCACGCCGGC 
AGCTATTTTC 
CAACAGCTTC 
GAAGCCGGTT 
CCTCCGCGAA 
GGACTCGCTC 
CAGCATCATl' 
TCCGTCTTTG 
GACCATCAGT 
CTACGAAGCC 
AGATACTCCA 
CGGAATGAAC 
CTATCGTAAA 
CAATCTGTAT 
CTTCTCCGTT 
GC6ACTGCTA 
CAAAAGCAAT 
CGGCGGAGGA 
TCGTCCCGAA 
CGTGAACCTC 
CCCCACGACT 
CATGTACTCT 
TGCTATTTTC 
GGCCATTACG 
AATGGGGTAT 
GATGCAATTC 
CAAGAAGTAC 
CACGGGAGAA 
CGGGACAGGC 
CTATGTAGGC 
GTTCAATCCG 
CGAAGCCAAC 



CAGCTGTATA 
CAACAGGCTC 
GGCAGTGCAA 
GTGGGGGATT 
TTCATCAGTC 
CACGGGTACT 
AAAGCCAAAA 
CCCTTACCGA 
ATCAGGAAAG 
GCCCTGCTGA 
GATACCCTCC 
CCCCAAGCCA 
GGAGAAAGCC 
ATGCCGGTTC 
CGGCAGAAAG 
ATCGATCTCA 
ACCACCCTCG 
GACTTCATCG 
GAAAATCTTT 
AATAGCAGCA 
TCGTTTCCCT 
ACGACriTTC 
TTCGGCTTTT 
CC6CTCAAGC 
GCGAACAATC 
ATCTATACGT 
GGACTATCCG 
AGCGACACCA 
CTGCGCTATT 
GTGATATATA 
GGTGCCAATA 
'3ATTCCGACA 
GTG'SAATAa-A 



TCCTACTGAC 
TCGGAATGGG 
TCACCGAAGT 
GTGCCTCCTT 
CCACTGTCAT 
AGGTCAAATC 
TCGATGCTAA 
TCTCCTATAC 
TCAGCACTTT 
GAGACCAGTT 
GAGACAATGG 
TCGTAAGAGG 
TGCGCCCGTG 
CGACAGACTC 
GCCCCAAGAT 
ACGATGAGAC 
ATTTTTTGCA 
ACAAACCTTG 
GTCCCGGACT 
CTTGGAATAT 
ATCGGCTGAT 
CGATTGTATT 
AGGCTTCTGC 
CGACCACCTA 
TCAACTACAA 
CGCCCCTGCT 
TCAACAAATC 
AGGCAGGCAA 
AGAATTTCGT 
CCTATACCAT 
GCTATGGCAA 
GTATCAGAGC 
ATCAGTATTC 
GAGGCAAGCT 



AGGTGCCTGC 
CAAGACACAG 
GGAGAGTACA 
ACCCAAGATA 
TTCGAAATGG 
CGATAGCCGG 
AGTAAAAAGC 
GGTGGATATG 
CCCCGACAGC 
CAATTTGGCA 
TTACTACTAC 
TGCCGTATGC 
GAGGATAGGG 
GCTCGAAGTG 
TTTGGCCAAA 
GACACGCAAA 
ACGCGATTCC 
GGATGCATCA 
GAATTTTGCT 
CGGTGGATCG 
CGATATAAAT 
TCCCCCTCTG 
CACCGCGCTG 
CGAATTTCAG 
CCTCCTGGGG 
GCTCAGCCTT 
CGTTTCAGAG 
TCTCCTGAAT 
CGGCGTCCCC 
AGACCGCAAT 
TATGCGAGTG 
TTTCACCGTC 
CTATTTGGAT 
TTTCGGGGAT 
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CTCCACGCAG CCGTTTTCCT CGATGCGGGC AACGTTTGGC TCTTGAGGGA GGATTCTTCC 
CGTCCGGGCG GTGCTCTGTC CGAAGTGGGA TCGGTGAGCA ATTTCCTGAA TAGCATCGCT 
CTCGGCACCG GTGTCGGCCT TCC3CTACGAT CTGGCATTTC TCGTGGTTCG TGTCGATGTC 
GGCTTCGGTC TCCACCTTCC TTACAATACG GGTAAGAAAG GTTACTACAA TATCCCACGC 
TTTAAGGATG CCATCGGTTT CCATTTGGCT GTCGGCTATC CCTTC 



2100 
2160 
2220 
22B0 
232S 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



(2) IWrORHATION FOR SEQ ID NO: 190 

(i) SEQUtNCE CHARACTERISTICS: 

(A) LENGTH: 2322 base pairs 
(Bl TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGir4AL SOC»RCR: 

(A> ORGAN I SH: PORYPHYRa-IONAS GINGIVAL! S 

(ix) FEATURE: 

(A) NAtlE/KEY: micr_f eature 

(B) LOCATIOtI 1...2322 



(xi) SEQUENCE DESCRIPTION; SEO ID 110:190 



ATGTCCTCGC 
GCTTCCTCCT 
ACGATCGCAA 
CAGCAACCCA 
.TCGAATCCGA 
GTCCTCATCG 
GGCGATGGCT 
GCTCGCATTA 
GATGTGAAGA 
AAGGTCGGGA 
GCGATAGCTC 
TATGAAGCAG 
GTGGTCAATG 
TATGATCCTC 
CGTGGCGATT 
ACGCGGTCGG 
TATATCAAGC 
AATGGTAAGG 
ACCGTTCCGG 
GACTTCGGGG 
ATGTTCAATA 
ATGGAATATG 
GACGAAACGC 
ACACGTCCGG 
TACCTGCACA 
CCGTACATCG 
ACGCAGCAGT 
GAGCGTACCG 
CTGCAAGCCA 
TTCGGTCTGC 
CTCGAAAAGG 
AATGCTCGCC 
GGCTGGAGTG 
TTCGATCAGA 
TGGAAGTTTC 
GAGAATCAGG 
YACGGTCTGG 
AAAGCCTACG 
TCTTCCAATT 



ATTCCGTTCG 
GCTCGGTCAC 
GCGAAACGGG 
ATTACAGACT 
ACAGCAACAG 
ATTCTGTCCT 
TTCTCGATGC 
CTTATCTGAT 
ATCCACTCCT 
TCAGCGAGGG 
GTCATATCCG 
ATACTACCGT 
GCATCGGGCG 
TCGAATCGGA 
ACACTGTTTA 
TGTCCGTTAC 
TGAATGCGCT 
ATGAGATTGC 
CCAAGAGCAA 
CGGCTTTGTC 
TCAAACTCAA 
GGGTGGAAAG 
GCCGGCGGCT 
AGTTTGATCG 
ATCGTCTGCG 
ATCCCGACTT 
TTATCCTCGG 
TATCCAATCC 
TTTCTTATCT 
ACTATGCTCA 
ACAATACTTT 
ATATACCCTT 
TCCGTACCCT 
TGGGTGATAT 
GCGCAGCAGC 
AGGACGGTCT 
GGCTTCGTCT 
ATCCTCAGCA 
TCGCTTGGCA 



GTATCTAATC 
CCGTTATGTG 
CAGTATCGCT 
GTTCGGGATG 
CTGGTGGAAC 
CACCGATCGT 
TACTGCTCGT 
TCAGCCCGGA 
TCCTCCCGTT 
TTCTCCCTTG 
CAACAACGGC 
TTCAGGAGGA 
TTATCCATAT 
CTTCAGAGTT 
erATGGGAGT 
ACCGGGAGCT 
CCCTATCGTT 
TCTGGCGGAT 
ATCGTTCGAA 
TCTCGGTTTC 
GGGTGCTTAC 
CrrCGCTCCGT 
ACGGGCATCC 
GGTGATTCTC 
TCATACGATC 
CGCCCAATCC 
CTCGGCATAT 
TTTTACGGCA 
GACCGATTCT 
GTTCCTCAAG 
GGCACTGCAT 
TGAGTTACGT 
CGGCCCGGGG 
TCGTCTGGAT 
TTTTGTCGAT 
CTTTCGTTTC 
CGACTTCGAT 
GACACG3CGT 
CATTGCAGTA 



GGCATTGCCG 
CCGGACGGTA 
CTGCCGGAAG 
ACTCGCTGGC 
CGTTCGCTCC 
ACTGCCAACC 
GCCGTGGTAG 
AGCCGTTATT 
GCGCTTGGCA 
TCGCCCATCG 
TTCTGGAAGT 
TCGGGTACGA 
CGGATCGGCA 
CAGGAGCTGC 
AGGGGACGTT 
TTTTTCTGCG 
CGGAACGTGA 
AGCTCTCGCC 
GCCGAAGTCC 
ACCGATCGCA 
GAAGCCATTC 
TTCCCTCGTC 
ACGGAATGGA 
TCCGCTCAAC 
CGCCTGCTGG 
CTTCCGCCTA 
ATACTGAACT 
CGGTTCAGTA 
CCGAAAGACG 
CTCGATCTCG 
CTGGGTTTCG 
TACTTTGCCG 
AGTATGAAGA 
CTGAATGTCG 
GCCGGCAATG 
GATCGCTTCT 
TATTTCCTTG 
TACAAATGGG 
GGCTATCCGT 



GCTGCTTGCT 
GCAGACTATT 
ATATTCGGGA 
TACTGCGCGT 
GGAAAATGGG 
GTCTGGCAAA 
ACACCGGCTT 
ATATACGCAA 
ATTCGCrrCC 
TACTCGATGA 
TCTCCGCCGA 
AATCTGCCGA 
GGGTATTCTT 
CACGTATCGA 
ATATCCGGGC 
AGGATGATGT 
ATATCCGATT 
TTGTGGACTG 
TCGGCACCAA 
ATTTGTTTCG 
GCAAGGGTTC 
TCCTCTTCCC 
AGATCGGGTA 
TCAATTATTC 
ATGTCGATTA 
CGACTGCACT 
ATACCACGGC 
TCCAGACAGC 
AACACGGGTT 
ATCTGGCTAA 
GACTGGCTTT 
GACGATCGAA 
TGACTCCGGA 
AATACAGGAC 
TCTGGACGAT 
ACAAGGAAAT 
TGCGGCTGGA 
CTATCACACG 
TC 



CCTCATGCTT 
AGACAGGGTA 
CTATACCCTC 
CTATAGCAGC 
CGAACCGCCT 
GGCGATGGCC 
GTACAAGAAA 
TATGGCTTTG 
TTCGGCATAC 
AGAGAGAAAG 
GGATGTTTAT 
TCTGAAATTA 
TCATGCCGAT 
TTCGATTTCG 
ATCGGCrCTC 
GGAACGCTCT 
TGTGGAGCAC 
CTATATTCTT 
TTCCGCTGGA 
TGGGGCGGAG 
GGACAGCTTC 
ATTCATTTCT 
TAATTACCAG 
ATGGCAGACC 
TCTCCATCTC 
GTATAACTAC 
TTCGTCCATG 
CGGCAACCTG 
GTATAAAATG 
AACCGTTCTT 
CCCTTATGGC 
CAGCGTTCGC 
CAAGACCTTC 
AAAGCTGTTC 
AAAGGAGTAT 
AGCTTTGGCC 
TGCCGGACTG 
CCCAAACCTT 



60 
120 
180 
240 
300 
360 
420 
400 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1030 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
n40 
1800 
1860 
1920 
1900 
2040 
2100 
2160 
2220 
2280 
2322 



75 



(2) INETORHATION FOR SEQ ID IIO:19i 

U) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 2601 bass pairs 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



f'"Vi iCZH TfT ^"'^ -'f^' -qL ^"^» " 

iLJi -.:H . iCi* -SZ, ii3 m lUH iujj C O hL^j 



WO 99/29870 



PCT/AU98/01023 



129/490 



10 



15 



20 
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30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(U) 'nPE: nucleic acid 

(C) STRAIJDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

ttv) Ajrri -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAMISM: PORY PHYROMOHAS GINGI^ALIS 

( iy, ) FEATURE : 

(A) NAI IE/KEY: mlsc feature 
tBl LOCATlOlvI I.., 2601 

{^-.i) SEQUENCE DESCRIPTION: SEQ ID NO: 191 



ATGAGAAAAA 
CTCGCCATAG 
CTCATCGGTG 
ATGGATGGTG 
GGTTATGCTA 
CTCCTCGATC 
AAAGACCGTA 
TCGTTGAATG 
GGAAGCGGAG 
GGTGTACTCA 
AATTGGAGTG 
TCCAAGCTCG 
GCCAACACCG 
TTCTCCATTT 
ACGGGTCTGG 
TCGAAGAAGT 
CACAACCAAC 
AATCAATCCT 
TACCACAAGC 
TATACcgCAN 
AGTAAGTGGG 
CCCGATGGAC 
TCGGAAGCAA 
TTCAAGAAGA 
CGTGGCGACC 
CCCAAGACAA 
AATCGGGACT 
TCCGAATGGA 
AACTATGGCG 
cn'CCGTGGA 
GCCAATGGCG 
ATCATTCCCA 
TTCACGAATC 
CGCGTGACCT 
CACTGTGGGG 
ATGTTCTCTC 
GCCGGCAATG 
GCACAGATGA 
GGTAAGTACA 
CAGCAGTACG 
CTGTTCGATC 
TTCAACATGG 
GGTAAGAAAC 
ACCGGTATTC 



GAATTCTACA 
CGCAAACAGT 
TATCCGTAAG 
GCTTCCGATT 
CCGTAACTCC 
CCCAGGCCAT 
TGACGCCGGT 
TCGAATTTCC 
GTTTCGGTGA 
TCAACGGAGT 
GTCTGAT6AA 
GTATCAGCTC 
GAGGTTCGGC 
CTACGGGTAT 
GTTATGTGAA 
TCAATGAACG 
GTTCTTCCAA 
TCGGCTATCT 
CCCAGTTCTC 
CCTACGCATC 
TATTGATCAA 
rrATCGACTA 
TTT'rrGCCCT 
AACTTAATAG 
ACTATGACAA 
AGCTCGCATA 
ACACAGGCGA 
TCGATGCATT 
GTAGCAAGTC 
GTGGCAAGGC 
GTTTCTTCAC 
ATGACAAAGC 
ACAAAAACTT 
CGAAGAGAAT 
TAGAGGCTGA 
TCGGTGACrG 
AGACAGGGCA 
CGGCTGCTGT 
AcnTCCTTGG 
AAGCGGAl'GG 
TGTCTGCATC 
ACAACGTAGC 
ACGATGAGGC 
GTGTAAACTT 



ACTTTTCCrG 
GGTGACCGGT 
CACCGGTCAG 
CGAAGTACCG 
CTCTATAGGC 
CGGCTTGGAT 
ACCCGTTTCC 
CGAACTGGTT 
TGGTCGTACC 
TCCTCTCAAT 
TCAAGCCAGT 
GGTAGGTGGT 
TTATGTCGGT 
GAACGACGGT 
GGGGCTGAAG 
TCATACCCTC 
ATATTCTGTA 
GCGAGGCGAA 
GCTGAACCAC 
rrTGGCTACC 
CTACAACACC 
CGATGCCGTA 
TGGCTCCAAC 
TTCGCTGACT 
GATCACCGAT 
CCATGCGGAA 
AATCATGTGG 
CGTATCAGGA 
CACCGGCTAC 
AGCrCTGAGC 
ACGTGCACCA 
CAATATGGAA 
CGAGTTCAAT 
CGGAAACGAG 
GGTCAGCTAT 
GACTTGGCAA 
GGATATAACC 
ATCGGCAGAC 
CAAGAACTAT 
CAAAGAAATC 
CTACAATTTC 
CGACAAGCGA 
TTCGGCTCTC 
C 



ACCGCATTGC 
AAGGTGATCG 
CGAGCATCCC 
GCCAAATCTG 
AGAGGTTCTC 
GAGATTCAGG 
AATATCCGTG 
AAATCCACTC 
AATGTGCGTG 
GGTATGGAAG 
ACCATTCAGA 
ACGATGAACA 
ATGGGTAATG 
TGGGCTATCA 
GGACGTGCAT 
TCTCTTACCG 
GCCGACTATG 
Cl'GACTCCTA 
TTCTGGAAGA 
GGTGGAGGTC 
GGACAACCCT 
CTGGCTGCCA 
TCTCACAAGT 
TTGACAGCCG 
CTGCTCGGCG 
GGTCAGCAAC 
CACGGCCTCT 
TCTATCAACT 
CTGCCCGGCG 
TACAAGTTCG 
CTCTTTGGCA 
AAGGTGCTTA 
ATCAACGGAT 
TATGTTTATC 
CGTCCTATTC 
AACAATGTAA 
TATATCAAGG 
ATAGAGCTGT 
GCAGGATTCA 
GT5GAATCAT 
AAGCTTGGTT 
TATGTGAGCG 
GTATGGTACG 



TGCTGGCATT 
ATTCAGAAAC 
TCCGCGGTGT 
TCTTGACTTT 
AAGAAGACCT 
TAATAGCCTC 
T'3*3CTGATAT 
CCTCTACCTA 
GATTCGACAC 
ACGGGAAAGT 
TTCAGCGCGG 
TTATCACGAA 
ATGGATTGCA 
CCATTGCAGC 
TCTCTTACTT 
GATTCGGTGC 
ACAAATACGG 
CGGCTTATGC 
TGGATGAAAA 
GTCGCGCTTA 
ATGAACAAAC 
ATGCTGCGGC 
GGTTCGGTCT 
GATACGATGG 
GTAGCTACTA 
TGAAAGTGGG 
TCGCACAGAT 
ACGAACTATA 
TATCGCCGTG 
CACAGGGACA 
ATATCTATGC 
CA-3GAGAGGT 
ACTATACGAA 
TCAATGGCGT 
GTCAGATCGA 
GTTACACTTC 
GTCTTCACGT 
TCAAGGGTTT 
ACCCCGCAAC 
GGAAGTTGCC 
CACTCAGCAC 
ATGCCGACGA 
GTTTCGGCCG 



AGGCTCCTCT 
GTCCGAACCy 
AACCACCGAT 
CCGTTGCGTA 
CGGTAC3ATT 
TGTGGTGCCC 
TCAGGCAGCA 
TACGACAAAA 
TTACAACTTC 
ATATTGGAGC 
ACTCGGAGCC 
GACTACGGAC 
CAAAGAATCG 
CTCCCATATG 
CTTCAACGTT 
ACCACAATGG 
CATCCGTCAC 
TTACAATACG 
TACCTCTCTT 
TGGAAAGAAC 
AAAGGTGACT 
GAGCAATGGC 
ACTCTCTTCA 
GCGTTACTAC 
CATAGAGGAT 
TGACATTGTA 
GGAGCATT CG 
CCGCAATCAC 
GAAAAGCTTC 
CAATGTATTC 
TGCGGGGGCT 
CGGCTATGGA 
GTGGATGGAT 
TGATGCTGTT 
CCITCGCGGT 
TTACOACGAA 
CGGAGATGCA 
CCATGTCATA 
CCGTAATGCA 
CGATGTACGT 
CACATTCTAT 
CAATATCATC 
CAC'ITGGTCT 



60 
120 
180 
240 
300 
360 
420 
400 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
ISOO 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
24CO 
2460 
2520 
2580 
2601 



(2) INFORltATlON FOR SF.Q ID NO: 192 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1293 base pairs 

(B) TYPE: nucleic acid 
to STRA^^DEDNESS: double 
(Dl TOPOLOGY: circular 

(11) nOLECULE TYPE: DNA (genomic) 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



O ^'^^ ii^-- ^i^i' ^ ^ "^"^ " 



WO 99/29870 PCT/AU98/01023 

130/490 



(lii) HYPOTHETICAL: NO 



Uv) AIJTI-SENSE: NO 

5 fvi) ORIGI^fAL SOURCE; 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

(ixj FEATURE: 

(Al WAHE/KEY: misc feature 
10 (B) LOCATIOIJ 1...1293 



(xl) SEQUEUCE DESCRIPTIOM: StQ ID NO:192 



ATGAAGTTTT CAATCCGCCT TTTCCTCTGC 

15 CCTCCTCTCG GACAAAAATC CAAGCAGGTA 
CTCAAAGCCA TCGAAAAAAC CGATCGCGAA 
AAACAAAAGC ATCTCAACCT CCTGAACAAG 
CTCTTGGACA ATGAGGTCAA AGAGTTGCAA 
CATCAGCTCT CTGTAGAAGA GAAAGCCCGA 

20 ATGCAAAAGC GGAAACGCTC GTTGGATCGC 
GACGAAGGCA TGCGACGGAT GCGTTTCTTG 
TCTGTCCGGC TGCGCGATAC ACGTAGCAAG 
GCCAAAAAGG AGAAAGGACA TCTCTTAGTC 
G6ACAGCAAG CC<3AGCAACG TCGGCAGGTG 

25 GAAGCGCAGC TGCGAAAGCA GAAAAAGCAA 
CASATTGCCA AGC3AAATAGA ACCTGCCGAA 
GCACGCGAAG CCAAAGCCAA GGGTAACCCG 
ACCAAAGGCG GCTATGCTAT GGATGCCTCT 
AACAAAGGTC GCCTGCCCGG CCCCGTTCGC 

30 GTGCATCAGC ACAGTGAGCT GAAAAAAGTA 
GTAGCAAfJAG GATCCGATGC TACCAGCGTA 
ATACCCXSGTT ATAATTCGGC CGTAATGGTT 
AATCTGAGCA AAGTGTATGT AAATTCCGGC 
CGTGCCTATA CGGATCCTTC CAACAACCAG 

35 CGCAGCAAAC AAAACCCAAG ACTATGGTTA 



ATCATCTTTC TCCTCTCTGC ATTTATCCTG 60 
CAGCGACl'TG AGAAGCAAC6 TAAGGAGGCC 120 
CTACGAAATA CCAAGAAAGA CAAGCAAGAC 180 
CAGGTTGCTC AACGCAAGCA GATGGTACAA 240 
TCCGACATTG ATTCCATGAC GG 3TGTATGT 300 
TCCGATGAAT ATGCCCAAGC TCTACAGTCT 360 
ATCCTTTTCA TTTCATCGGC CAAGAGCTTT 420 
GAACAATACG CTTCTGCATA CAAGCTGGCA 480 
TTGGAGACTG AACGTGCGAC TGTAGAAGAC 540 
ATCAGAGAAG AGGAAAAAAA GAAACTCGAA 600 
CAGGCTTTGG GAGCCAAACA AAAAGACTTG 660 
GCCGAAGCTC TGAACAGAAA GATCGAGAAA ''20 
CGTCGTGCTC GAGAAGAAC3 TGAACGGTTG 780 
GTTCCrGCCG AACCGGAACG GAAGGCGGAC 8J0 
GAGCGTGCTC TCTCGGGCAG CTTTGCACAG 900 
GGCAGATACC GAATCGTAAG CGACTITGGC 960 

CAAGTTAATA ATGGAGGTAT CGACATCGCT 102C 

TTCGATGGTG TAGTGTCCAG TGTATTCGTG 1080 

CGTCACGGTA. ACTATATCAC GGTTTATGCG 1140 

AGTCGTGTTA AAACGGGTCA GGCTCTTGGT 1200 

ACCATTATTC ACTTCGAAAT CTGGAAAGAA 1260 

CSA 12 93 



12) IHFORMATIOIJ FOR SEQ 3D NO: 193 

40 (i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 999 base pairs 

(B) r/PE: nucleic acid 

(C) STRAIJDEDNESS: double 

(D) TOPOLOGY: circular 

45 

Ui) MOLECULE TYPE: DMA (gsnomic) 
(iii) HYPOTHETICAL: HO 
50 (iv) AMTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIJAS GINGIVALIS 

55 U>0 FEATURE: 

(AJ HAME/KEY: mis cofeature 

(B) LOCATION l..,999 



60 



(Xi) SEQUENCE DESCRIPTIOr-l: SEQ ID NO: 193 



ATGAAAAAGT ATTTGTTATA TGCCTCGTTG CTAACGAGTG TTrfGCTCTT TTCCTGTTCA 60 

AAGAACAATC CTAACGAGCC GGTGGAAGAC AGATCCATCG AAATTl^CTAT AAGGGTAGAT 120 

GATTTCACCA AAACC.GGl'GA GGCAGTACGC TATGAAAGGA ATCAAGGAAG TGCVGCCGAA 180 

AGGCTCATTA CCAATCTTTA CCTCTTGTTG TTCGATCAGT CAGGGGCGAA TCCGGCGAAA 240 

65 TACTATATTA CCGGTAACAC TTTCACCGGA GGGACCTGGC TTCCTGACGA TATGAAGGTG 300 

AAGTTGGATA TGACACAATC CGAGGCCGGA GAGCGCAAAG TATATGTCGT AGCCAATGTT 360 

GATAATGCGG TTAAAACGGC TCTTGATGCT GTCGCTAACG AAAGCGATTT GCAGACTGTA 4 20 

AAGAGGACGA CTGCAATGCC GTGGTCGACC GATATAGCCT CTCCTTTCCT GATGTCCGGA 4 60 

AACAAGACAC ACGACTTCTT GGCCAATCGT CTTTTGGACA ATGTGCCCCT TGTGCGTGCC 540 

70 ATTGCCAAGG TGGAGCTGAA TATCTCGCTG AGTGAGAAAT TTCAGATTGT GCCGATAATT 600 

GTCAATGGTA GTTTGAGTGA GTTCAAGTTC AGATACGTAA ACITCGACAA 6GAGACCTAC 660 

GTAGTGAAGC CAACGACCAA GCCGGACAAT CTCATTAGTT CTGCTAATGG TGTTTGGCCT 120 

CAGATTACAG ATTGGACTGT ATGGGGTGCT TCCTTAAATA CTTCTCCTGC TCCGGATGCG 780 

GGCACAGGTT ATACATTGGA TGCAAATGGG AAGGTAACGG CACTACGGAT TGTTACCTAT 840 

75 CTGAATGAGC GCGATAGCAA AGGGGCTACG GTAGAGGTCG CATTGCCTCG TGTGGATGAT 900 



SUBSnrUTE sheet (Rule 26) (RO/AXJ) 



u rsii ,UL o mi « iu\ o icn -113 u 



wo 99/29870 PCT/AU98/01023 

131/490 

GGCACCCTTC CTCCTCCGGA ATTCGGTCCG GAGCTTTATC GTTTGCCTTT GCCGGACTVAG 960 
ATCCTGCGCA ATCATTGGTA CAAGTATGAA GTCGAGATT 909 



5 (2) THFORlIATIOfJ FOR SEQ ID Noels'! 

(i) SEQUENCE CHARACTERISTICS: 

(A) X.EMGTH: 945 base pairs 
(D> TYPE: nucleic acid 
10 (C) STRANDEDNESa: double 

(D) TOPOLOGY: circular 

(ii» MOLECULE TYPE: WA ( genomic » 

15 (ili) HYPOTHETICAL: NO 

(iv) AI^TI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
20 (A) ORGAN ISI-I: PORYPHYROMOWAS GINGIVALIS 

t ix ) FEATURE : 

[A) NAME/ KEY: rairc_f eature 
(D) LOCATION l..,945 

25 

(xi) SEQUENCF. DESCRIPTION: SEQ ID NO: 19-1 

ATGATCAGAA CGATACTTTC ACGATATGTA TCCTCGAACT TTTGGAGTCG GGGAGCTACC 60 

TTTTTTTTCA CGATTTTCCC GGCCTTCATC CTCGCCGCTA CTGCTTTGCC GGCTTGTGGA 120 

JU GGGGGTACTG CTTCAGGCTC CGATCGTACG CTGGCTGTGA CCATCGAGCC ACAGAAATAC 180 

TTCATCGAGT CCATTGCGGA TAAGTCGGTG CAGGTGGTGG CATTGGTACC GGCCGGCAGC 240 

AATCCGGAGG AATACGACCC TTCGCCTACC GTGATGAAGC GTTTGTCCGA AGCAGATGCC 300 

TACTTCTATA TAGGAGGACT GGGGTTCGAG CAAAGAAATC TCGCTGCCAT TCGGGACAAT 360 

AACCCTAAGC TCCCTCTTTT CGAAATGGGC AAAGCCTTGG CGGATGCCGG AAGTGCAGAT 420 

35 CTCCACGGCT CCTGCACAGA TCATTCTCAT ACAGACCTGC ATGCCCATGA TCCGCACTAT 4 80 

TGGAGCAGTG TGGTAGGGGC AAAGGCACTC AGTCGTGCTG CATACGACGC GCTTGTGGAG 54 0 

CTTTATCCGA ACGAGAAAGA CAAATGGGAC AAAGGGCACG ACCGTCTCAA CGGACGTATC 600 

GACAGCCTGA AGAGACTCGT CGATACCATG TTTGCCAATG GCAAAGCAGA CAAAGCCTTC 660 

GTCATATATC ACCCATCGCT CAGCTTTTTC GCCCAAGAGT TCGGCCTGCG GCAGATCGTC 720 

40 ATAGAGGAAG ATGGGAAAGA GCCTACGGCT GCCCACCTrC GTCGTGTGAT CGATCAGGCA 7 80 

CGTGCCGATG GTGTCAGAAT CGTATTTATC CAACCCGAAT TTGAAACGCG TCAGGCGGAG 84 0 

GACATCCCAC GCGAGATCGG TGCTCGTCCG GTAAGGATCA ATCCTCTGCG CAGCTCGTGG 900 

GAGGAGGA/\A TTTTACATAT TGCTCGCGCT TTGGCTCATG AACGG 94 5 



45 



55 



(2) INFORMATION FOR SEQ ID NO: 195 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2544 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 
{D» TOPOLOGY: circular 



(11) MOLECULE TYPE: DHA (genomic) 
(iii) HYPOTHETICAL: NO 
(Iv) AJITI -SENSE: NO 



60 ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOfJAS GINGIVALIS 

(Ix) FEATURE: 

(A) NAME/KEY: misc_feature 
65 (B) LOCATION 1...254 4 

ixl) SEQUENCE DESCRIPTIO^^: SEQ ID HO: 195 

ATGATCGGAA AAAAAATCTT TTTTATCCTG CTGGCGCTCA TTGCGTTCAG TGGGCTGAAC 60 

7U GCAGCGACAG ACACTGAGTT CAAGTACCCG ACCGATGCCA ATATCATCGG TCACGTCAAA 120 

GACAGCAAGA CGGGTGAACA CCTTGTCGGT ATCACTATTG CTATCAAAGG CACTACCTTT 180 

GGTACATCTA CAGATGCAAC CGGGCACTAC TATCTTCCTA ACTTGCGTCC GGGTGAGATC 240 

ACTTTGATTA TGCGTGGCAT GGGCTATAAG AGCCAGGAGC GCGTAGTCCG CGTAGAAAAG 300 

GACAAGACTA TCGAGGTGAA TTTCGAAGCA GAAGAGGATG CCATCAATCT GGACGAAGTC 360 

75 GTGATTTCGG CCAACCGCGA ACTCACGCTT CGCCGTCTTG CTCCTACTCT GGTAAATGTA 420 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



jp^) iV"h tF«" *p»h «ii rtr^h t?«»i .vtf ™Si ri^iT^ ' ^ 

iL.!i 'tin 'JlSi iCIi «JL i^i lOi ai iillji C a.;p-1i»31 , - 



"1 



WO 99/29870 PCT/AU98AH023 

132/490 

TT!^CGAAA AA<JTCrrCTC GCAAGTCAAT GCTTCTAACC TGGCTCAAGG CTTGTCATTC 4 80 

CAGCCuGGAG TTCGTGTAGA GAACAACTbT CA(SAACTGTG GTTTCAATCA AGTTCGTATC 54 0 

AATGGACTGG ATGGTCGTTA TGCACAGATC CTCATCX5ACA GCCGTCCCAT CATGAGTGCC 600 

CTTGCCGGTG TTTACGGTCT GGAGCAGATC CCTCCCAATA TGATCGAACG TGTGGAGGTA 660 

GTACGTGGTG GAGGATCGGC CTTGTACGGT TCTTCTGCTA TTGCCGGAGT GGTGAATATC 7 20 

ATCACCAAGG AACCTTCTCA CAATTCTTTC ACATTCAATG AATCTCTGAG CTTTACCGGT 780 

TTCAGCAAGC TGGATAACAA CACGAACTTC AATGCCTCCA TCGTCAGCGA TGACAACCGT 84 0 

GCCGGTGCCA TGGTATTCGG GCAGGCTCGT TACCGCAACC ATTGGGATGC TAACAATGAC 900 

GGTTATTCCG AATTGGGTAA AATAGATGCC CGCTCGCTGG GAGCGCATTC TTATTTGCGC 960 

lU TTGAGCGACT ACAGCAAATT GACGGGAGAG TTTCACACGA TCAGTGAATT CCGCCGTGGT 1020 

GGCGATCGTA TCGATTTGCC TCCTCACGTA GTGGGTGTAG CTGAACAAAC TGACCATAGC 1080 

GTAITTAGCG GAAACTTGAA ATACGATCTC TTCtCTTCCA ACTATAAACA CCACTTCCAG 114 0 

GCTTATACTT CCGGAC'AGAT CGTAAATCGC AAGAGCTATT ACGGAGGTAT CGGAGAGATT 1200 

GACGTCAATG GCCACCCCGG TGGTACGGAA GGCTACCCTA TCCCTCAAGA TCAATACGGC 1260 

10 AATAATTATG GCGTGACCAA AGGCAAGACA TATATGGGCG GTATCCAGTA CAGCTACGAC 1320 

TTGGACAAAT TCCTCCTCAT GCCTTCGCAA CTTTTGTTCG GAGCCGAATA TACGCGTGAT 1300 

GAACTCAATG ACGTGATGCC CATCCTTTCA TGGCAGACCG GCGAGGATGC CAATGGGAAT 14 4 0 

ACCATTCCCC TCTATCCCGA ATTGGATCAG AATATCAACA ACTACAGCCT ATTCGGTCAG 1500 

AACGAATGGA AAAATGACAG ATGGAGCATC CTTGTTGGCG CTCGCTTGGA CAAGCATAGC 1560 
GAAGTCAAGG ATATGATTCT GAGTCCTCGT ACCACACTGC GTTTCAACGT GAATCCGGAC 
ATCAACCTGC GCGCl'ACATA TGCAAAAGGG TTCCGCGCAC CGCAGGTATT CGATGAAGAC 
TTGCACGTAG GGGTTGTAGG CGGTGAGGCA CAGAAAGTAT TCAACGATCC GAACCTCAAG 



20 
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1620 
1680 

CCTGAAATTT CTCATGCATT CAGTTTGAGT CCCGATATGT ATCATCGTTT CGGTAACGTC 1800 

^'i^^?^^ TCCTTGTGGA AGGCTTCTAT ACTCGTTTGC TGGATGTATT CACCAACGAG 1860 

ATCAGCACGA TGGCATCAAA CGCTACACGC GTATCAACGG TAGCGGAGCC 1920 

AAAGTATTCG GTCTCAATCT CGAAGGTAAG GTCGCATACA AGTCCTTCCA G'JTCCAAGCC 1980 

GG-rCTTACCC TGGCCAGCAA CAAATACGAC GAAGCACAGG AGTGGGGTCT GAATACGGTG 204 0 

AAAGACACCA ACGGAGCTTT TGTTACCGAG GCCAATGCAA ATGGACAACA GGAATACAAG 2100 

SiS?™^^ TGACGGATAC GCAGATCACC CGTACCCCCA GCGTATACGG TTATTTTACT 2160 

TTGGCCTACA ATCCTGCTCA CTCATGGAAC ATAGCCCTTA CGGGAGCATA TACCGGTCAG 2220 

ATGTATGTAC CCCACGCTAT CGAATATGGT GTGAAGTCTG CCGAACTGGA TATTATGCAG 2280 

AACAATCCTG AGATTACCGA CGAAACCGGA AAGGCTCCCC GTATTGATGA GCTGAAGAAG 234 0 

ACACCTGCAT TCTTCGATTT GGGCTTGAAA GTGGGTTATG ACTTCCACGT ATTCCAGGCT 2400 

ACTGAGGrrc AACTCTATGT AGGTATGAAC AATATCTTCA ACTCTTTCCA GAAGGACTTC 2460 

CTGCACGTGA CAGCGGATAT ATCTATGGTC CTACGCAGCC GCGTACAGGC 2520 

TACATGGGCT TGGTAGTGAA GTTC 254 4 

(2) INFORMATION FOR SBQ ID NO: 196 



(1) SEOaENCE CHARACTERISTICS: 

(A) LENGTH: 606 base pairs 

(B) TYPE: nucleic acid 
-c: *C> STRAMDEDNESS: double 

ID) TOPOLOGY: circular 

{in MOLECULE TYPE: WJA {genomic) 
(iil) HYPOTHETICAL: NO 

ou 

ixv) Ai^JTI-SENSE: NO 

(vi) ORIGINAL SOURCE; 
gg lA) ORGANISM: PORYPH^'Ra-IONAS GINGIVALIS 

( ix ) FEATURE : 

(A) NATIE/KEY: misc_feature 
IB) LOCATIOt^ 1. . ,606 



(xiJ SEQUENCE DESCRIPTION: SEQ ID NO: 196 

ATGACAGTAA AGCGCGCAGT GCGAATAGCA CTTCTCACGC TGATAGGCAT TCTTTTTTCC 
TCACCTTCl'C TTGTTCGGGC GCAAAGTCTT TTCAGCACCG AACATGTCTT GCAACTATAC 



60 

65 TCTATGGAGA GTCGGCGGCG GATACCGTCG CAGAGAAAAC GGCAGGTGAG 180 

I^SSS^n^ CTTTTATAGA CAAACTCATC AATCTCGGCC GCACTTTCCT CGGCAAACCA 240 

l^J^^^^'^ GCGGTCCTTC CCCATGGCCG ATGGACTGCT CGGGCTATGT GTCTTACCTC 300 

l^^^l 'l'^^*=ATCAA ACTCCCACGT GGTC^GGCAG CACAGAGCCA ATATACGAAT 360 

r-^Tji GCGAGGATGT TCGTCCGGGC GACCTCCTTT TTTTCAAACG CCGCAATGCA 42 0 

^SS? GTATCGGGCA TGTAGCTITG GTCGTATCTG TCGATGAAGA TGATATTA^C 4 80 

!Z?^J^^^^^ GCCGCAATTC GCGAGCGATC GTGATCGAAA AACTCAATCG CAGTGCATAC 540 

TTCTCCCGTC GCTTGGTGAG CTATGGCAGG GTACCCGGAG CCAAGAGAGT GATCCCACGA 600 
AAAAGT 



75 (2) I^FFORllAT^ON rOR SEQ ID NO: 197 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



W^i iF^i tP^i iP^i .rrJ' tT^. -rtJ' '""ii ti«ii -ri-^h rf"^! 

^"^^ Cji .JL. iLji O la ILii 'O O 



WO 99/29870 



PCr/AU98/01023 



133/490 



10 



15 



20 



(i) SEQUEUCE CHARACTERISTICS: 

(A) LEHGTH: 1365 base pairs 

(B) TYPE: nucleic acid 

(C) STRAtlDEDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: KO 

(iv) AUTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROHOMAS GINGIVALIS 

(Ix) FEATURE: 

(A) NAI-1E/KEY: misc_f eature 

(B) LOCATION 1.,.1365 

(xi) SEQUENCE DESCRimON: SEQ ID NO: 197 
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35 



40 
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ATGAAACGGA 
GCACAAAATG 
ATCCGGATAG 
CGAATCATTA 
GGACACAACT 
TCGATGAATA 
CTGCACGACC 
GCGCtTTGAAG 
GAAATGACTC 
GCCGAAATGG 
ATGGCCAAGG 
GACTTGGGGC 
ACAGACGTTC 
ACGGCTCTCG 
GAAGGTCTGG 
AGCAACGGTT 
GAACAGTGGA 
TCTGCCATGC 
CTTCGACTCC 
GCCGTGGCAG 
GCATACGAAT 
GAGGCAAAAA 
ATATACAAAG 



CAATCCTCCT 
AATCCGAAGC 
CCAAGGAGCG 
GTCTCGATGC 
ATAGCTTCGG 
CCAATCTCAG 
TCAAGCAGCA 
ACCTCAGCCT 
GTACGGCAGA 
TTCGCGTAGG 
ACGAACAACT 
AAGCCCTCGA 
TCGTAGCAGA 
GCITGAAACC 
CCTCGGCTCG 
ACTT CCGCG A 
AGAACAACGG 
AAACGCAAGA 
TCGAAGAGAA 
CCGATAAGGC 
ACGCTCGCGA 
CAAAATACGC 
CCAAAGTTTT 



GACGGCACTG 
ATCAACCAAT 
CAACCl'GAAT 
AGCAOGACAC 
ACGTTCGAAA 
CATCGGAGCT 
AAAGTACAAC 
GGAAATCGCG 
AACACAGTTG 
TAAATGGGCA 
TCTCGTACAA 
ACTGGAGCAC 
AAGGTTGGGA 
GGCACTGCAT 
TGCGGCATAC 
CCTCGGCAAG 
CAGCTACAGT 
TCGCGTTCGG 
AAAAGCCCTC 
CATCGCAGCA 
CAGCTTCGAG 
CCTCAGCCAA 
GGATTTCTAT 



ACCGTCCTAT 
CCGATGTCAG 
CTGCGCAGAC 
AGTTTCCTGC 
GACAAAACGG 
TCGGTGGAAG 
GTGGAGGATG 
GCTCTCTATA 
GCACTGATTC 
GAGGGTAAGC 
TATCGTTCGG 
CCCGAAAGCA 
TCTCTCCTTG 
TCGAGCGAGC 
TTCCCGACGC 
GAGTATGCCG 
ATCGGACTCT 
AGCAGTCGCC 
TATAAAGAGA 
GCCGAAAACA 
GCAGGGCGCl* 
GTGGAAGAAC 
CAGGGCAAAG 



CTTCGCTCTC 
GCCTCTCCCT 
AGGAGATCGA 
CCTCGGTCAA 
GAGTAACCGT 
TATTCAGCGG 
GTATAGCCCG 
TCAATTTGCT 
GCGAGCAACG 
TCCTCGACAT 
AGGAGGAGCT 
TTGCAGTCAA 
CTCCCGAAGA 
TGCAAATAGC 
TCAGCCTCTC 
CCATCAACCC 
CTTTGAATAT 
TGCAAATACG 
TCAGGCAAGC 
GCAAGGCCGC 
TGTCTGCCTA 
TTCGTGCCAA 
ACTTC 



CTTGCTTCGT 
CGAAGACTGT 
ACAAGAAAAC 
TGCAGGCATC 
AGATCGCTCC 
CACACGTCGT 
ACTTCAAAAA 
CTTCCCTCAG 
CAATCGCACG 
AAATGCCCAG 
GGCTCGTCTG 
GGCTCCCCAC 
GATCTATCGC 
TTCGGCACGC 
TGCCGGATAC 
CTCCTTCTCC 
CCCCATCTTC 
CTCAAGCGAG 
ATACAGCAAT 
TACGCTCAAG 
CGAATATGCC 
GTATGACTTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1365 



50 



55 



60 



65 



70 



75 



{?.) INFORMATION FOR SEQ ID NO: 198 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1332 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 
(D» TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (gGnomlc) 

(iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: POKYPHYRaiONAS GINGIVALIS 

(ix) E-EATURE: 

(A) NAME/KEY: misc_featurc 

(B) l/XATIOH 1...1332 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198 

ATGCGTTTCC AACATTATCT CATCTGTACG GCTGCCGTAG CGGCTTTGGC TGCGAATCCC 
CTTACGGGCC AATCGAATAT GACCCTCGAA GAGTGCATAG ACTATGCACG CCGGCACAGT 
TCGGCCGTGG CGCTGTCCGC TGCGGAACTG GAGCAGTCCA AGGCCGATTA CCTTCAGGCC 
GTCGGCAATl* TTCTGCCCCG TGTATCGCCC GGAACCGGTG CTTCGTGGAA TTTCGGACGC 
GGATTGGATG CCGAGACGAA TACCTACACC GACATCAACA GCTTCAACAA TTCGTACAGC 



60 
120 
180 
240 
300 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



iJ^li ir'^h if=^i .ti! ™h n^i ■if^'h ^i" iF^Hi tP^, ti'^, 

liji "^^a .tioJ „.iL c:: o o aa ■ iUii a .u. ji 



WO 99/29870 



PCT/AU98/0ia23 



134/ 490 



ATACATGCCA 
CATGCACGCC 
GGCACCACGG 
CAGAAOTACG 
ATGAAGAGTC 
CTGACTCAAG 
TTCCCCATCG 
ATGGCCGAAT 
CTCCTCCCfl'G 
GCATTCCTGC 
AATGGATCGG 
ACTCTGAATC 
GCGCGTGCCG 
AGCGAGATC6 
AAGGAGCATA 
GCCCTGAATA 
CAGCGACT'.IA 
TGCCTTTCGG 



CGATGACCCT 
GGGAGGCTTC 
AGGCCTACTA 
AGGAGAGCAG 
GTCCCGATGT 
CGGACAATCA 
ATCACGAACT 
CGGACAGCTC 
CCAAACTCGA 
CGAGTGTGTC 
ACTATACGCC 
TGAGTATCCC 
AACGCAGGGC 
CCC AAGCCAT 
CCGACGCCAT 
CGGCCATCGA 
GAGCGGCCAT 
AC 



TTTCGACGGT 
GC3CCTCTCC 
CGACCTCGTC 
CCGCCTCCAC 
CCTCGAGATG 
GTGCATCATC 
CGTCGTAGAC 
GGCCGGCGTC 
CGAGCAGGCT 
GGTATCCGGA 
CTTCAGCGAG 
CATCTTTTCG 
GGCAATCGTC 
GGCCGACCGG 
GCAAACCGCT 
CCTGACCACT 
GACCTACCGG 



TTGCAGAGTG 
GTTCXICGAGC 
TATGCGCGCC 
CGGCAGACGG 
CAGTCGCGAA 
GCTCTGATCC 
GATATGCCGG 
TTCGCCCGTG 
GCCACCGACC 
GGATGGAACA 
CAGTTTCGGA 
GGATTCAGCC 
CCACGGGGCG 
GATGCCGCTC 
TACGAAGCCG 
CAGGCCAATC 
CTCAAATGCA 



TCTATCGACT 
AGCAGGAGCT 
AAATGCAAGA 
CTCGAATGGA 
TGGCCGGTGA 
GGCTCAAAGA 
CTGACAGTCT 
CTGCCCATCA 
GTTTGCGAGC 
CGGGATTCTC 
ACCGTCGGGG 
TTGTGAGCCA 
AAGCGGAGCG 
TGGCTTCCTA 
TCTTGCAGCG 
GGCTCCTGGA 
AACTCATAGC 



GCGGATGGCG 
GGCAGCTCTC 
GCfGGCCATG 
AGAGCTGGGG 
CCGTTTGGCC 
AAAAATGAAC 
CTCCGCCGAC 
TCATCCCGTC 
CGCGCGAGGT 
ACGCTTTTTG 
GGAATACGTC 
TCTGCGTCAG 
CAGGCTCTAC 
CCGCCAGGCG 
TTATGAGGAG 
TGCCCGTGTG 
CTATTACGGC 



360 
420 
480 
640 
6C0 
660 
720 
780 
840 
900 
960 
1020 
lOBO 
1140 
1200 
1260 
1320 
1332 



(2) IHFORIIATIOrV FOR SEQ ID NO: 199 

(i) SEQUEUCE CHARACTEFU3T1CS: 

(A) LEtJGTH: 2820 base pairs 

(B) T*jfPE: aucleic acid 

(C) STRAIN DEW^BSS: double 

(D) TOPOLOGY': circular 

(11 ^ MOLECULE TYPE: DNA (qenomic) 
(iii) HYPOTHETICAL; HO 



(iv) AHTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISM: PORYPHYRaiOHAS GINGIVAL IS 

(ix) FEATURE: 

(A) NAI IE/KEY: misc_f ea ture 

(B) LOCATION 1...2e20 

(Xl) SEQUENCE DESCRIPTION: SEQ ID HO; 199 



ATGAACAAAT 
ACTGCACTGA 
AGATCCGC:CG 
AATCCGGAAG 
CGGATAGGAC 
TCCATCGGAG 
ACGCTTTATT 
GACCATGAAA 
GCCACAGAGC 
ACTTTGCCTG 
CCCGTAACGG 
AACATCAATT 
ATCATGGTAA 
GGAGACTTl-A 
GGTGTAACGC 
TGCAGCAATG 
GCTTTCCTCC 
CCl'CTGCGCT 
GGTGCCGGTA 
CCGGCTCTGA 
TTCAAATACG 
AATAAGCACG 
TACGGCAGAC 
GACATCTATC 
GACGGTTATA 
GTCGAATTGA 
TACCACATAT 
GCCATCGACG 
ATTTATCCCT 
GACCTTGCCA 
GGAGGAGTAT 
GAAAGCCCCA 



TTTACAAATC 
CCGCTTCTGC 
GAACGCATTC 
ACCTGATCGC 
AAGTAATACC 
ACGTAGATGT 
ACGATGCATT 
TTGTCTTGG<3 
CGGTACCGGG 
ACATCAAGAT 
ATAACCATTA 
GTCCTGAAGG 
AAGGACAGTA 
CTCCGCTGAT 
AATCCGAGTT 
GTACATTGGC 
CGATCAAAGG 
ATCGTGTCTA 
TTCATCATCC 
ATACATGGAT 
ATCAAGGTGG 
TGGTCGGCAC 
TGAACAGTCA 
TGGATCCCCA 
AGCCTTTGCC 
ATTGGACGGC 
TCCGAAATGG 
AAAGTATTAT 
CGCCGTTGGA 
TAGGAGACAT 
CATTAAGCTG 
ATCCTGTGTT 



ACTTTl'GCAG 
ACAGATTTCG 
ATTCGACGAT 
ACAGAGCCGT 
GGTGGATGTG 
ATATCGCCTG 
CAATATTCCG 
AGCATATACG 
GAGTGAGCTT 
CTCCGGTGCG 
CGGGATCGGT 
TGCAGACTGG 
TATCTCAATG 
CATTTCTGCC 
GGATAAGTGG 
CATCTTCCGT 
TAAATCCGAT 
TTACAATGGA 
GGCCGGAGAT 
CTCCTCCACT 
TACGGAAGGA 
ACTGACCGGA 
TTGGAACGAG 
AAACAATGGC 
CTCTGTGCCC 
TGTTCCTGCC 
AAAGGAAATA 
CGGTAGCGGT 
TGGAGTGGAA 
TCAGACCAAG 
GAAAGTTCCT 
CAAAACCTTT 



TCAGGACTGG 
TTCGGAGGGG 
GCAATGACTA 
TGGCAATCGC 
GACTTTGCAT 
CAATTCAAGT 
GAGGGC'SGAC 
AACGCCACTC 
ATTATGGATT 
GGTTATATAT 
GAGGACGATT 
CAGGCAGAGA 
TGCTCAGGCA 
GGACACTGTG 
ATCTTCACTT 
CGCAACAGTA 
GGTCTCTTGC 
TGGGACAGTA 
GCCATGAAGA 
GGTTCCGGAG 
GGATCCTCCG 
GGTGCCGGCA 
TATGCATCCG 
CAGACGACCA 
CGGCTATTGT 
GATCAATATC 
GCTACGACAA 
ATCATTCGAT 
TCTTATAAGG 
CTCAAGCCGG 
TTCTTAAGCC 
GAAGTGCCCT 



CTGCCTTCGT 
AACCCTTGAG 
TCCGCCTTAC 
AAAGAGATGG 
CCAAGGCTTC 
TGGAAGGAGC 
GCCTCTATAT 
ATCGCCGCAA 
ATGAAGTGTC 
TCGACAAAGT 
CCGATTCGGA 
AGAACGGTGT 
ACCTGCTCAA 
CTTCCATAAC 
TCCACTATGA 
TCATCGGAGC 
TGCAACl'CAA 
CGCCCGATAT 
TTTCCATCCT 
GGACTGACGA 
GTl'CTTCTCT 
ATTGTGGCGG 

atggcaatac 
tcctca;».cgg 
tgcagtctac 
catcatctta 
aggagttgtc 
acgaagtaag 
atacggacaa 
acgtaacacc 
agttggtttc 

ATGTI'TCTGC 



gtcgatggca 
tttctcttca 
tccggatttc 
ccggcccgtc 
gcacatctct 
caaagccatt 
ctataccccc 
cggagctttt 
tcgcggaggg 
cggcggacgc 
ttgcgagatc 
ggtgcaaatg 
taatacgaaa 

AACCftATTl'C 
A7WWGAGGA 
TTCCATG7\AG 
CGAtGAAGTC 
TCCCTCGAGC 
AAAGAAGACT 
TCACTTCTAT 
CTTCAATCAG 
GACGGAGTTC 
GAGCCGCATG 
AACGTATCGT 
AGGCGATCAG 
TCAGGT CG AA 
CTATTCGGAT 
CGCACGCl'TC 
GACTTCTGCC 
TCTCCCCGGA 
CCGATTCGGA 
CGCAGCCGCA 



60 
120 
130 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
SOO 
960 
1020 
1090 
1140 
1200 
1260 
132C 
1380 
1440 
ISOO 
1S60 
1620 
1680 
1740 
1800 
1860 
1920 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ir^i fP-"' iP«M „ii ""'h tf^i iF=fi .rti' eF^i -ti=H! «l==h 

a-iii Z13f 43 «iL jC: lO m cm aCr. Q U„i! iUi 



wo 99/29870 PCT/AU98/01023 

135/490 



CAAACCCCCA ATCCTCCCGT TGGCGTAGTC 
CCCCAAAAGG CTGCTATCGC TGCCGTTTAT 
CACCTCTTCC TCAAGAGCAA CACAAACAGA 
TGGCAGGCCG GAACATGGTT GAGGATCAAT 
5 CATATGCTTT TTGCCGGTAT CAGAATGCCT 
TATGTAAGAA ATCCGGATAA CCTTTTCTCC 
GGAGTCTCTT TCGAAGGCTA CXSGAATACCC 
CTGGTGGTAA ATACCGATGC TCCGAAGATC 
AAGGGAACGA ATGTGGCTCC ATTCCCCGAA 

10 ACATTTATCG GCACACAGGA TCCATCCGTC 
AGCGATGAAT ACGAAATAAA ACTGGTATAT 
CAGATTGAGA ATAACAATGC TGTCGTTGCA 
ATTAAGAACG CTCATATGGT TCACGCTGCC 
CGTTCTTGGA ACAACCTCCG CAATGGCGTG 

15 ACTTATATGC TCGTTATGCA GACGGCAAAC 



ATTGCAGACA AGTTTATGGC CGGTACATAT 1980 

GTAATGCCAT CCGCTCCGGA CTCTACTTTC 204 0 

AGATTGCAGA AGGTGACAAC TCCCTCCGAT 2100 

TTGGATAAGC CGTTCCCGGT GAATAATGAC 2160 

AATAAGTACA AGCTCAATCG TGCTATCCGT 2220 

ATTACCGGTA AGAAGATTTC ATATAACAAC 2280 

TCGCTCTTGG GCTATATGGC TATCAAATAT 2340 

GATATGTCGC TTGTACAGGA GCCTTATCCT 24 00 

TTGGTCCCCA TATATGTCTA TAAGAACGGA 24 60 

ACAACTTATT CGGTTTCAGA CGGAACAGAG 2520 

AAGGGATCGG GCATTTCGAA TGGCGTTGCT 2580 

TATCCGTCTG TTGTAACAGA TCGTTTCAGC 264 0 

GCCCTCTACT CATTGGATGG CAAGCAGGTT 2700 

ACATTCAGTG TTCAAGGACT TACGGCCGGT 2760 

GGCCCTGTGA GCCAAAAGAT CGTGAAGCAG 2820 



(2) INFORMATION FOR SBQ ID NO: 200 
20 (i) SEQUENCE CHARACTERISTICS: 
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60 
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(A) LENGTH: 2010 base pairs 
(Bt TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 



(il) MOLECULE TYPE: DNA (genomic) 
tlti) HYPOTHETICAL: NO 
30 (iv) AJ^Tr-SEI-ISE; NO 

(vi» ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOtlAS GINGIVALIS 

35 (ix) FEATURE: 

(A» NAME/KEY: mlsc_feature 

(B) LOCATIW 1...2010 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200 



ATGAAATATC TTATCAGACl* CTTCTTATCA TTGATCTTAC TCTCTCTCTG GACGGGCTGT 60 

ACACACGAGG AGCTCTCTAT TTGCGATGGC GAGAATACGC TTGTTTTACG CGTAGAGACC 120 

GGTAAAGCCC CAAATGCTCG TGCCACAGAA CCCGGTCAGG GCATATACAA TGAGAATAAA 180 

GTACGCTCCA TTTCTGTGCT CTTCTATTTA GAGGGACAAC TTCGTTGGCA GGTGAAGTCT 24 0 

ACAGACTATC AAATCCATGA AGGGGCCTAT ATCATTCCGG TCAAAGAGCA AATGCGACCA 300 

CTATTCAATG GCAACAACAA CTTCAGCATC TATGTAGTGG CCAATCTCGA TTTCAATGCT 360 

CCGGCCACAG AAGCTGCGCT TTCTCAATTT GTGGTAGAGA AATCTATTGA AGTCTCTTCT 4'>Q 

ACGACAGCCC CTGCCGATTT CGTAATGCTT GCTCATGGCA ATAAGCAGAT CAATATGGCT 4 80 

ACGACAGAAG GGAAACTGTT GGGGGATTAT AAACTCAAAC GAGTGGCAGC AAAGATTCGC 54 0 

ATGATAAAAC CCACCATCAA TGTGCAAGGA TATGAAGTGG TCGGAAATAT ACAGGCAAAG 600 

TTTCGCAATT CGGTAACGAA GGGGTTCCTT ACGACAGAAG CTCAAGAGAT CCCAGCTGCT 660 

GCATCCTATA AGACATCGGA ATATCTTGAT ATTGCAGAGT CGGCACCTGC CAATTCTATC 720 

CATTTCTATT CTTACTATAA CAAATGGACA CTCTCCACAC CGGAGAAGCG ACCGGAATTC 780 

TTCATCATGG TCAAATTCAA AAAGACAGGA CAGCCGGACA ACACAGCCAA ACCGTACTAC 84 O 

TACAGAGTGC CCCTCGAATC TCAGGACAAT CAGGTCAAGA GCAATGTCCT CTATAATCTG 900 

AATGTGAAAA TCGAAATCTT GGGTTCTl'TA CAAGAGCCGG AAGCTGTTTC TGTAAACGGC 960 

ACACTCGCAA TAGAAGAATG GATTCTCCAT CAGGAl-GCAT TCAATCTGCC TGCCACCAAT 1020 

TACTTGATAG TGGAACAGCA CGAAATCTTC ATGAATAACG TGAACACATA CTCGGTGAAA 1080 

TATCAAACTT CGCAGAAACC AATCAGCATT AGCATACAGT CAGTTACCTT TAGCTACGTC 1140 

TCTTCTGATG GCACTCAGCA CAATGATCTT GTAGCAAGTA GTAGCGACCA GTATCCfACG 1200 

ATTACAAGCG ATAATACAAG CATCATAATC ACTTCCAAGA TACCGGTTAA TAACGTACCA 1260 

AAGAAGATCG TTTTTGAGGT AACTAATGGG GTAGCCGGTT TGAAAGAGAC TGTCACAGTA 1320 

CTCCAATATC CTGCACAATT TATTGTCAAT ACACTTGGCA CAGCATCGUC ATGGAGACCA 11 BO 

GACGGATCTT TGGCTCCGGG GCTTAACAAT AAAGCGATTT ACCATGTCGT AGTACTGGTT 14 4 0 

CCACCCGAGA ATTTATTTGA AGATGGGACA CAGACAATCA TCGGTTATCC CCCCACTGAA 1500 

ACAATTTCTT TTCATAAGAA AGAGAACAAT ACCTATCCGA TAGTATGGTC TGACACAAAT 1S6C 

ACGACAAAAC AGGACCTTGA GACATCAAGA ATGATTTCAC CITCCTTTGA GTTAGCCTCC 1620 

CAACTTGGGG CTACTCTCCC GATGCCCTAT CTCGAGTATT GGCCAGGGAC ATCATATCTC 1680 

CTTGACTATT CGGGAAACTA TAATAATAAG AGATACGCCT TGTTTAATTO CGCTTTTTAC 17 40 

TGGGAGAAAA GAAAAGTTAA TAACGAAGAA ATTAAATTCG ATGACTGGCG TTTCCCGACA 1800 

GAAGCTGAGA TCAAATTGAT AGATAAGCTG CAACATAATG AGCAGAGTGC TGTCCAAGCT 1860 

ATCATGACAG GGAATTATTA TTGGGATAGT TACTCTGCAA ATGGGTCTTA TAAAATGCAA 1920 

GGAGGAGGGG GCCAAGGAAA TTCCTCCAAA GCCTATGTTC GTTGCGTGCG GGATGTGAAA 1980 

AAGCCGATTC GTGACAAGAA GTCAGGTAAG 20 j 0 



75 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



-ff^*^! ^i"Ht iP"" tl'«»i „fl ™U tP^ni .11*' -*»*' ""i" tr'^i' 

ii„3 ^-off innii „aL in« q o a, o mr„ icii iUJ cjj 



WO 99/29870 



PCT/AU98A)1023 



136/490 



10 



15 



20 



25 



30 
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(2) rUFORMATIW FOR SEQ ID HO: 201 

(I) SEOUENCB CHARACTERISTICS: 

(Al LENGTH: 3846 base pairs 
(e) TYPE: nucleic acid 

(C) STRAWDEDTJESS : double 

(D) TOPOLOGY: circular 

(ii) MOLECULE Ti-PE: DNA (genomic J 

(iiij HYPOTHETTCAr.: NO 

ilv) AlJTI-SEhlSt: NO 

(vi) ORIGII-IAL SOURCE: 

(A) ORGATJISH: PORYPHYROMOIJAS GIMGIVALIS 

(ix) FEATURE: 

(A) NAME/KEY: mlsc feature 

(B) LOCATION 1-..384 6 



(xi) SEQUENCE DESCRIPTIOI4: SEQ ID NO: 201 



ATGCGAAAAA 
ACTTGGCATG 
AGTTGTACCC 
GAAGGCAACC 
GGCGAAGCTG 
AACGTTGCl'G 
CCTACCTATG 
ATAAACAATG 
CAAGACGGGT 
AACCCTGT CC 
AATCCTCACC 
TTTATTAATT 
GGTACAACAA 
TTGGTTATTT 
CTGTGCGAAC 
GTATTGAATA 
TTCA'ITCGCT 
TACGTGCTAC 
GTCCCAACCT 
ATATGCGCCT 
rrTATCGGTC 
ATCAACAAAG 
AAAGGCTGCC 
6TCAACTCTA 
AATAATGGTT 
GGGCAGGGAC 
TTATGTACGA 
GAAGTCCTGA 
GCCACCCAAT 
TTACCTTATT 
ATTATCAATA 
CCTGCACTAA 
AACACCATTA 
AAAGGAGTTT 
GAAATAGGCA 
GTTTACGGCG 
AACTTGTTTT 
AGTGCCCCGC 
CGAA'jTGACA 
TCGGCGATAA 
GCAAAGATTA 
ACGCCAGGCT 
TACATTTCCG 
7\ACAGAACGA 
AGCACGCTTT 
GGAGTACGTC 
GAAGATAAGC 
TTCCCTCTCA 
AACGACACCT 
TTGTTTGATC 
GATGGATTGC 



TTTTGAGCTT 
GAGATCCGGA 
GAATCACGTT 
AAGTCTTTCA 
AATTGCCTGT 
TAAAAATCAA 
TCGTAGAGGA 
ACTATTATAG 
ATTTTCGCTC 
GACAAGAAAT 
CACCTTTACA 
ATGAAGCTGA 
CGTACATAAG 
ACGATGATAT 
ATAGAGCCIT 
GCTTCCCATC 
CAGTTTACTVA 
TGATCGGAAA 
CTTTTATTCA 
CCGACTATTT 
GATTTAGCGT 
AAATCTCATA 
ATGCTCCAAT 
TCTTAAAATC 
CCCATCATTT 
TCGACGTCAA 
GTCTATCATG 
CCACATACGC 
ATGCCGTTTA 
CTTTATATCA 
CTAATTCTGT 
ACATTATGGC 
TAAGCAGTCC 
TGCATTTTAC 
ATCAGGCTAA 
ATGGTCTTGC 
CTACGCATTC 
TGTATACAAC 
TTATTTCAAA 
CGGTATTCAA 
CCGACAATAC 
TAACAGCAAC 
GTAATAAAAT 
ACAGACTCCA 
ATAATTCCTA 
TCCTCAACAA 
AGACGTTTAT 
ACTTCCATTA 
ATACGAATCG 
CGAATCAGGT 
CAAATGGGAG 



TTTGATGATG 
CTCAGl'GGCA 
OoAGGTTGTT 
GCGCATTTCC 
TTTGAAAAAG 
AGAGACGGAG 
GTTGCCTGAG 
CCAAAATGTA 
ACAAAGATTT 
TCTATTTGCA 
AAAAAACACC 
TGGCAAATCG 
CGGAAATGrr 
GTTCAACGTA 
CTACAACGGC 
AAATGCCACC 
CCAAAGCAAT 
ACCATTGAGC 
TAATGTCTCC 
TTTTAGTTGT 
CACCAATGCT 
TAATCCTATT 
CTTACGTTTA 
TAATCAGGTC 
TTATTTTAAC 
TACTCTAACA 
TAGTTCGGCT 
ACCTAACAAG 
TTTAGAAGGC 
CAATCTCTCG 
TGATACGTAT 
TCATGGCATG 
GATAACAATA 
TAATAATGGC 
AATATCCGGA 
GATTAACAAG 
GGTCATGCCC 
GAACTGTATT 
GAATTGTGAC 
GCCTATGGCT 
TmTTTGCG 
CTCCAATGCT 
AGTCAATTGC 
CAATATCACA 
TGGTATTTAC 
CAGTTGTTTT 
TTCTAATAGG 
CAACAGCTTG 
CTATATTGAC 
TTTCAATACG 
ATCGGGCAAT 



TGCTCTCTGC 
GCCCTACCTT 
TTCCCCGGAT 
ATGCCGGGTT 
ATGATAGCCG 
ACA'i-rCGACA 
GGGGGGACTT 
AGCCTCCCTT 
ATCGAAGTTA 
AAAAAAATCG 
GGCATATTTA 
GCGATAGAAA 
GCCAGCAACC 
AATCAACAAC 
TTTGATGTAG 
TCATACATCA 
GCGAAGAGGA 
AAATATTTGG 
TTAATTCCAA 
GTTTCGCCCC 
CATGAATTGC 
GCACACAAAA 
TTCTTAAAAG 
TCTGCAATAG 
ACTCATGGAA 
GCCCGATTGA 
GTAGCAGATT 
GGATTCTCGG 
CCCTGTCCTC 
ACTGTTGTTG 
TCGAAATTCA 
GAGGTTAGTA 
AAAAATGGTG 
TCCATACAAG 
GAGACCGGTG 
CAGGTAGAGA 
AAATTTCATT 
GTGGAGATAA 
CTAAGCGTTG 
ACAAGCTCCA 
ACAGGAAACT 
GCCATCAAGT 
GATGAGGCTC 
CGGAATGTGA 
AACCGAAATA 
TATTTCGATA 
ACTTGGCAGC 
CAGGGGGGAG 
GTTTCAAATA 
CCAGACTTCT 
AGCTCTGCTG 



ATTTAGGTCT 
CTATCGGTAT 
TTTATAGTGT 
GTGGCTCGTT 
TTCCGGAATT 
ATTATAATAT 
ATCTGGTAGA 
CTACTCACTA 
CCCTGTATCC 
AGGTTACAAT 
ACAAAGTAGC 
ATGATATGGT 
TCCCTCAGAA 
CACACGACGA 
CTGCTGTAAG 
ACGAAACTAA 
CTTTAGATGG 
erGACACTGA 
GTCATCCAAC 
TTGATACTGT 
ACAATCTGAT 
ATATTCTTTA 
AAATCGCCTC 
ACTCGATATT 
TGCCGACTGT 
ACAATACATC 
CAACl'ATTAG 
CTTTCTTAGG 
CGTCAGAATT 
GCGAAATGTT 
ACTTCAATTT 
ATTGTATTAC 
GCTGCCTAAA 
TCATGTCCCG 
CTAACCCCAC 
TAGACAATAT 
TTGACAGTGT 
GCAATTGCGA 
AAAACAGTAT 
TCACCGGATT 
TCGCCTACCA 
TAGACAATAT 
TTGTACTAAA 
TAAAAAACTG 
AGATCAGTAA 
ATGCTCCTGT 
TCTATTCATC 
ATACAGATAC 
ATCACTGGGG 
TCATTTGGAT 
AAGCAGTAGA 



ACAATCTCAG 
TCAAGAGTCA 
GGAAAAACGA 
TGGGAATCTG 
TTCAACAGCT 
CTATCCTAAT 
GGCTTTCGCG 
TGTCTATTCT 
TTTTCGATAC 
AACTTTCGAT 
OrCCTCTGCA 
GTTCAGTCGT 
CTGTGACTAC 
AATCAAACGG 
TATAAAGGAC 
ACTGAAAAAT 
CAAACTGGGA 
TAATACAAAA 
TTTTGGTTCC 
CGGCGATTTG 
TGAAAAGACT 
CGCAGAAGGG 
TGGTl'ACACA 
TGACTGCTTG 
TTGGGGGATA 
TTCGCAGGGA 
ATCGCTTGGA 
AGGAAGCAGA 
TTATGAATAT 
GCTATCATCC 
GCTTGGCGAC 
ACTACCAAAC 
AATACCGGAA 
AGGAACTCTG 
CTTTATTACC 
AGACCGACTT 
GAAATTCAftC 
ATTTACCAAT 
GTTTAGCAGT 
ATCTACAAAA 
TATCACAAAC 
TCCTGAGTAT 
TAATAGTGGC 
TAGGATTGGG 
CAATCATATA 
AATCAATGAA 
AAACGGTACA 
ATGGATTTAC 
CAACAATGAT 
ACCTTTTTGG 
ATTCCAAACA 
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fiCATTGGACT 
GTTGAAACCT 
GAGAAAATGT 
ATCATCTCTT 
GTGTGTGAAA 
TCCTATCAGG 
TTAGACTCAC 
TCGCTCGAAA 
GGTACTCTCC 
TCTATTTCGC 
TCCTGTTCTG 
CCCAAACATC 
GGTTTCTACC 
ATCAAA 



GTATTGGCAA 
ACCCGGAATC 
CAGGCAACGA 
CCCAGAACTT 
ACTATCAGTC 
ACAGTGTTTT 
TCAGAGGGAC 
GCCATCAAAA 
TGCCTCCATT 
CCAATCCGGC 
TAATAAAAAT 
TATCCGAAGG 
TGGTAACGCT 



TAGCGATTAT 
CGACTTTGGA 
TTACGAAGGC 
GTTCrCGACA 
TGCCATCGAT 
TGCAGTCATT 
TGGTATAGAT 
TGTAAAAAAT 
AGAATGCAAC 
GAAAGCTGTT 
ATATGGAATA 
TTATTACAGC 
AAATGTTGAT 



CTTTCGGCAA 
ATAGCTGCTT 
•rrGAAAGAIT 
GCTCATTTCC 
TGGTACGAAA 
GACCTTGGTG 
TTGAACATAC 
TATTTGTTGT 
AAATCAAGCC 
GTAACAATAA 
AATGGAGCCT 
ATACAGTTCA 
CAGAAAATTA 



AAGTGGCTCT 
TGAAGGAATT 
ATTTCAGATC 
TGTCTGCGCG 
ATCGCTTG7VA 
ACATTTATTG 

rrrccTGTGA 

CAACTCTTCC 
TTGATAAATC 
TCTACTATAC 
CGGCTGATAT 
ATACATCCAA 
TAGATACGGA 



CAAGATGATG 
GTTCAGGATA 
CAATCCAACC 
ATGCGATATT 
TAGTGAAATC 
GAATATGCAG 
ACAAAGGAAA 
CGAATCAACA 
CAAGATAATC 
CGATAACCCT 
AACCGGGTTG 
CTTTGATCCC 
AAAATTACGA 
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(1) SEQUEtlCE CHARACTERISTICS: 

(A) LENGTH: 3822 base paira 

(B) TYPE: nucleic ac3d 

(C) STRAl^OEDNBSS: double 

(D) TOPOLOGY: circular 

(±1) MOLECULE TYPE: DIIA (genomic) 

till) HYPOrHETICAL: HO 

(Iv) ANTI-SENSE: WO 

(vi) ORIGIIIAL SOURCE: 

(Al ORGAIIISM: P0RYPHYR0I-10HAS GIWGIVALIS 

(i>:) FEIATURE: 

(A) HAIIE/KEY! mi£C_f eature 
iB) LOCATION 1...3022 



tx±) SEQUEUCt; DESCRIPTION: SEQ ID MO: 202 
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ATGATGTGCT 
GTGGCAGCCC 
GTTGTTTTCC 
ATTTCCATGC 
AAAAAGATGA 
ACGGAGACAT 
CCTGAGGOGG 
AATGTAAGCC 
AGATTTATCG 
TTTGCAAAAA 
AACACCGGCA 
AAATCGGCGA 
AATGTTGCCA 
AACGTAAATC 
AACGGCTTTG 
GCCACCTCAT 
AGCAATGCGA 
TTGAGCAAAT 
GTCTCCTTAA 
AGTTGTGTTT 
AATGCTCATG 
CCTATTGCAC 
CGTTTATTCT 
CAGGTCTCTG 
TTTAACArrC 
CTAACAGCCC 
TCGGCTGTAG 
AACAAGGGAT 
GAAGGCCCCT 
CTCTCGACTG 
ACGTAn*CGA 
GGCATGGAGG 
ACAATAAAAA 
AATGGCTCCA 
TCCGGAGAGA 
AACAAGCAGG 



CTCTGCATTT 
TACCTTCTAT 
CCGGATTTTA 
CGGGTTGTGG 
TAGCCGTTCC 
TCGACAATTA 
GGACTTATCT 
TCCCTTCTAC 
AAGTTACCCT 
AAATCGAGGT 
TATTTAACAA 
TAGAAAATGA 
GCAACCTCCC 
AACAACCACA 
ATGTA/GCTGC 
ACATCAACGA 
AGAGGACTTT 
ATTTGGCTGA 
TTCCAAGTCA 
CGCCCCTTGA 
AATTGCACAA 
ACAAAAATAT 
TAAAAGAAAT 
CAATAGtf^CTC 
ATGGAATGCC 
GATTGAACAA 
CAGATTCAAC 
TCTOSGCTTT 
GTCCTCCGTC 
TTGTTGGCGA 
AATTCAACTT 
TTAGTAATTG 
ATGGTGGCTG 
TACAAGTCAT 
CCGGTGCTAA 
TAGAGATAGA 



AGGTCTACAA 
CGGTATTCAA 
TAGTGTGGAA 
erCGTTTGGG 
GGAATTTTCA 
TAATATCTAT 
GGTAGAGGCT 
TCACTATGTC 
GTATCCTTTT 
TACAATAACT 
AGTAGCCTCC 
TATGGTGTTC 
TCAGAACTGT 
CGACGAAATC 
TGTAAGTATA 
AACTAAACTG 
AGATGGCAAA 
CACTGATAAT 
TCCAACTTTT 
TACTGTCGGC 
TCTGATTGAA 
TCTTTACGCA 
CGCCTCTGGT 
GATATTTGAC 
GACTGTTTGG 
TACATCTTCG 
TATTAGATCG 
CTTAGGAGGA 
AGAATTTTAT 
AATGTTGCTA 
CAATTTGCTT 
TATTACACTA 
CCTAAAAATA 
GTCCGGAGGA 
CCCCACCTTT 
CAATATAGAC 



TCTCAGACTT 
GAGTCAAGTT 
AAACGAGAAG 
AATCTGGGCG 
ACAGCTAACG 
CCTAATCCTA 
TTCGCGATAA 
TATTCTCAAG 
CGATACAACC 
TTCGATAATC 
TCTGCATTTA 
AGTCGTGGTA 
GACTACTTGG 
AAACGGCTGT 
AAGGACCTAT 
AAAAATTTCA 
CTGGGATACG 
ACAAAAGTCC 
QG*rrcCATAT 
GATTTGTTTA 
AAGACTATCA 
GAA6GGAAAG 
TACACAGTCA 
TGCTTGAATA 
GGGATAGGGC 
CAGGGATTAT 
CTTGGAGAAG 
AGCAGAGCCA 
GAATATTTAC 
TCATCCATTA 
GGCGACCCTG 
CCAAACAACA 
CCGGAAAAAG 
ACTCTGGAAA 
ATTACCGTTT 
CGACTTAACT 



GGCATGGAGA 
GTACCCGAAT 
GCAACCAAGT 
AAGCTGAATT 
TTGCTGTAAA 
CCTATGTCGT 
ACAATGACTA 
ACGGGTATTT 
CTGTCCGACA 
CTCAGCCACC 
TTAATTATGA 
CAACAACGTA 
TTATTTACGA 
GCGAACATAG 
TGAATAGCTT 
TTCGCTCAGT 
TGCTACTGAT 
CAACCTCTTT 
GCGCCTCCGA 
TCGGTCGATT 
ACAAAGAAAT 
GCTGCGATGC 
ACTCTATCTT 
ATGGTTCCCA 
AGGGACTCGA 
GTACGAGTCT 
TCCTGACCAC 
CCCAATATGC 
CTTATTCTTT 
TCAATACTAA 
CACTAAACAT 
CCATTATAAG 
GAGTTTTGCA 
TAGGCAATCA 
ACGGCGATGG 
TGTTTTCTAC 



TCCGGACTCA 
CACfffTCGAG 
CTTTCAGCGC 
GCCTGTTTTG 
AATCAAAGAG 
AGAGGAGTTG 
TTATAGCCAA 
TCGCTCACAA 
AGAAATTCTA 
TTTACAAAAA 
AGCTGATGGC 
CATAAGCGGA 
TGATATGTTC 
AGCCTTCTAC 
CCCATCAAAT 
TTACAACCAA 
CGGAAAACCA 
TATTCATAAT 
CTATTTTTTT 
TAGCGTCACC 
CTCATATAAT 
TCCAATCTTA 
AAAATCTAAT 
TCATTTITAT 
CGTCAATACT 
ATCATGTAGT 
ATACGCACCT 
CGTITATTTA 
ATATGAGAAT 
TTCTGTTGAT 
TATGGCTCAT 
CAGTCCGATA 
TTTTACTAAT 
GGCTAAAATA 
TCTTGCGATT 
GCATTCGGTC 
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ATGCCCAAAT 

TGTArrGTGG 

TGTGACCTAA 

ATGGCTACAA 

TTTGCGACAG 

AATGCTGCCA 

AATTGCGATG 

ATCACACGGA 

ATTTACAACC 

TGTTTTTATT 

AATAGGACTT 

AGCTTGCAGG 

ATTGACGTTT 

AATACGCCAG 

GGCAATAGCT 

GATTATCTTT 

TTTGCAATAG 

GAAGGCITGA 

CCX3ACAGCTG 

ATCGATTGGT 

GTCATTGACC 

ATAGATTTGA 

AAAAATTATT 

TGCAACAAAT 

GCTGTTGTAA 

GGAATAAATG 

TACA'SCATAC 

GTTGATCAGA 



TTCATTTTGA 
AGATAAGCAA 
GCGTTGAAAA 
GCTCCATCAC 
GAAACTTCGC 
•rCAAGTTAGA 
AGGCTCTTGT 
ATGT6ATAAA 
GAAATAAGAT 
TCGATAATGC 
GGCAGCTCTA 
GGGGAGATAC 
CAAATAATCA 
ACTTGTTCAT 
CTGCTGAAGC 
CGGCAAAAGT 
CTGCTTTGAA 
AAGATTATTT 
ATTTCCTGTC 
ACGAAAA'l-CG 
TTGGTGACAT 
ACATACTTTC 
TGTTGTCAAC 
CAAGCCTTGA 
CAATAATCTA 
GAGCCTCGGC 
AGTTCAATAC 
AAAl-rATAGA 



CAGTGTGAAA 
TTGCGAATTT 
CAGTATGTTT 
CGGATTATCT 
CTACCATATC 
CAATATTCCT 
ACTAAATAAT 
AAACTGTAGG 
CAGTAACAAT 
TCCTGTAATC 
TTCATCAAAC 
AGATACATGG 
CTGGGGCAAC 
TTGGATACCT 
AGTAGAATTC 
GGCTCTCAAG 
GGAATTGTTC 
CAGATCXlftAI" 
TGCGCGATGC 
CTTGAATAGT 
TTATTGGAAT 
CTGTGAACAA 
TCTTCCCGAA 
TAAATCCAAG 
CTATACCGAT 
TGATATAACC 
ATCCAACTTT 
TACGGAAAAA 



TTCAACAGTG 
ACCAATCGAA 
AGCAGTTCGG 
ACAAAAGCAA 
ACAAACACGC 
GAGTATTACA 
AGTGGCAACA 
ATTGGGAGCA 
CATATAGGAG 
AATGAAGAAG 
GGTACATTCC 
ATTTACAACG 
AATGATTTGT 
TTTTGGGATG 
CAAACAGCAT 
ATGATGGTTG 
AGGATAGAGA 
CCAACCATCa 
GATATTGTGT 
GAAATCTCCT 
ATGCAGTTAG 
AGGAAATCGC 
TCAACAGGTA 
ATAATCTCTA 
AACCCTTCCT 
GGGTTGCCCA 
•jATCCCGGTT 
TTACGAATCA 



CCCCGCTGTA 
GTGACATTAT 
G3ATAACGGT 
ACATTACC6A 
CAGGCTTAAC 
TTTCCGGTAA 
GAACGAACAG 
CGCTTTATAA 
TACGTCTCCT 
ATAAGCAGAC 
CTCTCAACTT 
ACACGTATAC 
TTGATCCGAA 
GATTGCCAAA 
TGGACTGTAT 
AAACCTACCC 
AAATGTCAGG 
TCTCTTCCCA 
GTGAAAACTA 
ATCAGGACAG 
ACTCACTCAG 
TCGAAACCCA 
CTCTCCTGCC 
TTTCGCCCAA 
GTTCTGTAAT 
AACATCTATC 
TCTACCTGGT 
AA 



TACAACGAAC 

TTCAAAGAAT 

ATTCAAGCCT 

CAATACTTTT 

AGCAACCTCC 

TAAAATAGTC 

ACTCCACAAT 

TTCCTATGGT 

CAACAACAGT 

GTTTATTTCT 

CCATTACAAC 

GAATCGOTAT 

TCAGGTTTTC 

TGGGAGATCG 

TGGCAATA'3C 

GGAATCCGAC 

CAACGATTAC 

GAACTTGTTC 

TCAGTCTGCC 

TGTTTTTGCA 

AGGGACTGGT 

TCAAAATGTA 

TCCATTAGAA 

TCCGGCGAAA 

AAAAATATAT 

CGAAGGITAT 

AACGCTAAAT 



(2) INFORMATION FOR SEQ ID MO: 203 

(i) SEQUENCE CHARACTERISTICS: 

<AJ LENGTH: 2115 base pairs 
(B) TYPE: nucleic acid 
{C) STRANDfclDMESS: double 
(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DI>IA (genomic) 

fiii) HYPOTHETICAL: NO 
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(iv) ATJTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAinSM: P0RYPHYR0I40IiAS GltJGIVALIS 

( ix ) ttJVTURE : 

(A) IJAI-IE/KEY: misic_f eature 

(B) LOCATION 1...2775 

(xi) SEQUEHCE DESCRIPTION: SEQ ID NO: 203 



ATGGCTATCA 
GCAGCGATCA 
GCTCCGGCTC 
GGCTGGCTTG 
TTCTCTGTAC 
TCAACAGCGG 
TGGCTATGCA 
GGGACTGCCA 
CCTCT7GTAT 
CGACATTACA 
TATAG'iTCGA 
AATGTGGGAC 
GAAGAGTTGC 
AAAGATGTCT 
CAAGTGGAGT 
GGCACGCTGC 
CCCAATG-^TT 
AATGCATACG 
GTTCCGGGTA 
GCCAAACGTG 
GCGGTGATGG 
ACCATGACAG 



TGATGAAAAG 
CGAATCCGAC 
AACCGGATAC 
AGATAGATGC 
CTTACGGACA 
GCAACTATCT 
ATCAGTATAG 
TTGAAGACTT 
GGCGTAGACG 
AAGTCACCGA 
TCGAAGGGCC 
GATTGACTTG 
AGCTTAGCGG 
CCATACTCGA 
ACTGCGTTAC 
ATTACGCCAC 
GGCTTGTGAT 
ACGCTTTTCC 
TAGGCCCGGT 
TCAAGTACTG 
CTTCGACAAC 
CGAAGCCGAC 



TATTGTTTTT 
TGCTCAAGAG 
TATCTTATAT 
TGATGCTGAT 
CAATGGCCTT 
GATTACACCC 
TACCAATCCG 
TGTTTTGTTG 
AATCGTGGAC 
CTCACACACA 
CGAACCTGCT 
GAACTATCCG 
CTACAACATC 
GTATGTGGAC 
AGCCGTTTAC 
GGATGCCATC 
AGACGCTGAT 
CGGCCATAAT 
GACTCCCGAC 
GGTAAGCACG 
GGGGACTGCT 
CGGCGCATGG 



AGAGCATTTC 
ATCTCAGGCA 
GAAAGTTTTG 
GGTGCCACTT 
TGCACCTACT 
AATATAGAAG 
GAACATTACG 
TTTGATG^TT 
TTACCGGAAG 
GAATTCTTGA 
ACCGACTTCA 
GAGGATTATC 
TATGCGAACG 
AGCACTTACT 
GATGAAAGCA 
CTTTATGAAA 
GGAGATG'SAT 
GGAGGCCATT 
AACTATCTGA 
CAGGATGCCA 
GTCGGAGATT 
TATGAAAGAA 



TAACGATTTT 
TGAATGCATC 
AGAATGGACC 
GGGGAAGCCC 
CCCATATACG 
GAGCCAAACG 
CAGTAATGGT 
CCATAACAGG 
GGACCAAATA 
AATTGGATGA 
CAGTAATCAA 
AACCGGAAGG 
GTACACTACT 
CTTTGCGAGA 
TAGAATCTTC 
ATTTTGAGAA 
TTAGCTGGGG 
GCTCCTTGTC 
TTACCCCCAA 
ATTGGGCAGC 
TCGTCATATT 
CCATCAACTT 



GCTCTCGTGG 
CTGTCTGGCT 
TGTTCCCAAT 
ATCAGGCTCT 
TTCCGGTATC 
GGTCAAGTAC 
ATCGACAACG 
GAAACCGACT 
TATTGCATGG 
TGl'CACTGTG 
TATTGGTCAG 
AAAGGGGAAT 
GGCACAAATA 
CAATCCCTTG 
GACCGTATGT 
TGGACCTGTT 
ACACTATTTG 
GGCTTCTTAT 
GGTTGAAGGA 
GGAACATTAC 
GTTCGAAGAA 
ACCTGAAGGG 
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ACTAAATACA 
GATATCACTG 
TCGCn'ATTG 
GATAAGACTG 
TCGCTCCTTG 
TCACGAGACG 
GACiTCCCAAT 
GAAGGCTTTG 
AATGTTAATT 
GCATCCCCTT 
CCCAGACTCG 
TCGCCTGAGC 
CTCTTGTTCG 
ACATTGCCTG 
TTCTTGCTCT 
ACTGATTTCG 
AACGGCTACG 
ATCTATGCAA 
GATGAGACTT 
TATAACGACA 
TTGGATAATA 
GTAAGGATAG 
ATTTGCATAT 
AATGACGGAG 
GAGATAAAGA 



TCGCATGGCG 
TATTCGGGAC 
AAAACAAC7W 
ATGATAAAGA 
TTCACATACA 
ATCAGGTGGA 
CGGTTTGCGA 
AGGCCGGAAG 
G-SGACTATTA 
CGTACTTACC 
AAGGAGCCAA 
ATTATGCTGT 
AACAGACAAT 
CAGGAACAAA 
TGGATGACAT 
aTGTCTCGCT 
AACCCGATAA 
ATGGCTCGCT 
ATTCTTCACG 
ATATCGAGTC 
TTCAATCTGA 
AGGGATTGAG 
TAAGGGAAGA 
TCTACTTGAT 
GGCCG 



GCATTACAAC 
TCCTGCATCA 
GGGACGATTA 
CCCATTGCAG 
AGAOCCGACT 
AGTGGAATAT 
TAAGCTGATT 
TATTCCTGAA 
TCCTTGGACT 
GATGATTGCC 
GCTTGTCAAG 
GATGGTTTCT 
GACCGCTAAG 
ATATATTGCC 
TACGGTTTAT 
TATTGAGAAT 
GACTGATGAT 
CCTTGTTCAC 
AGACGGTCAG 
CCAATCGGTT 
TACAAGCTTG 
TCGGAGCAAG 
GACTCATTCA 
TAAAGTAGTC 



TGTACXrGATA 
GAGCCCGAAC 
AAGTGGAATT 
CTTGCCGGCT 
GTTTTGGAGT 
TGTGTCACTG 
TATGATTCTC 
GGCTGGTTGT 
ATGTATGGAC 
GTTTTAACTC 
TATTGGGTAA 
ACTACGGGAA 
GCTAACGGTG 
TGGCGGCATT 
CGTTCTACTG 
AACAAGGGTC 
AAAAAACCAT 
ATACAAGACC 
GTGGAAATGG 
TGCGATAAGC 
AAAATATATC 
TCGACAATCG 
GAGAAAACGG 
GGTGGAAATA 



TATAITTCTT 
CTGTTACCGA 
ATCCTAACGG 
ACAATATCTA 
ATATCGATGA 
CCGTTTATAA 
AATCGGACAT 
TGATTGATGC 
ATGACAGTGA 
CGGATAACTA 
GTGCGCAAGA 
CTGCTGTTGA 
CATGGTATGA 
ATGATTGCAC 
AGACTGTTCC 
GCCTGAAATG 
TGCAGCTTAC 
CGACTGTTTT 
AATATTGTGT 
T6AACTATAC 
CTAATC03GC 
AG-ITGTATAA 
AAATCGATGT 
AAACAACAAC 



GAAGTTGGAC 
TTTCGTTGTC 
CTACGAACCC 
TGCAAACGGC 
GACTTATTCT 
CGACAATATC 
TATCTTATAT 
TGATGCCGAC 
GAAGT6TATT 
TTTGGTTACA 
TGCTGTTTAT 
AGATTTTGTC 
GCGAACTATT 
CGATATGTTT 
CGAGCCTGTT 
GAATTATCCT 
CGGCTACAAC 
GGAGTATATC 
CACTGCCGT7 
TATCACATCC 
ATCGTATGTG 
TGCGCTGGGA 
TTCACGTCTC 
CGAAAAGGTA 



1380 
XAAQ 
1500 
1560 
- 1620 
1680 
1740 
1000 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2200 
234 0 
2400 
24 60 
2520 
2S0O 
2640 
27Q0 
2760 
277b 



30 



35 



40 



(2) IHFORI-IATION TOR SEQ ID HO: 204 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2766 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDt^ESS: double 
fD) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) Af4TI -SENSE: NO 



45 



50 



55 



60 



65 



70 



75 



(VI) ORIGINAL SOURCE: 

(A) ORGANISM: tHDRY PHYR0MOHA5 GIHGtVALIS 

(ix) FEATURE: 

(A) NAUE/KE-f: misc feature 

(B) LOCATION 1.., 27 66 

ixi) SEQUENCE DESCRIPTION: SEQ ID NO:204 



ATGATGAAAA 
ACGAATCCGA 
CAACCGGATA 
GAGATAGATG 
CCTTACGGAC 
GGCAACTATC 
AATCAGTATA 
ATTGAAGACT 
TGGCGTAGAC 
AAAGTCACCG 
ATCGAAGGGC 
CGATTGACTT 
C7U3CTTAGCG 
TCCATACTGG 
TACTGCGTTA 
CATTACGCCA 
TGGCTTGTGA 
GACGCTTTTC 
ATAGGCCCGG 
GTCAAGTACT 
GCTTCGACAA 
GCGAAGCCGA 
ATCGCATGGC 
GTATTCGGGA 
GAAAACAACA 



GTATTGTTTT 
CTGCTCAAGA 
CTATCTTATA 
CTGATGCTGA 
ACAATGGCCT 
TGATTACACC 
GTACCAATCC 
TTGTTTTGTT 
GAATCGTGGA 
ACTCACACAC 
CCGAACCTGC 
GGAACTATCC 
GCTACAACAT 
AGTATGTGGA 
CAGCCGTTTA 
CGGATGCCAT 
TAGAOCCTGA 
CCGGCCATAA 
TGACTCCCGA 
GGGTAAGCAC 
CGGGGACTGC 
CCGGCGCATG 
GGCATTACAA 
CTCCTGCATC 
AGGGACGATT 



TAGAGCATTT 
GATCTCAGGC 
TGAAAGTTTT 
TGGTGCCACT 
TTGCACCTAC 
CAATATAGAA 
GGAACATTAC 
GTTTGATGAT 
CTTACCGGAA 
AGAATTCTTG 
T7VCCGACTTC 
GGAGGATTAT 
CTATGCGAAC 
CAGCACTTAC 
CGATGAAAGC 
CCTTTTATGAA 
TGGAGATGGA 
TGGAGGCCAT 
CAACTATCTG 
GCAGGATGCC 
TGTCGGAGAT 
GTATGAAAGA 
CTGTACCGAT 
AGAGCCCGAA 
AAAGTGGAAT 



CTAACGATTT 
ATGAATGCAT 
GAGAATGGAC 
TGGGGAAGCC 
TCCCATATAC 
GGAGCCAAAC 
GCAGTAATGG 
TCCATAACAG 
GGGACCAAAT 
AAATTG-SATG 
ACAGTAATCA 
CAACCGGAAG 
GGTACACTAC 
TCTTTGCGAG 
ATAGAATCTT 
AATTTTGAGA 
TTTAGCTGGG 
TGCTCCTTGT 
ATTACCCCCA 
AATTGGGCAG 
TTCGTCATAT 
ACCATCAACT 
ATATATTTCT 
CCTGTTACCG 
TATCCTAACG 



TGCTCTCGTG 
CCTGTCTGGC 
CTGTTCCCAA 
CATCAGGCTC 
GTTCCGGTAT 
GGGTCAAGTA 
TATCGACAAC 
GGAAACCGAC 
ATATTGCATG 
ATCTCACTGT 
ATATTGGTCA 
GAAAGGGGAA 
TGGCACAAAT 
ACAATCCCTT 
CGACCGTATG 
ATGGACCTGT 
GACACTATTT 
CGCCTTCTTA 
AGGTTGAAGG 
CGGAACATTA 
TGTTCGAAGA 
TACCTGAAGG 
TGAAGTTGGA 
ATTTCGTTGT 
GCTACGAACC 



GGCAGCGATC 
TGCTCCGGCT 
TGGCTGGCTT 
TTTCTCTGTA 
CTCAACAGCG 
CTGGGTATGC 
GGGGACTGCC 
TCCTCTTGTA 
GCGACATTAC 
GTATAGGTCG 
RAATGTGGGA 
TGAAGAGTTG 
AAAAGATGTC 
GCAAGTGGAG 
TGGCACGCTG 
TCCCAATGGT 
GAATGCATAC 
TGTTCCGGGT 
AGCCAAACGT 
CGCGGTGATG 
AACCATGACA 
GACTAAATAC 
CGATATCACT 
CTCGCTTATT 
CGATAAGACT 



60 
120 
180 
240 
300 
360 
420 
400 
S4 0 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



lUi "^li Cai .-iL & li-i' h.J\ m 



^r-^ ir=h -»r=!; jf";|s 

iU3t O ii:Sr *0 ^iL^- ti^- 



WO 99/29870 



PCT/AU98/01023 



140/490 



GATGATAAAG 
GTTCACATAC 
GATCAGCTGG 
TCGGTTTGCG 
GAGGCCGGAA 
TGGGACTATT 
TCGTACTTAC 
GAAGGAQCCA 
CATTATGCTG 
GAAGAGACAA 
eCAGGAACAA 
TTGGATGACA 
GTTGTCTCCC 
GAACCCGATA 
AATGGCTCGC 
TATTCTTCAC 
AATATCGAGT 
ATTCAATCTG 
C5AGGGATTGA 
TTAAGGGAAG 
GTCTACTTGA 
AGGCCG 



ACCCATTGCA 
AAGACCCGAC 
AAG7GGAATA 
ATAAGCTGAT 
GTATTCCTGA 
ATCCTTGGAC 
CGATGATTGG 
AGCTTGTCAA 
TGATGGm-C 
TGACCGCTAA 
AATATATTGC 
TfACGGTTTA 
TTATTGAGAA 
AGACTGATGA 
TCCTTGTTCA 
GAGACGGTCA 
CCCAATCGGT 
ATACAA6CTT 
GTCGGAGCAA 
AGACTCATTC 
TTAAAGTAGT 



GCTTGCCGGC 
TGTTTTGGAG 
TTGTGTCACT 
TTATGATTCT 
AGGCTGGTTG 
TATGTATGGA 
CGTTTTAACT 
GTATTGGGTA 
TACTACGGGA 
GGCTAACGGT 
CTGGCGGCAT 
TCGTTCTACT 
TAACAAGGGT 
TAAAAAACCA 
CATACAAGAC 
GGTGGAAATG 
TTGCGATAAC. 
GAAAATATAT 
GTCGACAATC 
AGA6AAAACG 
CGGTGGAAAT 



TACAATATCT 
TATATCGATG 
GCCGTTTATA 
CAATCGGACA 
TTGATTGATG 
CATGACAGTG 
CCGGATAACT 
AGTGCGCAAG 
ACTGCTtTTTG 
GCATGGTATG 
TATGATTGCA 
GAGACTGTTC 
CGCCTGAAAT 
TTGCAGCTTA 
CCGACTGTTT 
GAATATTGTG 
CTGAACTATA 
CCTAATCCGG 
GAGTTGTATA 
GAAATCGATG 
AAAACAACAA 



ATGCAAACGG 
AGACTTATTC 
ACGACAATAT 
TTATCTTATA 
CTGATGGCGA 
AGAAGTGTAT 
ATTTGGTTAC 
ATGCTGTTTA 
AAGATTTTGT 
AGCGAACTAT 
CCGATATGTT 
CCGAGCCTGT 
GGAATTATCC 
CCGGCTACAA 
TGGAGTATAT 
TCACTGCCGT 
CTATCACATC 
CATCGTATGT 
ATGCGCTGGG 
TTTCACGTCT 
CCGAAAAGGT 



CTCGCTCCTT 
TTCACGAGAC 
CGAGTCCCAA 
TGAAGGCTTT 
CAATGTTAAT 
TGCATCCCCT 
ACCCAGACTC 
TTCGGCTGAG 
CCTCTTGTTC 
TACATTGCCT 
TTTCTTGCTC 
TACTGATTTC 
TAAOGGCTAC 
CATCTATGCA 
C6ATGAGACT 
TTATAACGAC 
CTTGGATAAT 
GGTAAGGATA 
AATTTGCATA 
CAATGACGGA 
AGAGATAAAG 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
~ 1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2766 



(2) lUFOmiATION. roR SEQ ID MO: 205 

ii) SEOUEHCE CHARACTERISTICS: 

(A) LENGTH: 27 63 ba^e pairs 
(B». TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

{11) MOLECULE TYPt:: DKA (genomic) 

(iii) HYPOTHETICAL: UO 

(Iv) AJ'fTI-SEIISE: NO 

(vi) ORIGIMAL SOURCE: 

(A) ORGAJIISH: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A) HAI IE/KEY: misc^feature 

(B) LOCATIOfJ 1. - .2763 

(Xil SEQLIEJICE DESCRIPTION: SEQ ID NO: 205 



ATGAAAAGTA 
AATCCGACTG 
CCGGATACTA 
ATAGATGCTG 
TACGGACACA 
AACTATCTGA 
CAGTATAGTA 
GAAGACTTTG 
CGTAGACGAA 
GTCACCGACT 
GAAGGGCCCG 
TTGACTTGGA 
CTTAGCGGCT 
ATACTGGAGT 
TGCGTTACAG 
TACGCCACGG 
CTTGTGATAG 
GCTTTTCCCG 
GGCCCGGTGA 
AAGTACTGGG 
•rCGACAACGG 
AAGCCGACCG 
GCATGCCGGC 
TTCGGGACTC 
AACAACAAGG 
GATAAAGACC 
CACATACAAG 
CAGGTGGAAG 



TTGTTTTTAG 
CTCAftGAGAT 
TCTTATATGA 
ATGCTGATGG 
ATGGCCTTTG 
TTACACCCAA 
CCAATCCGGA 
TTTTGTTGTT 
TCGTGGACTT 
CACACACAGA 
AACCTGCTAC 
ACTATCCGGA 
ACAACATCTA 
AT6TGGACAG 
CCGTTTACGA 
ATGCCATCCT 
ACGCTGATGG 
GCCATAATGG 
CTCCCGACAA 
TAAGCACGCA 
GGACTGCTGT 
GCGCATGGTA 
ATTACAACTG 
CTGCATCAGA 
GACGATTAAA 
CATTGCAGCT 
ACCCGACTGT 
TGGAATATTG 



AOCATTTCTA 
CTCAGCCATG 
AAGTTTTGAG 
TGCCACTTGG 
CACCTACTCC 
TATAGAAGGA 
ACATTACGCA 
TGATGATTCC 
ACCGGAAGGG 
ATTCTTGAAA 
CGACrrCACA 
GCiATTATCAA 
TGCGAACGGT 
CACTTACTCT 
TGAAAGCATA 
TTATGAAAAT 
AGATGGATTT 
AGGCCATTGC 
CTATCTGATT 
GGATGCCAAT 
CGGAGATTTC 
TGAAAGAACC 
TACCGATATA 
GCCCGAACCT 
GTGGAArrAT 
TGCCGGCTAC 
TTTGGAGTAT 
TGTCACTGCC 



ACGATTTTGC 
AATGCATCCT 
AATGGACCTG 
GGAAGCCCAT 
CATATACGTT 
GCCAAACGGG 
GTAATGGTAT 
ATAACAGGGA 
ACCAAATATA 
TTGGATGATG 
GTAATCAATA 
CCGGAAGGAA 
ACACTACTGG 
TTGCGAGACA 
GAATCTTCGA 
TTTGAGAATG 
AGCTGGGGAC 
TCCTTGTCGG 
ACCCCCAAGG 
TGGGCAGCGG 
GTCATATTGT 
ATCAACTTAC 
TATTTCTTGA 
GTTACCGATT 
CCTAACGGCT 
AATATCTATG 
ATCGATGAGA 
GTTTATAACG 



TCTCGTGGGC 
GTCTGGCTGC 
TTCCCAATGG 
CAGGCTCm* 
CCGGTATCTC 
TCAAGTACTG 
CGACAACGGG 
AACCGACTCC 
TTGCATGGCG 
TCACTGTGTA 
TTGGTCAGAA 
AGGGGAATGA 
CACAAATAAA 
ATCCCTTGCA 
CCGTATGTGG 
GACCTCTTCC 
ACTATTTGAA 
CTTCTTATGT 
TTGAAGGAGC 
AACATTACGC 
TCGAAGAAAC 
CTGAAGGGAC 
AGTTGGACGA 
TCGTTGTCTC 
ACGAACCCGA 
CAAACGGCTC 
cttattcttc 
ACAATATOjA 



AGCGATCACG 
TCCGGCTCAA 
CTGGCTTGAG 
CTCTGTACCT 
AACAGCGGGC 
GGTATGCAAT 
GACTGCCATT 
TCTTGTATGG 
ACATTACAAA 
TAGGTCGATC 
TGTGGGACGA 
AGAGTTGCAG 
AGATGTCTCC 
AGTGGAGTAC 
CACGCTGCAT 
CAATGGTTGG 
TGCATACGAC 
TCCGGGTATA 
CAAACGTGTC 
(5GTGATGGCT. 
CATGACAGCG 
TAAATACATC 
TATCACTGTA 
GCTTATTGAA 
TAAGACTGAT 
GCTCCTTGTT 
ACGAGACGAT 
GTCCCAATCG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
040 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



iF«}t ,H"ii ir"^ ■%V"h ^ "^1 ,(t<^ 

u™ii -^Ni ::r3 ii3 ™il ii:::: icji oi 



:ir«ii .rtP ir»ii iF^'ii rf^ 

Oi i^: Q lUi JUfi 



WO 99/29870 



PCT/AU98AH023 



141/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



GTTTGCCWTA 
GCCGGAAGTA 
GACTATTATC 
TACTTACXTGA 
GGAGCCAAGC 
TATGCTGTGA 
GAGACAAT6A 
GGAACAAAAT 
GATGACATTA 
GTCTCGCTTA 
CCCGATAAGA 
GGCTCGCTCC 
TCTTCACGAG 
ATCGAGTCCC 
CAATCTGATA 
GGATTGAGTC 
AGGGAAGAGA 
TACTTGATTA 
CCG 



AGCTGATTTA 
TTCCTGAAGG 
CTTGGACTAT 
TGATTGGCGT 
TTCTCAAGTA 
TGGTTTCTAC 
CCGCTAAGGC 
ATATTGCCTG 
CGGTTTATCG 
TTGAGAATAA 
CTGATGATAA 
TTGTTCACAT 
ACGGTCAGGT 
AATCGGTTTG 
CAAGCTTGAA 
GGAGCAAGTC 
CTCATTCAGA 
AACTAGTCGG 



l-GATTCTCAA 
CTGGTTGTTG 
GTATGGACAT 
TTTAACTCCG 
TTGGGTAAGT 
TACGGGAACT 
TAACGGTGCA 
GCGGCATTAT 
TTCTACTGAG 
CAAGGGl'CGC 
AAAACCATTG 
ACAAGACCCG 
GGAAATGGAA 
CGATAAGCTG 
AATATATCCT 
GACAATCGAG 
GAAAACGGAA 
TGGAAATAAA 



TCGGACATTA 
ATTGATGCTG 
GACAGTGAGA 
GATAACTATT 
GCGCAAGATG 
GCTGTTGAAG 
TGGTATGAGC 
GATTGCACCG 
ACTGTTCCCG 
CTGAAATGGA 
CAGCTTACCG 
ACTGTTTTGG 
TATTGTGTCA 
AACTATACTA 
AATCCGGCAT 
TTGTATAATU 
ATCGATGTTT 
ACAACAACCG 



TCTTATATGA 
ATGGCGACAA 
AGTGTATTGC 
TGGTTACACC 
CTGTTTAT7C 
ATTTTGTCCT 
GAACTATTAC 
ATATGTTTTT 
AGCCTGTTAC 
ATTATCCTAA 
GCTACAACAT 
AGTATATCGA 
CTGCCGTTTA 
TCACATCCTT 
CGTATGTGGT 
CGCTGGGAAT 
CACGTCTCAA 
AAAAGGTAGA 



AGGCTTTGAG 
TGTTAATTGG 

ATcxccrrcG 

CAGACTCGAA 
GGCTGAGCAT 
CTTGTTCGAA 
ATTGCCTGCA 
CTTGCTCTTG 
TGATTTCGTT 
CGGCTACGAA 
CTATGCAAAT 
TGAGACrrAT 
TAACCACAAT 
GGATAATATT 
AAGGATAGAG 
TTGCATATTA 
TGACGGAGTC 
GATAAAGAGG 



(2) IWFORIIATIOtJ FOR SEQ ID HO: 206 

(i) SEOUEIJCE CHARACTERI ST TC.<?: 

(A) LENGTH: 17V 9 base pairs- 

(B) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 
<U) TOPOLOGY: circular 

(11) MorJ^CULE TYPE: DNA (genomic) 

a 11) HYPOTHETICAL: HO 

(iv) ANT I -SENSE: NO 

(vi) ORIGIJIAL SOURCE: 

(A) ORGAHISH: P0RYPH*rR0*-10NAS GIHGIVALIS 

(ix) FEATURE: 

(A) HAME/KEY: misc_f eature 

(B) LOCATION 1...1779 



txl) SEQUENCE DESCRIPTION: SEQ ID NO: 206 



ATGAACAGCA 
TCCGTTTGCG 
CACCTTACTA 
GCATACAAGG 
TTCGGATTTG 
ACGGAAGGTC 
CAGCGAATGG 
CTCTCCTATC 
GATAGACGAA 
ACCGGCTTGT 
GTGGGCAGCA 
AACGTCCGTG 
TTCTTCCGTT 
CCTGAACTGA 
GTTCGTACGG 
TCCGAAACCG 
GACTCGGACA 
GCCAAAGGGA 
TGGCGTGTGT 
CAAGAGGAGA 
ACCGCCCAAT 
GACTTCTCCA 
TCCGACTTGT 
CGCGCAGGTG 
TCCTATGGCA 
TTGAAGGCAG 
CAGCTGAACC 
GGTAATGTAC 
TATCTCAAAA 
CAGCCGTTCC 



TCATGAAATA 
GTCAAACCCC 
TCGTTAATGA 
CCATCGAACC 
TCCCTGAAGT 
ATATGAAGCA 
ATGCCGGCTA 
GTGGGATGAA 
TGATGGCAGG 
ATTATTCGAA 
TCCCTCAGCl' 
TATACTTGGG 
CTATTCCCTA 
ACGTCACGAT 
GAGGATTGTT 
ACCGCAACCT 
ATATGCAATG 
GGTTGTTTTT 
ATGCGAAAGC 
TGCCCTACCT 
TAGGGGTGAA 
AGCTGACAGG 
ATCAGTATAA 
GTAAGCTGGA 
AGTGGAATTT 
AAGTAGGTGT 
GACGGTATCG 
ATCTTCTTAG 
TCGATAATAT 
ATTGTTTCGC 



TCAATTATAT 
ACGAAATACA 
CCAGACTGTG 
TCGATTAAAA 
TTCCTCTTCA 
CCGGGGGTAC 
TCGTCTGATA 
ATCGGCTTTC 
AGTGGACTAC 
CCATTATTTC 
ATCGACACCT 
TGCAAAAAAT 
TCTGGGTACC 
GAGTAATGAG 
TTTTGCCAAA 
GTATTATGTG 
GAACATACAC 
CTGGCCTAAA 
CTTCGGCGGT 
GATGCCCAAT 
GGGGAATATA 
TGTGCCTTTC 
TGTGAGTTTC 
ATACTCTTAT 
GGATGGAGGA 
TCATCCCATT 
GTATTCTTTC 
TGCGGATGTT 
GCTGGCGGAA 
CGGTTTTAGC 



ACGGCCGTCA 
GAAACCAAAC 
GAGATGGAGC 
CCTTTCCGTC 
GGCAGGAACA 
CTGAATATCG 
GATGCAGAGC 
AATACCGGTG 
GAGCAGCGCA 
AATAACTACG 
GTTACTCCTC 
GATGTGATCG 
GATCCGATGA 
TTGTCCGATG 
AACAGCGAAA 
GAGGGCGCGC 
GCCGGAGTAG 
CTGGATGCTT 
GTGATTCGTW 
ACGATTGTAC 
GCCGATGTGG 
TATACTCCGA 
TTGCCGATAT 
CGCGATATGC 
CTTGTCGCCT 
GCCCCATTGG 
GGCTCGGCTG 
TCATACAAGC 
ACGACAGAGC 
TGGACTTTC 



TAATGGCTCT 
GCCCCGACAC 
ATGCGGATCC 
CGGAATATAA 
ATCTTCCGAA 
GTATCGGCCA 
AGGAGAGACT 
ACTTCGACGG 
GGCCTTCCTT 
GACGGGGAGC 
AGATGGACAA 
ATGCCAGGAT 
AGGCTCTCAC 
ATATTAAGCT 
TGATTCAAAC 
CCACAATCGG 
GGATTTCTTC 
CGCTTAGTAT 
ATGGTCTCGC 
TCCCTTCGCG 
TACGTATGGA 
CTCTACCCTT 
ATGCCGACGG 
TCCGCT7TCT 
CCATGCA'3CC 
ATGTCAGACr 
GCTCGGAAGC 
TGAAAAA'SAA 
TTATCGGTTA 



CTCTGTATCA 
GCTGCGCAGG 
GCTTCCGGCT 
CAAGCGTACA 
TATCCTGCCG 
TACGCTAAAC 
GAATCTTTTC 
CGACAGAAAG 
TGTGCTTGCT 
TACCACCAAT 
CGGCACCCAC 
CGACTATCGT 
AGAACATACG 
CGGTGTCGAA 
GGGCGTTCTG 
ATTTGTCGGA 
CCATTTCGGA 
CTTCCCTTCA 
CGATGTTATG 
CAATGCTTTG 
GGTTTATGGC 
ATATAATCCA 
CAGCCGCTGG 
GGTAGACGCA 
CGATCTTATA 
CCGGTATACA 
CTTGGGTATC 
CTTGAGCCTT 
•ITATCCTATG 



1740 
1800 
1860 
1920 
3 980 
20*10 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2763 



60 
120 
180 
240 
300 
360 
420 
4B0 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
lOttO 
1140 
1200 
1260 
1320 
130O 
1440 
1500 
1560 
1620 
1680 
1740 
1775 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



"iP"it fl'"it tH"^ «I1 tff^, 

iui ^ :nr:: i!3 Jl; 



-H^Ti ,TtP -if«=i5 ri^i 

m %^ JCZ a iLJt.lL.il 



WO 99/29870 



PCT/AU98/01023 



142/490 



(2) IIIFOW1ATION rOR SEQ ID HO: 207 

(1) SEQUBHCE CHARACTERISTICS: 

(A) LENGTH: 1767 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLCX3V: circular 

(ii) MOLECULE TYPE: MIA {genomic) 

(lil) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORRANISM: PORYPHYROJIONAS GINGIVALIS 

(i>:) tfATURE: 

(A) NAfJE/KEY: misc_feature 
(B( LOCATIOi; 1...17 67 

t:a) SEQUENCE DESCRIPTION: SEQ ID NO: 207 



ATGAA.niTATC 
CAAACCCCAC 
GTTAATGACC 
ATCGAACCTC 
CCTGAAGTTT 
ATGAAGCACC 
GCCGGCTATC 
GGGATGAAAT 
ATGGCAGGAG 
TATTCGAACC 
CCTCAGCTAT 
TACTTGGGTG 
ATTCCCTATC 
GTGACGATGA 
GGATTGTTTT 
CGCAACCTGT 
ATGCAATGGA 
TTGTTTTTCT 
GCGAAAGCCT 
CCCTACCTGA 
GGGGTGAAGG 
CTGACAGGTG 
CAGTATAATG 
AAGCTGGAAT 
TGGAATTTGG 
GTAG«3TGTTC 
CGGTATCGGT 
CTTCTTAGTG 
GATAATATGC 
TGTTTCGCCG 



AATTATATAC 
GAAATACAGA 
AGACTGTGGA 
GATTAAAACC 
CCTCTTCAGG 
GGGGGTACCT 
GTCTGATAGA 
CG GCTTTCAA 
TGGACTACGA 
ATTATTTCAA 
CGACACCTGT 
CAAAAAATGA 
TGGGTACCGA 
GTAATGAGTT 
TTGCCAAAAA 
ATTATGTGGA 
ACATACAGGC 
GGCCTAAACT 
TCGGCGGTGT 
TGCCCAATAC 
G«SAATATAGC 
TGCCTTTCTA 
TGAGTTTCTT 
ACTCTTATCG 
ATGGAGGACT 
ATCCCATTGC 
ATTCTTTCGG 
CGGATGTTTC 
TGGCGGAAAC 
GTTTTAGCTG 



GGCCGTCATA 
AACCAAACGC 
GATGGAGCAT 

rrrccGTccG 

CAGGAACAAT 
GAATATCCGT 
TGCAGAGCAG 
TACCGGTGAC 
GCACCGCAGG 
TAACTACGGA 
TACTCCTCAG 
TGTGATCGAT 
TCCGATGAAG 
GTCCGATGAT 
CAGCGAAATG 
GGGCGCGCCC 
CGGAGTAGGG 
GGATGCTTCG 
GATTCGAAAT 
GATTGTACTC 
CGATGTGGTA 
TACTCCGACT 
GCCGATATAT 
CGATATGCTC 
TGl'CGCCTCC 
CCCATTGGAT 
CTCGGCTGCC 
ATACAAGCTG 
GACAGAGCTT 
GACTTTC 



ATGGCrCTCT 
CCCGACAC3C 
GCGGATCCGC 
GAATATAACA 
CTTCCGAATA 
ATCGGCCATA 
GAGAGACTGA 
rrCGACGGCG 
CCTTCCTTTG 
CGGGGAGCTA 
ATGGACAACG 
GCCAGGATCG 
GCTCTCACAG 
ATTAAGOrCG 
ATTCAAACGG 
ACAATCGGAT 
ATTTCTTCCC 
CITAGTATCT 
GGTCTCGCCG 
CCTTCGCGCA 
CGTATGGAGG 
CTACCCTTAT 
GCCGACGGCA 
CGCTTTCTGG 
ATGCAGCCCG 
GTCAGACTCC 
TCGGAAGCCT 
AAAAAGAACT 
ATCGGTTATT 



CTGTATCATC 
TGCGCAGGGA 
TTCCGGCTGC 
AGCGTACATT 
TCCTGCCGAC 
CGCTAAACCA 
ATCTTTTCCT 
ACAGAAAGGA 
TGCTTGCTAC 
CCACCAATGT 
GGACCCACAA 
ACTATCGTTT 
AACATACGCC 
GTGTCGAAGT 
GCGTTCTGTC 
TTGTCGGAGA 
ATTTCGGAGC 
TCCCTTCATG 
ATGTTATGCA 
ATGCTTTGAC 
TTTATGGCGA 
ATAATCCATC 
GCCGCTGGCG 
TAGACGCATC 
ATCTTATATT 
GGTATACACA 
TGGGTATCGG 
TGAGCCTTTA 
ATCCTATGCA 



CGTTTGCGGT 
GCTTACTATC 
ATACAAOjCC 
CGGATTTGTC 
GGAAGGTCAT 
GCGAATGGAT 
CTCCTATCGT 
TAGACGAATG 
CGGCTTGTAT 
GGGCAGCATC 
CGTCCGTGTA 
CTTCCGTTCT 
TGAACTGAAC 
TCGTACGGGA 
CGAAACCGAC 
CTCGGACAAT 
C7WVGGGACG 
GCGTGTGTAT 
AGAGGAGATG 
CGCCCAATTA 
CTTCTCCAAG 
CGACTTGTAT 
CGCAGGTGGT 
CTATGGCAAG 
GAAGGCAGAA 
GCTGAACGGA 
TAATGTACAT 
TCTCAAAATC 
GCCGTTCCAT 



60 
120 
ISO 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
17 67 



(2 J III FORI lATION FOR SEQ ID NO: 208 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1038 bass palrff 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: doubl- 

(D) TOPOLOGY: circular 

(il) nOLECULE TYPE: DNA (genomic) 

(iil) HYPOTHETICAL: NO 

il-r) Ain-I-SEHSE; HO 

I- I) ORIGINAL SOURCE: 

(A) ORGANISH: PORYPHYRaiONAS GINGIVALIS 

iix) FEATURE: 

(A) NAIIE/KEY: iiiisc_f eature 



SUBSlilUTH SHEET (Rule 26) (RO/AU) 



'^is r3 o -u« ii::; o o in::: a uJ iL3 



WO 99/29870 



PCT/AU98/01023 



143/ 490 



iBi LOCATION 1. 



(xi) SEQLIENCE DESCRIPTTOIV : SEQ ID HO:Z08 



10 



15 



20 



ATCATGGAAA 
CATTTCGGTA 
AAGCAGGTGT 
AAAGCTATCA 
GGATATGAAT 
ATGGGCAATG 
CGTTTCCTGA 
TTTAGTGCTT 
CGCGGTGGAG 
CTTGGTGTGG 
CTGTTCAAGA 
TGGGATTTCC 
ACCTTTGTTCA 
ATGCAAAAGT 
TGGGTCGGGC 
AAATGGGGAG 
TCTGCTGCCA 
GACGATAAGA 



AATGTATCTT 
TCATTTTGGG 
TTCATTTTCT 
CCATCGTAGA 
CCGGTGGCCG 
CCrrGTTATGC 
ACTACGGGTC 
CG-SATATAGC 
TCAGCCTAAA 
ATGTCGGTAT 
ACGTAGGGGC 
AGCTCGGCTT 
ATCTGAATCC 
TCCTCCGACA 
TGGGATATAC 
GTCTTTCGGC 
CCTATCATCC 
GCATCTTC 



TGCTCACTAT 
ATTTCTTTCTr 
GAACCTTCCG 
CGACAATCCC 
CGCCTTTCTT 
CTCGTCCGTC 
TATGCAAG6A 
TGTACAAGGA 
AGCATTGTAT 
CAGTTATTAC 
GCAACTGAAA 
TTCCCGCAGT 
GCACTATTTC 
CTTCTCGATA 
GCCACAGATT 
CGGCCTCGCT 
TGCAGCTCTTT 



CCAC7\TAACC 
CTTGTGTTTT 
GCTACTGCAC 
GGACTGGCTT 
TCCTTATTTAT 
GGAGAGCGTG 
TACGATCAGA 
TTTTACAGCC 
TCTTCTATCG 
GACGATGACA 
GGCTATAATG 
TTTATCAATG 
AAGCGTCTTG 
GGAGCAGAAT 
GCACAGGATT 
TTCACTTCAG 
TCGTTCATGT 



TTGTGTTCAT 
CGGCAGGTGC 
AGGCTTTGGC 
TTGAGAATCC 
ArTATAlGAG 
GCATGTGGGG 
ATGCGATTGC 
ATGAACTGAG 
AGACGTATAG 
AAGGATATTC 
AAGAACGGGA 
CTCCGTTTCG 
TACCACGC6A 
TTACTCCTTC 
TCGAGGTGGA 
GTGTAGTACG 
GTTCGGTAGG 



GATTCGCAAG 
TCAACAAGAG 
TGCCGGAGGC 
GGCTCTGCTC 
TGGTTCGCAT 
TGTTGGCATG 
CACCGGCTCT 
CAACCACTTC 
TTCCTTTGGC 
CCCTTCCGCT 
ACCGCTCGAT 
CTTGCACATC 
•IXTGTCCAAG 
CGAGAGGTTT 
AGGCGGCAAC 
TGTAGGCGTA 
TATCCGTTTG 



60 
120 
180 
240 
300 
360 
42C 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1038 



25 



30 



35 



40 



45 



50 



55 



60 



65 



(2) IirFORllATION FOR SEQ ID KO:209 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1035 b^se pairs 
(D) TYPE: nucleic acid 

(C) STRAHDEDNEaS: double 

(D) TOPOLOGY: circular 

(H» MOLECULE TYPE: DMA (genomic) 
tiii) HYPOTHETICAL: NO 
(IvJ AfJTI-SENSE; NO 

(vi) ORIGIHAL SOURCE: 

(A) ORGAtlISM: PORY PHYROttOTiAS GIMGIYALIS 

( ix I FEATURE : 

(AJ HAHE/KEY: jnisc_f eaturo 

(B) LOCATION 1...1036 

(Xi) SEQUENCE DESCRIPTION: SF.Q ID HO: 209 



ATGGAAAAAT 
TTCGOTATCA 
CAGGTGTTTC 
GCTATCACCA 
TATGAATCCG 
GGCAATGCCT 
TTCCTGAACT 
AGTGCTTCGG 
GGTGGAGTCA 
GGTGTGGATG 
TTCAAGAACG 
GATTTCCAGC 
TTGTTCAATC 
CAAAAGTTCC 
GTCGGGCTGG 
TGGGGAGGTC 
GCTGCCACCT 
GATAAGAGCA 



GTATCTTTGC 
TTTTGGGATT 
ATTTTCTGAA 
TCGTAGACGA 
GTGGCCGCGC 
GTTATGCCTC 
ACGGGTCTAT 
ATATAGCTGT 
GCCTAAAAGC 
TCGGTATCAG 
TAGGGGCGCA 
TCGGCTTTTC 
TGAATCCCCA 
TCCGACACTT 
GATATACGCC 
TTTCGGCCGG 
ATCATCCTGC 
TCTTC 



TCACTATCCA 
TCTTTCTCTT 
CCTTCCGGCT 
CAATCCCGGA 
CTTTCTTTCC 
GTCCGTCGGA 
GCAAGGATAC 
ACAAGGATTT 
ATTGTATTCT 
TTATTACGAC 
ACTGAAAGGC 
CCGCAGTTTT 
CTATTTCAAG 
CTCGATAGGA 
ACAGATTGCA 
CGTCGGTTTC 
AGCTCTTTCG 



CATAACCTTG 
GTGTTTTCGG 
ACTCCACAGG 
CTGGCTTTTG 
TATTTATATT 
GACCGTGGCA 
GATCAGAATG 
TACAGCCATG 
TCTATCGAGA 
GATGACAAAG 
TATAATGAAG 
ATCTVATGCTC 
CGTCTTGTAC 
GCAGAATTTA 
CACGATTTCG 
ACTTCAGGTG 
TTCATGTGTT 



TGTTCATGAT 
CAGGTGCTCA 
CTTTGGCTGC 
AGAATCCGGC 
ATATGAGTGG 
TGTGGGGTGT 
CTIATTGCCAC 
AACTGAGCRA 
CGTATAGTTC 
GATATTCCGC 
AACGGGAACC 
CGTTTCGCTT 
CACGCGATCT 
CTCCTTCCGA 
AGGTGGAAGG 
TAGTACGTGT 
CGGTAGGTAT 



TCGCAAGCAT 
ACAAGAGAAG 
CGGAGGCAAA 
TCTGCTCGiSA 
TTCGCATATG 
TGGCATGCGT 
CGGCTCTTT7 
CCACTTCCGC 
CTTTGGCCTT 
TTCCGCTCTG 
GCTCGATTGG 
GCACATCACG 
GTCCAAGATG 
GAGGTTTTGG 
CGGCAACAAA 
AGGCGTATCT 
CCGTTTGGAC 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1035 



70 



75 



(2) IHFORriATIOt4 FOR SEQ ID 110:210 

(1) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 990 bases paixa 
(D) TYPE: nucleic acid 

(C) STRWIDECX^ESS: double 

(D) TOPOLOGY: circular 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 




wo 99/29870 

144/ 490 

(11) MOLECULE TYPE: DMA (genomic) 

liii) HYPOTHETICAL: NO 

5 (Iv) AIITI-SfclNSE: NO 

.(vi» ORIGIHAL SOURCE: 

(A) ORGANISII: PORYPHYRaiOHAS GTHGIVALIS 

0 tixl FT.ATUaE: 

(A) HATIE/KEY: misc feature 
(Bl LOCATION 1...99Q 



PCT/AU98/01023 



5 



{r.lj SEQUENCE DESCRIPTIOtI: SEQ ID HOi210 



ATGATTCGCA AGCATTTCGG TATCATTTTG GGATTTCTTT CTCTTGTGTT TTCGGCAGGT 60 

GCTCAACAAG AGAAGCAGGT GTTTCATTTT CTGAACCTTC CGGCTACTGC ACAGGCTTTG 120 

GCTGCCGGAG GCAAAGCTAT CACCATCGTA GACGACAATC CCGGACTGGC TTTTGAGAAT 180 

CCGGCTCT'SC TCGGATATGA ATCCGGTGGC CGCGCCTTTC TTTCCTATTT ATATTATATG 240 

0 AGTGGTTCGC ATATGGGCAA TGCCTGTTAT GCCTCGTCCG TCGGAGAGCG TGGCATGTGG 300 

G6TGTTGGCA TGCGTTTCCT GAACTACGGG TCTATGCAAG GATACGATCA GAATGCGATT 360 

GCCACCGGCT CTTTTAGTGC TTCGGATATA GCTGTACAAG GATTTTACAG CCATGAACTG 4 20 

AGCAACCACT TCCGCGGTGG AGTCAGCCTA AAAGCATTGT ATTCTTCTAT CGAGACGTAT 480 

AGTTCCTTTG GCCTTGGTGT GGATGTCGGT ATCAGTTATT ACGACGATGA CAAAGGATAT 540 

5 TCCGCTTCCG CTCTGTTCAA GAACGTAGGG GCGCAACTGA AAGGCTATAA TGAAGAACGG 600 

GAACCGCTCG ATTGGGATTT CCAGCTCGGC TTTTCCCGCA GTTTTATCAA TGCTCCGTTT 660 

CGCTTGCACA TCACGTTCTT CAATCTGAAT CCGCACTATT TCAAGCGTCT TGTACCACGC 120 

GATCTGTCCA AGATGCAAAA GTTCCTCCGA CACTTCTCGA TAGGAGCAGA ATTTACTCCT 780 

TCCGAGAGGT TTTGGGTCGG GCTGGGATAT ACGCCACAGA TTGCACAGGA TTTCGAGGTG 84 0 

0 GAAGGCGGCA ACAAATGGGG AGGTCTTTCG GCCGGCGTCG GTTTCACTTC AGGTGTAGTA 900 

CGTGTAGGCG TATCTGCTGC CACCTATCAT CCTGCAGCTC TTTCGTTCAT GTGTTCGGTA 960 

GGTATCCGTT TGGACGATAA GAGCATCITC 990 



5 (2) IHF0Ri-!ATlOW FOR SEQ ID HO: 211 

{1} SEQUEWCt CHARACTERISTICS: 

(A) LEIJGTH: 972 base pairs 

(B) TYPE: nucl-ic acid 

0 (C) STRAW DEDNESS : double 

(D) TOPOLOGY: circular 

Ui) nOLECULG TYPE: DHA {genomic) 

5 (Hi) HYPOTHETICAL: NO 

(iv) AtiTt -SENSE: NO 

(vi) ORIGINAL SOURCE: 
0 (A) ORGANISM: PORYPHYROMONAS QIHGIVALIS 

(±x) FEATURE; 

(A) MAHE/KEY: misc_feature 
(D) LOCATION 1. . .972 



5 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211 



ATGTGCCTCG AACCCATAAT TGCTCCGATT TCATCCGAGT TGCTCGAGCA GGAGCTGACI' €0 

GCCGATCGTT TTCTGCGGAT GACAAACAAA GCCG6CAATG AGATCTATGT TTTTACGGCC 120 

0 GAAGAAGCTC CGCATTGCAT GAAAGAAGTA GGCCGACTGC GAGAAGAAGC CTTTCGGCAT J 80 

TATGGCGGAG GTACTGGCAA GGCGATCGAT ATAGACGAGT TCGACACCAT GCCCGGGAGC 24 0 

TACAAACAGC TGATCGTATG GGATCCGCAA AACAAGGCTA TACTCGGAGG CTACCGCTTT 300 

ATCTATGGGC GGGACGTTGC TTTCGATACC GATGGCAAGC CrTTGCTGGC AACGGCAGAG 360 

ATGTTTCGCT TCAGTGATGC TTTTTTGCAC GATTATCTCC CCTACACAGT CGAATTGGGA 4 20 

5 CGTTCGrTCG TGTCGCTCCA GTACCAATCG ACACGGATGG GCACAAAGGC CATTTTTGTG 4 BO 

CTGGACAATC TTTGGGACGG TATCGGAGCA CTCACTGTAG TCAATCCAGA GGCACTCTAT 54 0 

TTCTATGGCA AGGTGACCAT GTACAAAGAC TATGATCGGC GAGCTCGCAA TCTGATCCTG 600 

TATTTTCTTC GCAAGCACTT CTCCGATCCG GAAGGCTTGG TCAAGCCTAT TCATCCCCTA 660 

CCGATAGAGA TCAGTGCCGA GGACGAAGCC TTGTTCTCCT CATCCGACTr TGACACCAAT 720 

0 TACAAGACl'C TCAATATAGA AGTGCGCAAG CTGGGTATCA ATATCCCTCC TCTCGTGAGT 780 

GCATATATAG CI*TTGTCTCC GGAGATGCGT GTTTTCGGCA CTGCAGTGAA TCAGTCTTTC 84 0 

GGAGAGGTGG AG3AAACCGG CATATTCATT GCTGTGGGTA AGATCCTGGA AGAGAAAAAA 900 

CAACGGCACA TAGAGAGCTT CATCCTCAGC CGGAACGAAA AAAAAGGTCT CGACAGTAGC 960 

AATGGCCGAT CA 972 



SUBSTITUTE SHEET (Rule 26) CROAU) 



tp«il -d»ii -ii'"" zAi '»"h ri=H> .o^^ tP=-ii .-i" ' -^=11 iP^h fl^7 

'^ji Oi «JL H™ cru to iLJi 0 u.^ u 



WO 99/29870 



PCT/AU98/01023 



145/ 490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



60 



65 



70 



75 



(2) INFORMATION FOR SEQ ID HO: 212 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1641 base paira 

(B) TYPE: nucleic acid 
to STRAMDEDNESS: double 
(D» TOPOLOGY: circular 

(il) MOLECULE TYfE: DHA (qeiioralc) 

(ill) HYPOTHETICAL: NO 

flv) At iTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIIAS GIUGTVALIS 

(ix) FEATURE: 

(A) r4AME/KEY: misc feature 

(B) LOCATION 1...1641 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:212 



ATGAAAACCA 
GCTGTGCACT 
ATCGTCCTGT 
AGGGGGGCCG 
GGCTCGGCCT 
TCGCTCTCGC 
AGTGCTTCCT 
GAAGCCrACT 
TATCGCCTTG 
TCATACAGGG 
GCATTGGAGC 
TCGGCTGCGT 
CAGGACAAAT 
TGGTTCGGTA 
ACCCCTAGGG 
AGGTCGTCCA 
CATCTGTCGG 
CGGATATACG 
GCCATTCGCC 
ATTCGCACGG 
GATGAGACCT 
GGTATAGCCT 
TATCGAATGG 
TATCAGCTGG 
TCTCTGTACG 
TATGGCGACC 
TCACATATCC 
TGCAATATCT 



TTGTAAGATA 
TGTCTGTCGA 
CTCCCACCGA 
ATCTTTTCCC 
ATGGCGATAT 
TCGAATCGGG 
ACTCACGTGG 
ATCCCTATTT 
CCGGCTACTA 
GCGAAGTTGC 
TTTCTTGTGC 
ATCTCTATCA 
TCTTCGTCAG 
TCTCCAGAAT 
GCGATGCCAT 
TCAATCTCTT 
ACTTCGATTT 
AACACTACAA 
GCTGGTATAT 
ATAGAGGTTG 
ATCGCAAGCA 
ATGACCATGC 
TGCTGACGCA 
CCTATTTGCC 
TerCGATTCC 
TGATGAAAAG 
TGTCCGATCC 
CCTACCTCTT 



CAGCCGCCTT 
GGCGCAGATG 
AAGGCAATAC 
GTTGAGCGAT 
TGCGGGCGAC 
TGGTCGCATC 
CATGCACAAA 
GGTGTCCGAT 
TTCTTTTCGC 
Tl-ATCGGCTG 
TACCTCTTTG 
TAGACAACAC 
CTACGGTTTC 
GAACTACGTC 
CGGTCTCGAC 
TGCTTTGCTT 
TTCATTTTCA 
GCCGGATGAT 
GCTCAATGAG 
TGCCCTGAGA 
TGGACATCAT 
CGGATCCAAA 
TTCGTATAAG 
CTATGCCTAT 
GATGCAGAAT 
AAAGGACGGT 
GCAAQCCGAA 
C 



CCGGTCGCTC 
C-rCAATACTC 
AGGGAGATTT 
AACCTGCGCG 
TATCTTCCGT 
AGTGTCCGTA 
CGCATCGGCT 
TCGACCGGCG 
GCCGGCCGCT 
ACCGATCCGC 
ACGCTGCCTC 
CTCACACAGT 
GGTCAGGTGG 
AACGGATGGA 
TACAGCGGCT 
TACAATCGCC 
GCCGACTATG 
AATTATCATA 
TTTTCTGCCC 
GTGAGTGCCG 
ACCATGAGCG 
TTGGATTTTG 
ATTCGTACCA 
CGTAATAGAG 
ACCCACCGCC 
ATAGCCTATG 
CGAACGTCCG 



TCTTCTTTTG 
CTTTCGAGCT 
GTGTGCAAAC 
ATTCGGCCTA 
ACAACGGCAA 
ACTATGGCAC 
GGAATGCTCT 
GAGACTATCA 
TGCCCCTCGG 
GTACGACCAA 
GAGAGAACAG 
ACAACTGGCG 
ATGTCAGCAA 
AGCTTAGCTC 
ACTTCCTCGA 
TGCGACTCTA 
CTCTGCGCCA 
TCTACGACCT 
AAGCCCAAGC 
GTAGTGATTT 
GAATGCTACG 
GACTTTCGCT 
TCCAGAAAGA 
AAGGCGTGGA 
TGATGACAGA 
CCAAAACGCC 
GCCATACCAT 



CCrrTTGGGA 

GTCGGATCAG 

GAAAGAAAAA 

TGTTCGTTTC 

TAACTACTCC 

ATTGCAGGGC 

GCGCAACGCC 

TTTCGAAGAC 

TATAGGCTTC 

TACGACCGGT 

GCTATCGCTT 

TCCCGGGCAG 

CAGCCCTATC 

CCGTCl'GGAT 

TACCGAaGAG 

TGGTAGCTGG 

AGGGATAGAG 

CCGTATCTTG 

CTCCTACCGT 

CTACGGCTAT 

rCCTTTTGCC 

TTCGGCTGCr 

GCAGCTCGAC 

GGTGCGTTCC 

GCTGCGGTTG 

CGGTGTCATC 

CGGGGCTATC 



55 (2) IHFORfiATIOr^I FX3R SEQ ID HO: 213 



(i) SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 2250 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS : double 

(D) TOPOLOGY: circular 

(11) HOLF.CULE TYPE: DNA (genomic} 

(lii) HYPOTHETICAL: NO 

(iv) A[4TI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

iiy.) FEATURE: 

(A) NAME/ KEY: nasc_feature 
(EJ LOCATION 1...22S0 



60 
120 
180 
240 
300 
360 
420 
400 
54 0 
6O0 
660 
720 
780 
840 
900 
960 
1020 
lOSO 
1140 
1200 
1260 
1320 
1360 
1440 
1500 
1560 
1620 
1641 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



-ii""it iV^i -75""' tl"^i ™Hi .tt*' rf-'^i ^•-'n, ip^, 

iui -^p ::3 o «JL ^ini-: q o uj u o ar; Oi iiji -tuJ 



10 



15 



WO 99/29870 PCT/AU98A)1023 

146/490 

(xl) SEQUENCE OESCRIPTION: SEQ ID HO:21."* 

^^^^^ J^S'^^''^'*'^ GCCGTCCTGC CTTTCTGCCT GACGGCACAA 60 

5 ct?S?a2^I GATAGATAGT CTTAGCAATG TGCAGCTCCA GACCGTACAG 120 

S^CT?^ S?^A5?a CCTGTCGCTT ACACCAACGT TCGCAAGGCC IbO 

GAACTTTCCA AGTCCAATTA TGGTCGTGAC ATCCCCTATC TGCTGATGCT GACTCrCTCC 240 

GTGGTAGCCA CCAGCGATGC CGGTACGGGT ATCGGATATT CCGGCTTTCG CCTGcSTGcr li^ 

ni'S^'^^?^:: ^^^^^'^^^ CATAACTACC AATGgS^I^ cSSIJ^ ^C^^ 360 
SS.^J^ IIISS^^^^ TATGCCCGAC TTCGCCTCTT CCATCGAAgS ii^T^^ 

^^^3^^ TGGGTACTTC CACCAATGGT GCCGGAGCTT TTGGGGCAAG TGTCAATATG 480 

CGTACGGATA ATTTGGGACT GGCTCCTTAT GGCCGTGTCG ATTTGAGCGG AGGTTCGTTc J!n 

ggcacattcc gccgatcggt caaactcggt agcggaSSa tcggtc^?^ ??^^c^tc too 

^?^At5 l^''^^^ ^^^^^ GGCTACGTC.3 I5^cS ^^^Ir^o HI 

^l^^l If^^^S?! GGGCTATTTC GGTAGCAACA CGGCTCTCAG GTTCATCACT 720 

lllr^G^ ^C^?I^?f TGGAACGGTC TTTCCAAGGA GGATGAAGCC 780 

^ft^^iS GCCOATACAA CAGTGCCGCT CTTATGTAOS I'GGACGCGCA AGGAGTACCG 84 0 

^^SI'''''' ^CW^TACCGA CAATTACGAG CAGCGTCACT ACCATGCCAT CA^GAC^C 900 

S^^SS- A^SJ^Sf ACGGCACACT ACACGGCCGG ATA^gS 60 

20 aI^r?5^T ^l?.?^^^^^-^ ACGTAAACTA AAGGAATATG CACTGCAGCC CTATGTGGAA 1020 

AACAGTGTGA CCGTGAAGAA AACGGATCTC ATCCGTC^GA AGTATCTGGA CAATGA^TTT lOftn 

GGAGGACrCA TCGGTTCGCT TAACTGGCAC ACCGGTGCAT GGCAW^A ^^^^r \^,n 

TCGGGCAATA TCTTATAAAGG AGACCACTTC GGCCGTA?CA OTaS^^ ^^^T Aoo 

^^SJI''^ CTCCCGACTT CGAATATTAT CGGAACAgS ciS^S^ ^G^J llsl 

25 SfS^Z^"" AAGCCAACTG GCAGATCACT CCGCAACTGA ACATGTATGC C^^^^^G llto 

l^^^^^l^"^"^' TCGGCTACAC 3ATAAACGGC ATCACGSyvCG AATATGATGA GGT^SgSa ^380 

n^ISn^'''' ACATCGATTT GGACAAGACC TTCCGCTTCC TCTWTCCG^ Sgtc^^CTT ^440 

C^^^tJl TATGCrTCTG TTGCGgJ^S^ ^^cS^g ISoS 

CCTAACAGAA CXAAITACAC CGAAGCCGGA ATACGACAGT ATCTA^GCn '''«Aro-a/-Tf -t^n 

^S;^ CCGCTATGCT TCGCCCctS J^S^^ A^A^^ xI^S 

OT^CAC^ CCAACTCGTG CTGGATGGCC GTTTGAGCGA TGTGGGACAG 1680 

CA^§^-?^ ^^2^°"="^*'= CGTATGGGAC TGGAGCTGAC TCTCG6TTGG 1740 

CAGATCCITC CTCGTTTGCT GCGTTGGGAT GCTTCTTTCA CTATGAGTCG CaiArABAnTf- lann 

rJSn^I^i"' ^'^O^TATAC ATCCGTATAT GATGCG^CT ACAaG'S SSS^^G Ueo 

V^C^ CT^S? TACTCGCCCA ATGTCATT^? SSSSg ",20 

iii^^rr. CTCATGCCGG TTTCGAAATG GCTTGGA03A GCCGCTTCGT CAGCAAGCAA 1 QflO 

l^JSl'^'^'^'''^ ATACACAGCG CAGCGATCGC ATGCTTTCCT CCTATTGGGT GAACGACCTC 2o!o 

^A^flS^S^ ATGTGCTGCC GGTTCACTTC GTTAAGAGAG TGGCaS C^^A^C 2^00 

^^^Ccl gS^S^ ^TATGCGTCC AATGCCTACA TCTACGATGC CG^?S 2160 

40 ^fo^l^l ItI^^^I 1^^^^'. GATCTGCG^ ATTATCCTCA GGCCCGATTT 2220 
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12) INFORMATION FOR SEQ ID NO: 21 4 
45 {!) SEQaEIICE CHARACTERISTICS 



(A) I.ENCTH: 14 02 base pairs 

(BJ TYPE: nucleic acid 

tC) STRAtJDEDNESS: double 

(D> TOPOI^Y; circular 



(ii) MOLECULE TYPE: DNA (genomic J 
<ili) tnrPOTHETICAL: NO 
(iv) AIJTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM; PORYPHYROHONAS GIIIGIVALIS 

(ix) FEATURE: 

(A) NAUE/KEY: misc_featur- 

(B) LOCATION 1...14 82 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:214 

^ZSS^^^^ GG-rTTCTATC GCTGTTACTG CTGTACATAC TCTCTTCCAT CAGCCTTTCT fiO 

C-S™ l^r^^lTf ^^°«^TC GAGTTGGATA CCGii^cS^'I T^^^C^G 120 

^™ IS CTTGCCGCGC CATCGGTAAA ACGATAGGCG TCAATCTGGC CGTATGGGGC 1«0 

70 ]1t^I^^ ^^^.^"-^''^ GCAGACATCA GTTGGCAGAC S^C^G^C 24^ 

fxj /WlTTCCAAA CAGoCTTTGG CTGGGACAAT GACAAGTTTG TCACrAArr-p f*rTnfvii/-ai» tlX 

l^r^r.'^-"''^ OCGAGG^l^A aS^^ S^cSc llo 

riA^fJ^SI "^°=TTCTT TGGCAGTCTC ATGTGGGAGC TGCTTATGGA AAACGAQCCA 420 
S«GGCT§? C^S^J^ TGCCACCACC ATAGGCGOTA TAGCTTTGGG 

75 SS^T^A ?^CTr^ SI?^™ CGTACCACAG GGTGGGAACG TATGGCGCGC 540 

(.A^TGGCTA TCGCTCTGAT CAATCCGATG CGCTTTCTCA ACCGTCTGAC ACCAGGAGAG 600 
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GTGACTTCTG 
GTCGATGCCG 
CTGACCCTGA 
TACGATTTCT 
CAGATCAATC 
TTCiGTGGGAG 
AATTCGGAGG 
GGTCTTATCT 
ACCTACCTGA 
CGGGACTATA 
GATCTGTGGA 
GAAGAGCCGC 
AAGGCGCGCC 
(iCCATCGTCG 
TTCGATACCG 



TCGGGAGTCG 
GCTTTCGCTT 
ATCTGAGATT 
TCCAATTCAA 
TGATCGGAAT 
GTCTCTTTCA 
AGGTACTCGT 
TCCAGCACCA 
ATGTCGTCCC 
ACCTCGGCAG 
GCTGGCTCTT 
ACCAGAAAAA 
TACTCGTGAC 
CTCGCCGTTT 
GCGACATACA 



CAGCGGACAG 
TTTGGCAGAC 
CGACTACGGC 
AGCCGGATTG 
CCTAAGCGGA 
GCACTTCGAC 
CACCCCATAC 
CGGAAAATTT 
GATGGGAGCC 
CGGATTGAGC 
GGGAGTCGAA 
TACCGATGTC 
GAGrrCCGAG 
CATCCGCAAA 
GCTGCGTGTC 



ATATTTCAGT 
AAGCGGCATG 
GATCCATTCC 
AGTTTCTCC3 
TGCCAACTGC 
TACTACAATT 
CGTATCTCGC 
CGACCACGTC 
AGTCTGTCGG 
GGCAAGCTAT 
AGCTATCGGC 
AGCTCTTTTA 
TTCGCATTTC 
ACAGCCTATC 
GGATTTCACT 



CTGTCCCCAT 
CCCGAACCGG 
GAAGCGAGAC 
AATCGCAACC 
TCGCACACX3A 
CGGAAAAACG 
AAGTGGCAGC 
CTCTGGAGCT 
ATCACTACAA 
ACCTTGGTGC 
TCTACAC3VTG 
TGGTGCAGGG 
ATCCTGGCCC 
AATTCTACCC 
TC 



AAACATAGTC 
TGCCACGGCT 
TTTCTCTCCA 
TCTGCTGAGC 
ACGAACGGTT 
AATAAGCAAA 
TCTGGGAGGC 
ATATGCCGAG 
CGTGGACAAT 
TACGTACAAT 
GATCGGGTAT 
GGACGAAA3C 
CTGGCATGTA 
TAACGTATCA 



660 
720 
700 
840 
900 
960 
1020 
1080 
1140 
120O 
1260 
1320 
1380 
1440 
1482 
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(2) INFOWIATION FOR SEQ ID IIO:21S 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDtlESS: double 

( D ) TOPOLOGY : circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYt»OTHETICAL: NO 

(IvJ AUTT-SEIISE: UO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISE: PORYPHYROHOHAS GINGXVALIS 

( ix ) FEATURE : 

(A) NAIIE/KEV: ml3c_f eature 

(B) LOCATIOU 1. . .882 



(xi) SEQUENCE DESCRIPTlOt4 : SEQ ID NO: 215 



ATGAAACGAC 
CAAGAAGCAA 
GGACTAAATG 
GCACGTAACC 
AACGGTTTGC 
GTAGACAAGA 
AGTGCGCTTT 
T7GACAGGAG 
GGTGCGGACT 
CTGACTGTAG 
GAAAAGACAA 
GAGAATan'CA 
AACATTGACA 
ACGATAGCCA 
CAGTTCAAAG 



TGATTGTTTT 
ACACTGCATC 
CCTCTCAGAC 
TCTATTTGAA 
GTACAGACTT 
TCGAACTCTT 
TCACTTTCCT 
TCAAGCATAT 
ATAAGCCCAA 
TAGCAGACGA 
TGTTCGAACT 
ATTTGATAAC 
TCAATTGGGA 
CCAATCTI'AT 
AAGTTGTGGG 



TCTGGCAATG 
TGACACTCCC 
TTCTCTGACC 
CATAGATCCC 
CGGTCTGACC 
CACGAAGGCC 
CTCACAGTAT 
CTCTAATTTC 
TGAGAAGTTC 
Cl-ACCTCTCA 
TGGTGCTTTG 
CAAGGCTTCA 
GGCTATGCTG 
CTACGACGAT 
CGTGGGTGTT 



GGTGGCTTGC 
AAAAAGGACT 
AACTGGGCTG 
AACTACCTGA 
TACACAACAG 
GGCTATGAGA 
GCCAAAGGAT 
TTCGCTCCTG 
TCTCTCTACC 
AGTTTGGGAG 
GTAGTGGGTT 
TTCTTCTCGG 
GCCATGAAGA 
GATGTGAAGA 
GCGTACACTT 



TGTTCACCCT 
GGACTATAAA 
CCGGTGGAGA 
AAGATAAATG 
CCAACAAGTG 
TCGGCAAACA 
ATGAGAAGCC 
CATATCTCAC 
TCTCTCCTAC 
OCTTCGGGGT 
CGGCCAATAT 
CTTATACGCA 
TCAACAAGTT 
TCAACGATGG 
TC 



TGCGAACGCA 
AGGTGTGACC 
AAACACGGTG 
GAGTTGGGAC 
GAACAAAAGT 
ITGGTACGGA 
CTCGGATCAC 
TCTCGGTA7T 
AACGGGCAAG 
GAAAGTTGGT 
AAATCTGATG 
CGACTTTGGC 
CCTCACGGCT 
CCCGAAAATC 



(2) INFORMATION FOR SEQ ID MO: 216 

(i» SEQUENCE CHARACTERISTICS: 

(A J LENGTH: 612 base pairs 

(B) TYPE: nucleic acid 

(C) Sl-RAT^DEDNESS: double 
{ D) TOPOLOG*! : ci rcul a r 

(11) MOLECULE TYPE: DNA (genomic) 

Uii) HYPOTHETICAL: HO 

(Iv) AHTI -SENSE: NO 

Ivl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GTNGrMLIS 

t*EATURE: 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
882 
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WO 99/29870 PCT/AU98/01023 

148/490 

{A> NAME/ KEY: mi3c_f eatujre 
(B» LOCATION 1. . .612 

(XI) SEQUENCE DESCRIPTION: SEQ ID MO: 216 

O 

ATGAAGAAAA TGATTTTGGC AGCTACTATG CTGCTCGCAA CAATCGGTTT TGCAAATGCT 60 

CAGAGTCGTC CTGCTCTTAG ACTGGATGCT AACTTTGTCG GTAGTAACTT AATGCAAAAA 120 

GTCGCAAACA CGAGCGTGAA CAATAAGATG ATCGTAGGCT TACGTGTTGG TGCTGCTGCT 180 

GAGTTCGCTC TTAGCAATGA TGGATTCTAT CTCGCCCCCG (iATrGGCCTA TACGATGAGA 2^0 

lU GGTGCTAAGA TGGAATCACT AAGTGAAACG ACAACTCGCT TGCATTATCT GCAAATACCG 300 

GTGAATGCCG GTATGAGATT TAGCTTTGCT GACAACATGG CTATTTCATT GGAAGCAGGT 360 

CCCTATTTCG CATATGGTGT CGCCGGAACG ATTAAGACTA AAGTTGCAGG CGTTACGGCT 4 20 

TCTG TAG ATG CCTTTGGTGA TAACGGATAT AACCGTTTCG ACTTGGGCTT GGGCTTGTCr 480 

GCTGCCTTGA GCTACGACCG TTATTACGTA CAAATTGGAT ATGAGCATGG ATTGCTTAAT 54 0 

ID ATGTTGAAGG ATGCTCCGGA TAAGACTTCT TTGCGTAATC ATGACTTCTT TGTGGGTCTC 
GGTGTTCGCT TC 



600 
612 



{2) in FORMATION FOR SEQ ID HO: 217 



ii) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI4DEDNESS: double 
^5 (D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DIM (genomic) 

(iil) HYPOTHETICAL: NO 

(iv) AtJTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGA^JISM: PORYPHYROI-IONAS GINGIVALIS 



tlx) FEATURE: 

(A) NAME/ KEY: mlsc_f eature 

(B) LOCATIOtJ 1 . . .729 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217 

ATGAAAAGGA TTTTTACTGT AGCCCTTGTG CTACTTGCTT CGGTCACTAT GGCCATCGGA 60 

CAAAGCCGCC CGGCACTTCG CGTAGATGCC AACTTCGTAG GCAGCAATCA GAGCATGAAA 120 

AGAGACGGAT ATGTGTGGGA CACCAAAATG AATGTCGGCC TGCGGGTCGG TGCCGCTGCC 180 

GAATTCATGA TCGGATCAAG AGGATTCTAC ITGGCTCCGG GTCTGAACTA TACGATGAAG 24 0 

GGCTCCAAAA CCGAATGGGA TATACCCGAA ATGGTTCCTG GTACCTATAT TACGATGGTT 300 

TCCACTCGCT TGCACTATCT GCAACTGCCG ATCAATGCCG GCATGCGCTT CGACCTGATG 360 

AATGACATGG CGGTTTCGAT CGAAGCGGGT CCTTTCCTTG CATACGGTAT ATATGGTACA 420 

TATCGGCAGA A'3TTGGAAGG ATGGAAGCCG AACAACTACA GCACAGAGTT TTTTGGCCCA 4 80 

OU ACGCTTGGTG GCCCAACAAA TATCCGCTGG GACATCGGGG CAAACATAAT AGCCGCATTC 54 0 

CACTATAAGC GTTATTATAT ACAGATAGGC TATGAACATG GATTTGTGGA TAn-GTGTCA 600 

6GTGGAGGTT CTGATATTCC CCGACTGAAC GACAATAGGC AATCCTCTTC GACGACCGCT 660 

CTAAGAGAAA AGGGAAATAA CGAATACGCT TATAATCGTG ACTTCTTCGT GGGCATAGGT 720 

55 TACCGCTTT ^^9 



i2) INFORMATION FOR SEQ ID NO: 218 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRAW DEDNESS: double 

(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA (genomic) 

(liil HYPOTHETICAL: NO 

(iv) AimsZUSEi NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIIONAS GINGIVALIS 

iix) FEATURE: 

(A) NAME/KEY: mlsc feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(B) LOCATION 1...621 



(xt) SEQOEItCB DESCRIPTIOK: SZQ ID NO: 218 



ATGAAAAGAA 
GUACAAGACG 
AACATCCCGG 
GAAGTGGGGC 
GCATCGAACG 
TCATTCTTCG 
CATCTGTTTT 
AATAAGCACA 
AATTCGAATA 
AACGTGGOSG 
CCCACGCTCT 



TCCTGCTGCT 
TTATCAGACC 
GAGGATTCAC 
TGTCCATGl'A 
AAGGAGACCG 
ATGCAGGCAG 
QGCAGAATAG 
ATATTCATTT 
CCTACATCGG 
CGGGTGCCGC 
CCACTCATTT 



TerCGTTGTA TTATTATATG GAATTGCAGG CCGATTGGCT 
ATGGTCATTG CAGGTCGGAG CGGGATACTC CGATACGGAG 
CTATGGTTTC TATTTGGGAA AGCGTATGGG GAGCTTTCTG 
CAACTCCACA CGTCAAACAG CCAACAATGC AGACTCCTTT 
ATCTTTTCAG GTAAATATGT CTTCTCCGAA TGAGAAGTGG 
TGCCAACTGC TATATGATCG TCGTCGGAGT CAATCCTCTC 
CCGGCACAAT TTGTTTCTGG CAGTACAAGC CGGCCTGTCC 
CATCTATGGA GACAAGCGAG CCAAAGTCAG TATCTACACC 
TTACGGAGCA CGTGTAGCCT ACGAATATCA AATTCATAAA 
TGTAATGTAC GACCACGGCA ATAAGATGCT TACGCCCATO 
T 



60 
120 
X80 
240 
300 
360 
4ZG 
480 
S40 
600 
621 



(2) IHFCRlIATIOir FOR SZQ ID HO: 21 9 

U) SEQUEnuE CHARACTERISTICS: 

(A} LENGTH: 2853 base pairs 
(Bl TYPE: nucleic acid 

(C) STRAWDEDIJF.SS: double 

(D) TOPOLOGY: circular 

(*i) MOLEO-iLE TYPE: DMA (genomic) 
(lii) HYPOTHETICAL: NO 
(iv) AJJTI-SEM^IE: NO 

(vi> ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMONAS GINGIVALIS 

(ix) FEATURE: 

(A» NAHE/KEY: mlsc_feature 

(B) LOCATIOtJ 1 2853 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 21 9 

ATGAGAGTAT CCGATCTCTG TTCCAGACTT TCATGGTTAT TACCCGTAAT CCTTGTCGGA 60 

TTGCTCTGTG CTACTTTGGT CGCTGCGGAA CGTCCTATGG CCGGAGCAGT CGGATTGCAC 120 

CACCGTCGGC ATGCTGCGCT GTCTGATTCT ACAGCGAAAG ACACGGTGCC TCTCGCAAAA 190 

CCTATTCCTG ACAGTGCTTT TCGAGATTCC CTTCCTGCC-S ATTCCACCGG ATCGATGCGG 24 0 

CAAGATAGCG TGTATGACGA TGAATTC6AA TTGGAAGATA TAGTGGAGTA CGAAGCTGCC 300 

GATTCCATCG TTTTGCTCGG ACAGAATCGT GCCTATCTTT TCGGCAAGAG CTATGTGAGC 360 

TATCAAAAGA GTCGCTTGGA GGCAAACTl'C ATGTATCTCA ATACCGACAG CAGTACGGTT 4 20 

TATACTCGCT ATGTCCTCGA TACGGCCGGT TATCCGATGG CCTTTCCTGT TTTCAAGGAT 480 

GGAGAGCAGT CGTTCGAAGC CAAGAACTTT ACCTACAACT TCCGCACGGA GAAGGGGATT 540 

ATCAGCGGAG TGATCACGCA GCAGGGCGAA GGCTATCTGA CTGCCGGTAA GACCAAGAAG 600 

ATGCCCGACA ATATCATGTT TATGCAAGGA GGGCGTTATA CGACCTGCGA CAATCACGAT 660 

CATCCTCACT TCTATATCAA TCTTTCCAAG GCAAAGGTGC ATCCGGAGAA AGACATCGTC 720 

AC AGGT CCGG TCAATCTGGT TATCGCCGAT ATGCCGCTGC CGATAGGTCT TCCTTTCGGC 780 

TATTTTCCCT TTTCCAACAA ATACTCTTCC GGTATATTGA TGCCCACGTA CGGAGAGGAC 84 0 

AATCGCTATG GATTTTATTT GAGGAATGGT GGATATTATT TTGCCTTCAG CGACTATATC 900 

GATTTGGCAT TGCGTGGGGA GATCTTTTCC AAAGGGTCAT GGGGCATTTC AGCCCAATCG 960 

AAATATAAGA AGAGGTATAA GTACAACGGC TCGTTCGAAG CCAATTATCT GGTATCGAAG 1020 

TCCGGCGACA AATACGTGCC CGGAGACTAC AGCAAGACCA CCAGTCTGAA TATCCGATGG 1080 

ACACACAGTC AGGATCCGAA GGCCAATCCT TTGCAAACGT TGTCGGCCAA TGTCAATTTT 114 0 

GCCACCGGGA GCTATTTCCA GAATTCGCTG AATACCACCT ATGATGTCAA TGCCCGTACT 1200 

GCTACGACAC GAAGTTCGGC CGTGAGCTAT TCGCGCAAGT TTCCGGGTAC TCCTTTTTCG 1260 

ATTACGGGTA GCATGGATAT CAGCCAGAAC ATGCGCGATA CGACGGTGAG CCTTACCTTG 1320 

CCGAATCTTT CGATTAATAT GTCCACGCGT TATCCTTTCA AGCCGAAGAC CCGTGTAGGA .1380 

CCGGAGCGAT GGTACGAGAA GTTGAGTGTG GGCTATTCCG GTCAGCTTCG CAATAGTATC 14 4 0 

TTGACAAAAG AGAAAGATTT GCTCCAGAGC AATCTCGTGC GCGATTGGAA GAATGGTATG 1500 

CGTCATTCCG TACCGATCAG TTTGACTGTC CCTTTGTTGG ATTATATCAA TCTGACTATG 1560 

GGGGTTAACT ACAATGAGTG GTGGTACACG AAAGGCATAC GGAAGTCGl'G GAATGAGGAT 1620 

AAGAAAACAT TCCTGCCTTC GGACACGACC TATAAATTCC GCAGACTGTA CGATTACAGT 1660 

CTGTCGGCAG GCTTATCTAC CACATTGTAC GGTATGTTCA AGCCTTGGAA ACCTTTTTCC 17 4 0 

TTCGGAGGCA ATCTCATTAT GATCCGTCAT CGCTTCACGC CCACTGTCAG TTTCTCCTAT 18 00 

ATGCCGGACT TCACGAAACG CCGATATGGC TTTTGGGAGC TTCTTGAGCA TACGGATCAG 1860 

AACGGCAAGC TGCATACGCT GCTCTACTCT CCTTATTTCG AGCAGATATT CGGTGCTCCC 1920 

TCCATGGGCA ATGCAGGATC TGTCAATTTC TCTTTTGACA ACAACTTAGA GGCCAAGATC 1980 

AAATCCAAAT CGGATTCGAC AGGGATCAAG AAGATCAGCC TGATAGATCA GTTCACATGG 204 0 

TCTACATCCT ATAATATGTT TGCCGATTCG ATCCGATGGA GCAATATCTC GGCTTCGCTG 2100 
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GCACTTCGCC 
ACGAAGTATT 
ATTTTTAACG 
CTCAACAAAG 
GATGAAAAGA 
GAGGGAGGAA 
GGAGCAGTGG 
TATACTTGGA 
TACCGGGTAA 
TTCGGATTCA 
G'-J'CACTCGCG 
AAGTCCTATA 
CAGAGCAATC 



TCTCCAAGAG 
ATGAGGGAGA 
GCAAGGGATT 
AGTCGCTCAG 
AAAACACAGG 
GACCGCAAAA 
ATCAGGATGG 
ATATTGCTAC 
CGCAGAATCT 
ATGCGAACTA 
ACATGCACTG 
ATTTCGTCAT 
GTCCCATCAC 



CTTTACCTTG 
AGATGGGAAG 
GGCACGCCTG 
CGGATTGATA 
GGCTACTCCrr 
TGAAAGTGGG 
TTACTTCGCA 
CGACTACAAT 
GAGCTTTCGC 
CAATTTCGAC 
CTGGGCTATC 
ATCGGTGAAG 
GAATACTTGG 



CGCTTGTCCG 
ATCATTCCCT 
ATCAGTACGG 
GCTCTTTTCA 
CATGAAGGAG 
GGGTCGCTCC 
TATTCGATCC 
AGGTACAATG 
GGCAATATCC 
TTGAAGAAAA 
TCGGCCAGTT 
AGTTCACTCT 
TAT 



GTCTGTTCGA 
ATAAGAGCAA 
GTACITCTTT 
GTGGCAAAAA 
ACGATGCTGC 
TCGAGCGCAA 
CATGGAGCCT 
TCAATAAGAT 
AGCCTACACC 
TAACATCGCT 
TCATCCCTAT 
TGCAGGATCT 



TCCCTATTTG 
CGACCTGCGC 
CAGCTATACG 
GGAGCGGAGA 
CGATATACrr 
CCGTCAGGGC 
GTCCTTCGAC 
GGAGCACTAC 
GAACTGGAGC 
TACCTGCTVAC 
AGGAGCATAC 
GAAGTATCAG 



(2) INFORMATION FOR SEQ ID NO: 220 

(1) SEQUENCE CHARACTERISTICS: 

I A) LENGTH: 367 B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS r double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPF.: DNA (genomic) 
(lil) ir/POTHETXCAL: NO 
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2853 
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(iv) ANTI -SENSE: HO 

(Vl) ORIGINAL SOURCE: 

(A) ORGAIIISI-I: PORYPHVROMONAS GINGIVALIS 

{ ix ] FEATURE : 

(A) NAl-IE/KEY: micc^f eature 
(BJ LOCATIOrj 1...367 8 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 220 



ATGATGAAAC 
TTTCAAATAA 
AAACATTCAA 
ATTGTATACC 
AGGCCTGCAA 

TerccccccG 

GATCCTGAAA 
GTACCTGTCG 
GCCAGATTAA 
GGGACGGCTA 
TTATCTGATT 
ATTTGGGGAA 
ACTGCTGTGG 
GTTAAAAATG 
GATTTGGTTA 
ACACCTCTTA 
GAATCCAAAT 
GCATTGGTCT 
GCTATCAATG 
GACCTGGGAG 
CACTTCTTTA 
ATTAATATAG 
AAAAATCTGA 
CACTATGATG 
GAAGCTACCA 
CCGGTAAATA 
GCAAAATTGA 
CCCACGGTTA 
GGTCAACCCA 
CAGTCACAAG 
TTGGACAAAA 
ACTACTGCCG 
GGCCAAATCG 
CTCAATTATT 
TGGACAGGAG 
TTCGGTTCTC 
AACCITATCA 



GATATACAAT 
AAGCTCGCCC 
TGGATTCTAA 
AATCTGTTTC 
ATCGTTTCCC 
TAGTTCCCGT 
ATCCCAATGC 
ATGTGGTGAT 
AGTCTGCCAT 
CAGAAGGGGT 
TTACCAAAGA 
CACATACCCA 
ATAAGCATAT 
TAACTACTGC 
TACAAGGAGC 
CCCCAAATTA 
TCGATTATAG 
ATGAACCGAG 
AGGCTCAGTT 
ATTTTGCCTT 
CGGCGACACC 
GTATACAjGAG 
CGCAATCGGG 
TCTCTACTAA 
TAACTTATCG 
CTACTTCTGC 
CCTATACCAA 
AGTTAGGTTA 
TACAGGCAAA 
ATTTCTTTTT 
CGACCGAAGC 
ATGGTAAACG 
GTATCAGTTG 
GGATGGGAGG 

cccaagtacc 
cggcggtagc 
ataattccga 



AAITCTTGCA 
TTATGAAAGA 
ATTGGTGCCT 
TGAACATAGT 
TTCGCATAGG 
TGCCGTCGAC 
CTGGGATGTG 
GGTTATCGAC 
TGCATCGGGA 
GCGTATCGCT 
CACTGCTTTl' 
GGGGGGGCTT 
CATATTGATG 
AQACTTCATT 
AATTAATTTC 
TCCAACTCAT 
TAATCTGAGT 
GTTTCCTCAT 
TGCGAAAAAC 
GGCCAACAAT 
GGCCAATTTA 
GGGGGAGGTG 
AGATGTTACT 
AAAACTGACA 
TATTTATCCC 
TATCGGCCCG 
TTCCAATGGC 
TGGTGTTATT 
TGGAACAGTT 
GCCCTCAGGT 
ATTACAGTAC 
TTATCGTTTT 
GAAAAAACCG 
AACAACAGAC 
GCTCACAGGA 
CGATTTGCAT 
CAAGGATTTA 



GTTTTTCTTT 
TTTGCAGATG 
GCAAATAAGG 
GACTTAGTTA 
AAGTCTTTTT 
AAGTATGCGG 
ACGCTAAAAA 
CAGTCITCGT 
CAGCGTTTTG 
CTTGTGAGTT 
CTCTGTCAAA 
AAAATGGCGA 
TCTGACGGGT 
GGCAAAACTG 
CCTACAAATT 
TCTTCTAAAG 
GCAAtSGATTA 
CCCTATTATT 
TCTGGTTATA 
TCGTTGAAAC 
GCTGCAGCGT 
ACGGACTTTG 
CATTTGCTAA 
•1*GGACTACTG 
GATTTGGATT 
GATCTTGGTG 
GAACCGAATC 
AAGCGGCACT 
GTCAGTTCCC 
GGAGGTCATA 
TATTCCGTAC 
GTCGAAGTCC 
GCAGGAAACG 
CAACAGAGTG 
GAAGATGTAG 
GTCCCGACAA 
GTTGTTACCA 



TATTCTGCAC 
TAGAGAAGCC 
GTAACTTAAT 
TTTCACCTGT 
TTGCAGAAAA 
TACCGGTTGC 
TCACTACTAA 
CAATGGGAGG 
TGAAAAAAAT 
ATGACCATGA 
AAATCCGGGC 
GAAACATTAT 
TAGCGACGGA 
GAAATGCGAA 
ATGTTTCCAA 
TTGGACGGAG 
CTTTTGATGG 
ATTATTTCCC 
CAATCCATAC 
TAACCGCTAC 
TTGATAATAT 
TAGCTCCTGG 
ATGTTTCAAA 
GTACTATCCT 
ATATACAGAA 
GATTCGATAC 
AGCAGTTAAT 
ATGTATTG6T 
TAAGCGAGGC 
TTGTTCCCAA 
CGCCGACTAA 
CAGGCTCCAC 
CTTACTTCGC 
AATGGGATGT 
AGTTTGCAAC 
CCAACCCCAA 
CAAGCAGTCA 



GGTATTTACC 
TTGGATTCAG 
TCAAGCl'GAA 
GAACGAAATA 
TCTACGGGCA 
CAATCCAATG 
AGCGGTAACA 
GCAAAACATT 
GTTGCCTAAG 
GCCTCATCGC 
TTTGACTCCT 
GGCCACTTCT 
GCAGTATCCT 
TGATCCCATT 
CAATCCATCT 
AAATCTGCCG 
TGTTGCTGGC 
TTGTAACGCT 
TATTGGCTAT 
AGACGAGAAT 
TGCCCAAACT 
TTTCATCGTT 
TGGAACGGTG 
GAGCTCATCA 
CAATGATATT 
CAATACCGAG 
TTTCCCACCT 
AAATAAAGAC 
TCATGTTCTA 
ATGGATAAAG 
CACGGTCATC 
GCCGAATCCG 
TTACAAGCTC 
GACGTCCAAT 
GACAGAAAAT 
AATTATCGGT 
ATTGACGATC 



60 
120 
180 
240 
300 
360 
420 
4tiO 
540 
600 
660 
720 
780 
640 
900 
360 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



SUBSlil UTH SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



AACGGCGTGG 
GACAATCCTA 
ACCGTCGAGT 
TGGCAGTATT 
AACGAGACCG 
TATGCACCTG 
GCACAGCCTA 
AATCTGACAC 
ACTGGAGCCA 
TATCTGTTCA 
GGCTATCGAG 
CCGGATCGTA 
ACGTTGCAiiA 
ACGCAGATCA 
GATGTTCTTG 
TTCGGATTCG 
TATGCGATGT 
AACCTGCTGA 
TCGGATCATT 
CGTATTACC3A 
TTCCX3CTTGT 
GGCClSTGAAA 
ATAGAAGGTA 
GTGCAGACCG 
AAGGTGCTTG 



TTGAGGATAA 
CGGGGACATT 
TTTACAATCA 
TCGGTATCCC 
TCAACCAATG 
ATACAGAGCT 
CGGGAGTTTA 
GCACGTCCGG 
TCGACATCyVA 
ACACGGGAAC 
CCGGTCAGTA 
TTCCATCGAT 
TCTTGTACGA 
CATGGCtaATC 
GTAACGAGTC 
ACAACGGCTG 
CTGACATCGG 
TCGGCTTCGA 
TTGCGAAAGG 
ATTCTACGTC 
CATATGGATG 
TTATAATTCT 
AGCTTCTTCG 
GAGGGGCCTA 
TTGAGTAT 



CAATCCGAAT 
GCTTTTTGCC 
GGGATATQAT 
TGTCAATGAA 
GGTTGAGCCT 
TCAAAAATTC 
CAGCTTCAAG 
TGTCAACTAC 
GCAGGGTATT 
ACGCGACCAG 
CXTCTCTGTA 
GCATTCCTTC 
TAAGCTGCTC 
CGGCAACTCC 
GGCCGACCGT 
GGATGGTCX5C 
TAATGATAAA 
TGCGGATAAG 
GGCTGTTTAC 
GTATTCATTC 
TGATGAGAAC 
SAATCAAGAT 
CCGCTTGAAA 
TATTGTGCAT 



GTCGG-rACGA 
AATCCGGGCT 
TGTG CCGAT T 
TCAGGTTTTC 
TTCAATGGCG 
AAGGGCTACC 
GGTATGATTT 
TCGGGCGCCA 
GTCTTCCCGC 
TGGCGTAAGC 
CCTAAGAATA 
TTGGTGAAGA 
AAGAACACGA 
GGATOGGCGA 
TTGTGGATCT 
AAGCTCACTG 
TTCCAGGTTG 
GATGGTCAAT 
CTGCACGATC 
GATGCCAASC 
GTAGATGATT 
GCTCTTGACT 
GTATTAGCTG 
CTTCAAAATG 



TCGTCGTGAA 
ATAATCAAAA 
GTGGTATGTA 
CAATTAATGA 
ATAAGTGGCG 
AGATCACGAA 
GTGTGTGCGA 
ACTTGATCGG 
CGGAAGTCGA 
TTAATGGAAG 
CAGCGGGTCA 
TGCAGAACGG 
CTGTAAACAA 
ATATGCCGTC 
TTACCGATGG 
AAAAAGGTTT 
CAGGGGTTCC 
AC3VCGTTGGA 
TTGAGTCAGG 
GGGGAGATTC 
CGCATGTCGT 
GCACTGTAAC 
GTCATAGAGA 
CTTTCACTAA 



GTCGTCGAAA 
TGTAGGGGGG 
TCGCAGGAGC 
TGTGGGCGGA 
GCCAGCACCT 
TGACGTGCAG 
TGCCTTCCTG 
CAACTCATAC 
GCAGACGGTG 
CACGGTTTCA 
GGACAATCrr 
AGOGTCTTGT 
CGGTAATGGT 
ACTTGTGATC 
GGGTCTTTCT 
GTCACAACTT 
GGAGTTGAAT 
GTTTGC-rcn* 
AGCCAAACAC 
CGGGGCTCCT 
GAGTACAAAT 
Crn-AITCACA 
AGTC7VTGAAA 
TGATGTGCAT 



2280 
2340 
240O 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
312C 
3180 
3240 
3300 
3300 
3420 
3460 
3540 
3600 
3660 
3678 



(2) INFORIIATION FOR SEQ ID NO: 221 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 3675 base pairs 
IB} TYPE: nucleic acid 
(CJ STRANDEDWESS: double 
(D) TOPOLOGY: circular 

35 

(ii) 110LECULE TYPE: DMA (genomic) 

(iil) HYPOTHETICAL: NO 

40 (iv) AHTI-SEHSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISH; PORYPHYROWaTAS GIWGIVALIS 

45 (ix» FEATURE: 

(A) MA/1E/KEY: misc_f eaturfl 

(B) LOCATIQI^ 1...3675 



50 



55 



65 



70 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO:221 



ATCAAACGAT ATACAATAAT TCTTGCAGTT TTTCTTTTAT TCTGCACGGT ATTTACCTPT 60 

CAAATAAAAC CTCGCCCTTA TGAAAGATTT GCAGATGTAG AGAAGCCTTG GATTCAGAAA 120 

CATTCAATGG ATTCTAAATT GGTGCCTGCA AATAAGGGTA ACTTAATTCA AGCTGAAATT 180 

GTATACCAAT CTGTTTCTGA ACATAGTGAC TTAGl'TATTT CACCTGTGAA CGAAATAAGG 240 

CCTGCAAATC GTTTCCCTTC GCATAGCAAG TCTTTTTTTG CAGAAAATCT ACGGGCATCT 300 

CCCCCCGTAG TTCCCGTTGC CGTCGACAAG TATGCGGTAC CGGTTGCCAA TCCAAT6GAT 360 

CCTGAAAATC CCAATGCCTG GGATGTGACG CTAAAAATCA CTACTAAAGC GGTAACAGTA 420 

CCTGTCGATG TGGTGATGGT TATCGACCAG TCTTCGTCAA TGGGAGGGCA AAACATTGCC 480 

AGATTAAAGT CTGCCATTGC ATCGGGACAG CGTTTTGTGA AAAAAATGTT GCCTAAGGGG 54 0 

bU ACGGCTACAG AAGGGGl'GCG TATCGCTCTT GTGAGTTATG ACCATGAGCC TCATCGCTTA 600 

TCTGATTTTA CCAAAGACAC TGCTTTTCTC TGTCAAAAAA TCCGGGCTTT GACTCCTATT 660 

TGGGGAACAC ATACCCAGGG GGGGCTTAAA ATGGCGAGAA ACATTATGGC CACTTCTACT 720 

GCTGTGGATA AGCATATCAT ATTGATGTCT GACGGGTTAG CGACGGAGCA GTATCCTGTT 780 

AAAAATGTAA CTACTGCAGA CTTCATTGGC AAAACTGGAA ATGCGAATGA TCCCATTGAT 84 0 

TTGGTTATAC AAGGAGCAAT TAATTTCCCT ACAAATTATG TTTCCAACAA TCCATCTACA 900 

CCTCTTACCC CAAATTATCC AACTCATTCT TCTAAAGTTG GACGGAGAAA TCTGCCGGAA 960 

TCCAAATTCG ATTATAGTAA TCTGAGTGCA AGGATTACTT TTGATGCTGT TGCTGGCGCA lO^O 

TTGGTCTATG 7KACCGAGGTT TCCTCATCCC TATTATTATT ATTTCCCTTG TAACGCTGCT 1000 

ATCAATGAGG CTCAGTTTGC GAAAAACTCT GGTTATACAA TCCATACTAT TGGCTATGAC 114 0 

CTGGGAGATT TTGCCTTGGC CAACAATTCG TTGAAACTAA CCGCTACAGA CGAGAATCAC 1200 

TTCTTTACGG CGACACCGGC CAATTTAGCT GCAGCGTTTG ATAATATTGC CCAAACTATT 1260 

AATATAGGTA TACAGAGGGG GGAGGTGACG GACTTTGTAG CTCCTGGTTT CATCGTrAAA 1320 

AATCTGACGC AATCGGGAGA TGTTACTCAT TTGCTAAATG TTTCAAATGG AACGGTGCAC 1380 

TATGATGTCT CTACTAAAAA ACTGACATGG ACTACTGGTA CTATCCTGAG CTCATCAGAA 14 4 0 

75 GCTACCAT.AA CTTATCGTAT TTATGCCGAT rrGGATTATA TACAGAACAA TGATATTCC3 1500 



SUBSTTTUTE SHEET (Rule 26) (RCVAU) 
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GTAAATACTA 
7VAATTGACCT 
ACGGTTAAGT 
CAACCCATAC 
TCACAAGATT 
GACAAAACGA 
ACTGCCGATG 
CAAATCGGTA 
AATTAITGGA 
ACAGGAGCCC 
GGTTCTCCGG 
CTTATGAATA 
GGCGTGGTTG 
AATCCTACGG 
GTCGAGTTTT 
CAGTATTTCG 
GAGACCGTCA 
GCACCTGATA 
CAGCCTACGG 
CTGACACGCA 
GGAGCCATCG 
CTGTTCAACA 
TATCGACCCG 
GATCGTATTC 
TTGCAHATCT 
CAGA'l'CACAT 
CTTCTTGGTA 
GGATTCGACA 
GCGATGTCTG 
CTGCTGATCG 
GATCATTTTG 
ATTACGAATT 
CGC-rrGTCAT 
CGTGAAATTA 
GAAGGTAAGC 
CAGACCGGAG 
GTGCTTCrrG 



CTTCTGCTAT 
ATACCAATTC 
TAGGTTATGG 
AGGCAAATGG 
TCTTTTTGCC 
CCGAAGCATT 
GTAAACGTTA 
TCAGTTGGAA 
TGGGAGGAAC 
AAGTACCGCT 
CGGTAGCCGA 
ATTCCGACAA 
AGGATAACAA 
GGACATTGCT 
AC7UVTCAGGG 
GTATCCCTGT 
ACCAATGGGT 
CAGAGCTTCA 
GAGTTTACAG 
CGTCCGGTGT 
ACATCAAGCA 
CGGGAACACG 
GTCAGTACCT 
CATCGATGCA 
TGTACGATAA 
GGCGATCCGG 
ACGAGTCGGC 
ACGGCTGGGA 
ACATCGGTAA 
GCTTCGATGC 
CGAAAGGGGC 
CTACGTCGTA 
ATGGATGTGA 
TAATTCTGAA 
TTCTTCCCCG 
GGGCCTATAT 
AGTAT 



CGGCCCGGAT 
CAATGGCGAA 
TGTTATTAAG 
AACAGTTGTC 
CTCAGGTGGA 
ACAGTACTAT 
TCGTTTTGTC 
AAAACCGGCA 
AACAGACC7VA 
CACAGGAGAA 
TTTGCATGTC 
GGATTTAGTT 
TCCGAATGTC 
TTTTGCCAAT 
ATATGATTGT 
C7V\TGAATCA 
TGAGCCTTTC 
AAAATTCAAG 
CTl^CAAGGGT 
CAACTACTCG 
GGGTATTGTC 
CGACCAGTGG 
CTCTGTACCT 
TTCCTTCTTG 
GCTGCTCAAG 
CAACTCCGGA 
CGACCGTTTG 
TGGTCGCAAG 
TGATAAATTC 
GGATAAGGAT 
TGTTTACCTG 
TTCATTCGAT 
TGAGAACGTA 
TCAAGATGCT 
CTTGAAAGTA 
TGTGCATCTT 



CTTGGTGGAT 
CCGAATCAGC 
CGGCACTATG 
AGTTCCCTAA 
GGTCATATTG 
TCX?GTACCGC 
GAAGTCCCAG 
GGAAACGCTT 
CAGAGTGAAT 
GATGTAGAGT 
CCGACAACCA 
GTTACCACAA 
GGTACGATCG 
CCGGGCTATA 
GCCGATTGTG 
GGTTTTCCAA 
AATGGCGATA 
GGCTACCAGA 
ATGATTTGTG 
GGCGCCAACT 
TTCCCGCXrGG 
CGTAAGCTTA 
AAGAATACAG 
GTGAAGATGC 
AACACGACTG 
TCGGCGAATA 
TGGATCTTTA 
CTGACl'GAAA 
CAGGTTGCAG 
GGTCAATACA 
CACGATCTTC 
GCCAAGCGGG 
GATGATTCGC 
CTTGACTGCA 
TTAGCTGGTC 
CAAAATGCTT 



TCGATACCAA 
AGTTAATTTT 
TATTGGTAAA 
GCGAGGCTCA 
TTCrCAAATG 
CGACTAACAC 
GCTCCACGCC 
ACTTCGCTTA 
CGCATGTGAC 
TTGCAACGAC 
ACCCCAAAAT 
GCAGTCAATT 
TCGTGAAGTC 
ATCAAAATGT 
GTATGTATCG 
TTAATGATGT 
AGTGGCGGCC 
TCACGAATGA 
TGTGCGATGC 
TGATCGGCAA 
AAGTCGAGCA 
ATGGAAGCAC 
CGGGTCAGGA 
AGAACGGAGC 
TAAACAACGG 
TGCCGTCACT 
CCGATGGGGG 
AAGGTTTGTC 
GGGTTCCGGA 
CGTTGGAGTT 
AGTCAGGAGC 
GAGATTCCGG 
ATGTCGTGAG 
CTGTAACCTT 
ATAGAGAAGT 
TCACl'AATGA 



TACTGAGGCA 
CCCACGTCCG 
TAAAGACGGT 
TGTTCTAGAG 
GATAAAGTTG 
GGTCATCACT 
GAATCCGGGC 
CAAGCTCCTC 
GTCCAATTGG 
AGAAAATTTC 
TATCGGTAAC 
GACGATCAAC 
GTCGAAAGAC 
AGGGGGGACC 
CAGGAGCTGG 
GGGCGGAAAC 
AGCACCTTAT 
CGTGCAGGCA 
CTTCCTGAAT 
CTCATACACT 
GACGGTGTAT 
GGTTTCAGGC 
CAATCTTCCG 
GTCTTGTACG 
TAATGGTACG 
TGTGATGGAT 
TCTTTCTTTC 
ACAACTTTAT 
CI'TGAATAAC 
TGCTCTTTCG 
CAAACACCGT 
GGCTCGTTTC 
TACAAATGGC 
ATTCACAATA 
CATGAAAGTG 
TGTGCATAAG 



1560 

1620 

1680 

1740 

]80Q 

1860 

1920 

1980 

20-40 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3400 

3S40 

3600 

3660 

3675 



(2) INFORMATION FOR SEQ TD NO: 2 22 

{i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 127 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAhiDEDNESS: double 

(D) TOPOLOGY: circular 

(111 MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(Iv) A^JTI-SEIISE: NO 

(vi) ORIGINAL SOURCE: 

{A» ORGANISM: PORY PHYROMONAS GIHGIVTO^IS 

tlx) FEATURE: 

(A) NAUE/KEY: mlsc_ireat.ure 
{B) LOCATia4 1. . .1275 



(xij SEQUENCE DESCRTPTIOH: SEQ ID MO: 222 



ATGGAAGTGA 
CTCTTCCACT 
AAAGGATTCG 
TCGCGTACGA 
TCGGTCAATG 
GCCTTCTTCG 
TGGGGGCCTG 
TTTCTGGCAC 
TTCAACGAAT 
CCTGTGGATC 
AATATCCTCT 
GGTGCGCGAT 
TGGAACAAGA 



AGAAAAACAC 
CCTCACGGCT 
TGGATACCTA 
GAGTGAGAGG 
CCACCTACAA 
AGCATCAGGA 
CCGACGGTGT 
AGGATTACGA 
CGATGAAAGT 
CTCGCAATCC 
GGAAAGAAGA 
GGAGCACTAC 
TGCCCGTCAT 



AGTGGTGCTA 
GTGGGGACAG 
CCATGCCGTA 
TGAGCTGGAG 
TGCTCTACTG 
AGAGCATTGG 
GCGCATCACG 
TGATATTCGT 
GGAAGTCGTG 
TTGGAATATC 
AGCCGGCAAA 
GCTCTCCGGT 
CGAAGTACAG 



CGCCTTCTGA 
GAAGGGGAGG 
CGCAGCTCTT 
AGGTCGTTCG 
AAAGACGAGA 
GGGTTGCGCC 
GATCTGATCT 
ATGCCGGTCA 
GTACTGCCTG 
TTCTCCCTTT 
CCGGCCTTCA 
ATCGACTTCG 
GGCATTGTGC 



TTTGGTTCGT 
GGAGTGCCCG 
CTCCTTTTGA 
GTAATTCGAA 
CCGGCTTACG 
TCGGACGACA 
CCCCGATGGA 
ATGCATTGCG 
TATTCGAGGG 
CGCCCATTGC 
AGGTTGCCAA 
CTTTGGCTGC 
CGACGGAAAT 



GGCCATTCTT 
ATACAGATTC 
TTTCATGAGC 
AGTAGCCGTA 
TTTACGTGAA 
GATTGTCATT 
TATGACCGAG 
TTTCTCTGTC 
GTACCGTCTG 
TCAGGGGATG 
TATCGAGTAC 
ATTGCATACA 
CATCGTTAGC 



60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
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CCrCGCTAlT 
GTTTTCAGGG 
GAGCAAGAGG 
GAATGGATGA 
TCCCAAAGAC 
ACACTCCAAC 
TTTGCAGCTG 
AGTAGTAAGG 
GCCCGCTACA 



ATCGTATGGG 
GAGAGGCTGC 
GTTTCCAAAC 
TCTCAGGACA 
AACATTCTAC 
TTTCGGACTT 
ACTATGCCTT 
GCAGCGGTAT 
GCTTC 



ATTTGTCGGC 
GTTCAATATC 
AATCAATTGG 
ATTCTCAATG 
CCTGATTACT 
CACCTACTAC 
GAACGATCAG 
ATTCGATCGC 



GGCGACCTCT 
GACAAACACT 
TTGGCCGGAG 
GAAAGCATAT 
CTCAATGTTT 
GACCTTACGG 
ATACATCTGA 
TACAAAGACA 



CCGTACCCGT 
TCACCTATAA 
CCGATTGGTA 
TCAGGTATAG 
CCAAGTVWrr 
GCAAAGGATG 
TGGCCGGATA 
ATTCCGAACT 



CGGACAGTTT 
GAGTCATGCC 
TGCTCCCGGT 
GGATTTCATC 
CTTCGGCAGT 
GTTCAGTCGC 
TGACTGGTTC 
CTGGrrCAAA 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1275 



(2) INFORMATIOM FOR SEQ ID NO: 223 

(IJ SEQUENCE CHARACTERISTICS; 
15 <AJ LENGTH: 1212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

20 (11) MOLECULE TVPE: Dt^A (genomic) 

(111) HYPOTHETICAL: NO 



25 



30 



35 



40 



45 



50 



55 



(iv) AUTT -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRa^Ot-fAS GIMGIVALIS 

(i?:) FEATURE: 

(A) NAME/ KEY: misc_f eature 

(B) LOCATIOt-I 1 . . , 1212 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:223 



ATGTCCTCCT 
GAACGGAATC 
TTAGCTACrA 
TATGCCATCG 
ATACTCCTGA 
AAGAATCCGC 
TTGTCACCGT 
AAGAAGAAAT 
GAGGCAGAGC 
GCCTGCGAAA 
CGTAAGATGC 
TCAGAAGACA 
GGCGATGAAT 
CGCATAGATG 
GATGAGTTCG 
GGCGAGATTA 
AGTTATCfTT 
GAAAATGCTT 
ATCAAGCAGG 
GTGACCCAAA 
TGGGAGGTCG 



GTGAGCTCCC 
ACTCTTCCGA 
TTCTGATCGG 
AGCAGACATT- 
CCACTGTTCT 
TGCAATACTC 
TTTCAAAATT 
ACGATATGTC 
CGGAGGAGAA 
TCATGGTTCC 
TTGACTTCGT 
ATATCAAAGG 
TCGACTGGCA 
ATTTGCTCGA 
GTGGCACTTG 
CGGACGAGTA 
TCGAAGGAAA 
TCGGTGAATT 
AACTCCCCCA 
TGGACAAGCG 
AA 



TTATTTTTCA 
CATCGCGCTT 
GAATAATGTG 
CGTTTTCTCT 
TTTGCTGTTC 
GCGCTTTTCT 
GCTGGTCAAA 
CGTGGATGAG 
AGAAATGATT 
GCGTATCGAT 
TGTTTCGTCG 
GGTGATTTAC 
TCCTCTGATT 
GGAGTTCAGA 
CGGACTGATC 
CGATGAGGAA 
AACGTCTCTC 
GGGGGACGAG 
TGTGGGCGAT 
CCGAATCATC 



Cl'AAAGCCGA 
TCCAATTTAT 
ATTAATGTAG 
TCTCCGATCA 
GGAGAGATTC 
GCTGCAGCTA 
ACTACCGGCA 
CTCTCGAAAG 
AACGAAATCA 
ATTGTGGATG 
GGTTATTCCA 
ATCAAAGATC 
CGTAAAGCAT 
GCCAATAAGG 
ACAATGGAGG 
GAACTCCCCT 
TCCGATGTTC 
GTAGATACGC 
ACAGCA6TGT 
GAAATCAAGA 



TCGATCTGCA 
TAGACAATTC 
CCATCGTTAT 
TTGGATTTCT 
TGCCGAAAGT 
TGTCCGTTAT 
TCGTTACCAG 
CGGTAGCCCT 
TCAAATTCTA 
TGGATCTGAG 
GACTTCCCGT 
TAATCCCAGA 
ATTTTGTCCC 
TGCATGTCTC 
ACATATTGGA 
TTAAGGTTTT 
GACACTATCT 
TAAGTGGGCT 
ACGAGCCATT 
TTTTCCCTTT 



GAACATCCGC 
GAATCAGCTA 
CCTTTCCAAT 
GATCCAGACG 
GTATGCGCGG 
CTATAAGATA 
AGGTATCAGC 
CACCACTACG 
TAATAAGACA 
CTGGCCATTT 
TTCAGAGGGG 
CATGGATAAA 
CGAAAACAAG 
CATCGTTGTG 
AGAGATCGTC 
GGGGGATGGC 
TGACCTTCCG 
CTTCTTGGAA 
CCGCTTTCAA 
CGAGCGCACT 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1212 



60 



65 



70 



75 



(2) INFORilATIOtJ FOR SEQ ID NO: 224 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 780 base pairs 
(Bl TYPE: nucleic acid 

(C) STRANDEDNESS : doubls 

(D) TOPOLOGY! circular 

(ii) MOLECULE TYPE: DMA (genomic) 

(iii) IIYPOIHETICAL: MO 

(ivi AIITI-SEUSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISri: PORYPHYROMONAS GINGIVALIS 

(Ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(A) NAUE/KEY: misc featur* 

(B) LOCATIOW 1, . .780 



txi» SEQUEMCF. DESCRIPTION: SEQ ID NO: 224 

ATGAAACTAT TACTTTATCT CCTATTGGTC TTGTCGACTC TATCCCCGAT GTATTCGCAA 60 

ATGCTCTTCT CAGAGAATCT CACAATGAAT ATAGACAGCA CCAAAACCAT ACAAGGAACG 120 

ATATTGCCCG TACTGGATTT CAAAACCGAA AAGGAAAATG TGTTCACCTT CAAAAATACT 180 

S^Syi?!^ ATCTGCTGAT AAAGCACGGT CAAGTAATCA ACTTAATTAA TAAGCTTGAG 24 0 

TTTTCTACCT ATGGCAATAA ACTAACCGTA AG^l'GGAGGAT ATGTACACAC CGAATACCGC 300 

TA'J-TTGTTGC ATCATGTTTT TGAGGTTTAT CCTTATCTCG AGTCGCAATG GGCAGAAAGT 360 

ACAGGAATGA AATAl'AAGGT TTCTACGGGA TTACAGTCGC GTTATCGGCT GGTAAATAGT 420 

S^r^^^SI. TCATGTTTGC AACATTGGGG GTATTTTTCG AATTCGAAAA GTGGGAACAG 480 

CCAGCCACTA GCCTCTTTGC AGGAACGTAT GCAYACAGCC GAAGTATCAA AAGCCACCTG 540 

TCTATCAGTT TCAGACATCG GTTGGGTGAA CATTGGGAAT TTACAACTAC GGCTATTCAC 600 

CACGGAAAGC CTGACAGTTA TTTTAAGAAG GCACGTTTTG GAGGAGCTAT CGACCTCAAA 660 

TACCATATCA CACCTACGAT AGGAATACGC GGGGCCTATC GGATCATCTA CGATACTG^C 720 

CCTATTGTAC CTGTGCGGAA AGATTACAAC ACCGTTGATG TTGGTATCGA TAITTCGTTT 7 80 



(2) IliFORIIATiai FOR SEQ ID 110:225 

U) SEQUEUJCE CHARACTERISTICS: 

I A) LE1:IGTH: 2502 base paire 

(B) TrPF.: nucleic acid 

(C) STRAllDEDt^ESS: double 

(D) TOPOLOGY: circular 

(il) HOT.F.CULE TYPE: DtlA (genomic) 

(iii» HYPOTHETICAL: NO 

(iv) AHTI- SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOIIAS GINGIVALIS 

(ix) FEATURE: 

(A) HAME/KEY: misc feature 

( B ) LOCATIOW 1 . . . 2502 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225 

ATGAAACGAA TCGTTTTATC ATCTTTCCTG TTCCTTTCTGT CCATACTTTC TTTGATGGCA 60 
CAGAACAATA CCCTCGATGT ACACATATCC GGTACGATCA AGGATGCCTC CTCCGGCGAA 120 
CCAGTGCCCT ATGCCACTGT AAGCATCCGG CTGACAGGAG CAGATACCAC ACAGGTGTTC I 80 



300 
360 
420 
480 



CGACAAGTGA CTGACGGCAA CGGCTACTTC GTCATAGGCC TGCCGGCAGC TCC^CCTAT 24 0 
CACCTGACAG C1TCGTTCGT AGGTATGAAA ACCCATACCA TGCACATTAG TCGGGGAAAT 
GGACAGCACG ACATCAAATC CATCGACATT TCTCTCGAAT CCGAGGACAA ACAACTCTr^C 
ACCGTCACCG TATCGGCAGC ACGACCACTG GTGAAGATGG AGATAGACCG CCTGTCCTAT 

AATATGAAAG ATGACCCCGC AGCCAAGACG AACAACCTGC TCGAAATGCT GCGCAACGTT 4eo 

nSIIT^^ CGGTGGATGG TCAGGGCAAT ATCCAGGTGA AAGGATCTTC CAACTTCAAA SHO 

^l^^^^ ATGGCAGGCC CTCGACCATG GT'SAGCAGCA ACCCGAAGGA GGTCTTTCGC 600 

CCCATACGAT CAAACGGGTG GAGGTCATCA CCGATCCGGG TGTAAAGTAC 660 

GATGCG'^G GCACAAGTGC CATCCTGGAC ATCGTCACGG AAGAAGGTAA GAAGCTGGAA 720 

t^J^T'^^ GTTCCATCAC GGCCAGTGTC ACCAACAATC CCACAGCCAA CGGTAGTATC 7 80 

in^^f^'^ CAAAGTCCGG CAAAGTCGGG CTGACTACCA ACTATAACTA CTACGGTGGC 840 

CCTCTCGCTA CTTTACCGAA CGTACTACAT CCATGCTCCA AACGATAGAA 900 

GGCAAGAAAC CTTTGGCGGA CACTTCGGCA ATGCCCTCCT CTCATTCGAG 960 

Jlii^IT?^ TCAATCTCTT TACGGTGGCC GGCAATGTAC GCCTTTGGGA GATGACCACC 1020 

^kSS^Z^^^ GCGTAGAAAA AAGCTTTGCC GGCAGCAACC TCATGTCCTA CATAGACAGA 1080 

CACAGATGGA TGCCGGATCA TACGAGCTCA ATGCCGACTA TCAGCACAGC 114 0 

^S^^^f^^ CGGGCGAATT GCTCACCGTT TCCTACCGCl- TCACTCACAA TCCTAATAAT 1200 

ACCGA^^ACCT TCATTGACCA ATGGAAGCGC GATCCGCTCA ACACAGCTAA TACGATCCAG 1260 

TACGCCGGCC AGCACl'CCAA ATCCGATGCG GGCATGGACG AACATACCGC ACAAGTGGAC 1320 

VtlnJ^":^!^ CCTTAGGACA AGCACATTCT TTGGAAGCAG GGCTGAAGTA CATCTATCGT 138C 

GCGATCCTCT CTATGAGATA CGACCATCCG AAGATGCTCC GTGGCAGCCC 14 40 

^flS^i^Z ATGCACAGAA TCCGTCGAAC GGAAAGrrCC GCCACGATCA ATACATCGGA 1500 

b^J^GCCTATG CCGGCTACAA CTATCGTAAG GATCAGTATT CTTTGCAAAC CGGCCTCCGA 1560 

SS^S? ^A<^<=TGAA AGCACTCTTT CCCGAAAACG CAGCAGCAGA TTTCTCCCAC l620 

ACTGGGTGCC ACAGCTCACG CTCGGOTATA CCCCCTCGCC CATGAAGCAG 1600 

CTTAAGCTGG CCTATAACTT CCGAATCCAA CGTCCTGCAA TCGGCCAACT GAATCCCTAC 1740 

cggctacaga ccaacgatta tcaagtacag tatggtaatc ccgacctaaa ctc^aS^g l loo 

^J^'^'^ TCGGTCTCTC CTATAATCAA TACGGAGCCA AGGTCATGCr TACAGCATCG 1860 

CTCGACTACG AcrrCTGCAA CAACGCCATC CAGAATTACA CCTTTCTCCGA CCCGGCCAAT I9I0 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



li'^Hi ii'=«? tP"»i .a iF=Ji .,P= 

^uji ''^fn 'rnii c:ij ^„ o oi 
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CCCAATCTCT 
ACCTATGCCA 
CGCACATTCC 
TACTCAGGCC 
TATCATCGGG 
ATAGCCAAAC 
GCGAAGTATT 
AATGCCGCTTA 
ACACAAGTCWT 
CAAGGACAGC 



TCCACCAGAC 
TGTACACGCC 
AAAAGAGCGA 
rGATGTTCAC 
GAAGAAGCTA 
AGCTTTTCCA 
CGACATGGAA 
TACAACGGAG 
GCAAGGTAGA 
AGGGTGGC6G 



CTATGOCAAT 
GGCCGTATGG 
AGCACTCGGC 
CCTGCCGAAG 
CCAGACGAAG 
CAAAAAATTG 
GAGCCGGACC 
TGTTTCCCTC 
GCGTACGATC 
ACAAGGAAAT 



ATCGGACGAG 
GTCAGGATTA 
ATTGATGTCA 
GATTGGACTG 
TATGATGGCA 
AGAGTCTCGC 
ATCGGCAATG 
AGCCTCACCT 
GTCAACGACG 
CCTACCGGCA 



AGCATTCTTT 
TGCTCAACGG 
ATTCATGGTC 
TGAATCTCTT 
ATGTATTCAA 
TGAGCGCAAA 
GATTTACTAT 
ACAGCTTCGG 
ACCTCAAGCA 
AT 



CAGCTTGAAT 
AAATATCGAT 
CGGCATGGTA 
CGGAGGTTAT 
CAATATCGGT 
CAACATTCAT 
TTATTCGGAA 
TAAGATGAAT 
AACCTCATCC 



(2) INFORMATION FOR StQ ID MO: 226 

(i) sequence: CHARACTERISTICS: 

(A) LENGTH: 1197 base pairs 
(B} TYPE: nucleic acid 

(C) STRAMt5EDNESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(iii) H^iPOTHETICAL: HO 

(iv) ANT I -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISIl: PORYPHYROMONAS GTNGIVALIS 

(Ix) FEATURE: 

(A) NAllE/KEY: mlsc_feature5 
(D) LOCATION 1...1197 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226 



ATGAGACTCT 
CAGCATTATT 
CTCTCGGCAT 
GTCAAAGGTT 
GCAGCACATG 
CGCTATCCGG 
GTACGCTTGC 
CTCATGCTGG 
CCTGAGTTGG 
CGTTTCCGAA 
CAGGAGTCGT 
TGGCGACTCG 
GCGCAGCAGG 
CGCCCTCGTA 
TCAAGCGGAG 
GACTTGGAGC 
TTTGCTGCAC 
GGCGATTATA 
GCGGCTGTTG 
GAACTGACGA 



CTGCCATTCT 
ATTCCATGGC 
CGATCCGAAG 
ATACGTTGCC 
GTGTGAAGCT 
CCGGTATCGC 
GTATCCTGCC 
GCAATATAGC 
ATTTGACGGC 
TGGATGTTTG 
TTGTCTTTGG 
AACTGCCCTT 
ATACCGTGCA 
CCGACAAACC 
GATACTTCCC 
ACTTCGCTTT 
CTTTCGCCAA 
TTCGTCTCTA 
CTCGGGTATG 
TGCGTATCGA 



TATCGCTTTG 
GGGAGAGCGA 
TGCGCTTTTC 
GGGTGCACGG 
TTCGCTCGGA 
TTATTCCGAT 
TTATGTACAG 
CGGTGGTACG 
TGATCCTGAA 
GGTCAATTGG 
CTTGTCCACT 
GCAGGCTATT 
TACATGGGTC 
CATGCAGATT 
TTACGAAAGA 
TCGTACCGAC 
TTCCCTGACG 
TGCCGACTAT 
GTTCCAGCCT 
TCCCAAATTC 



ATTGTGATGC 
CTGGAGACGG 
TTTCGGAACA 
GTTTCCGCTT 
GTATCrACCC 
TTACCTTATT 
GCCATGCTGA 
GCTCACGGAC 
GCCGGTGTGC 
ATGAGCATGA 
ACTTCGAAAT 
GCCACGCATC 
AATGGAGCTG 
TGGGGATCTG 
GGGTGGGGCG 
TATTGGTACG 
TATGACAAAC 
TCGTGGCGGA 
TCGGATCGTT 
CCAATAGCTT 



TGCCTGCTGT 
ACAGCATTCG 
ATGAATACAA 
TTGCCTCTTA 
TGAACTACTG 
GGACGGACTA 
AGCCGACGGC 
TGATCGAACC 
AATTTCGGGG 
TTTTCAAAAA 
TGTTATCGGC 
GCGGCGGGGA 
TCGGACTTAA 
CTTATGGTGT 
GTTATCTTTC 
GCAGGCATTA 
AGCCTCTTAC 
TGGCACGAAG 
TTGCGATGAG 
TTCTGAAAGG 



GCTTAGCGGG 

TCCGAACGAA 

TGCACGTTCG 

CTCGCTGCCG 

GGGGGCAAGT 

TAACGACTAT 

CACGACTGCT 

GATCTACAAT 

TGATTGGACA 

TGACAATCAT 

TGAAGGCAAA 

ATACAACTGG 

GCTTTCGTAT 

GGCAGCCTTG 

TCTCGGAATC 

CGTTTCTCCC 

GAACGGTTGG 

TGTTTCGTTG 

CCACGCCTTG 

CAATCAT 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2502 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1197 



60 



65 



70 



75 



(2) INFORHATION FOR SEQ ID NO: 227 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1146 base pairs 
(D) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(lii) HYPOTHETICAL: NO 

(Iv) AJITI-SEJ'ISE: NO 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROIiONAS GIHGIVALIS 

(ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



it=Hi ipf^' if"?h uii =™ti tP^it ,ji*= -iF^ii ^-^is iP^}- Jpan' 

■iLji :™ri ic:: o 4ii cas icn -uinii lur lUi 
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156/ 490 



(A J IJAIIE/KEY: misc feature 
(B) IX»CATIOM 1...1146 

(xi) SEQ0E14CE DESCRIPTION: SEQ ID 110:227 

ATGAACAAAT CGCTATTATC ATTGGCATGC CTCATCCTCr GCGGTATGCC GGCCATCGCC 60 

CAACAGACAG GACCGGCCGA ACGCAGCGGC GAGCCTTCTC TGCCCGAACG TGTATTCGGT 120 

CTGGAGCAGA AGCAGAAAAA GCTGAAGGTG TACTTAGGCA TACAGTCGTT CTACGACCAG 180 

CCGCTTGTCC ATGACGAATC CCATATCGGA CACTTCAAGG TACAGGAGCT GCGGATGTCT 24 0 

GCTCATGGCG A.^CTGAACCG CCACCTCAGC TTCGACTGGC GACAACGTCT CAACCGTGCC 300 

GCCGACGGCA CTTCGTTTGC CGACAATCTC TCCAATGCXIA TCGACATCGC AGGTGTGGAC 360 

TGGCACCCGA ACGACAAGGT GTCTTTCTTC TTCGGACGTC AGTACGCGCG TTTCGGAGGG 420 

ATAGAATACG ACATGAACCC CGTAGAGATC TACCAGTACA GCGACCTTGT GGATTACATG 4 80 

ACCTGCTATA CTTCGGGCGT GAACn-CGCA TGGAACTTCC ACCCCGAACA GCAGCTGCAG 54 0 

CTACAGGTAC TCAATGCTTA CAACAACCGC TTCGCCGACC GCTACCACGT GACACCCGAT 600 

GTCGCTACCG CCACGAGCTA CCCGCTCCTC TACTCX3GCAC ACTGGAACGG TACCCTCCTC 660 

GGAGGAGCAC TGCATATGCG ITACGCCGTG TCGATGGCTC ATCAGGCCCA AGAGCGTAAT 720 

ATGTGGTACT TCACTGCGGG CAACCTGa-TC AATCCGGGCA AACGGATCTVA CGGATACCTC 780 

GACCTCACCT ACTCGATCGA GGGATTGGAC GACAAAGGCA TTATGACTGC TCGCTACGGC 84 0 

AAGGGCAAGA CCCTCACGGA CGTCAAGTAC TATGCTCTGG TATCGAAGTG GAACTTCCGC oqo 

ATTTTCGATC AGGTCAATCT CTTCCTCAAA GGCATGTACG AGAACGGCTA TGCGCCTGCC 960 

CAATACGGCG AGACCAGCCA CACGCGCCAC TCCTACGGCT ATATGGGAGG GGTGGAATAT 1020 

TACCCTACGG AGACCAACTT CCGTCTGTTC GTCACCTACA TAGGACGGCA TTACCGGTAC 1080 

AGTGCGACCG AGACG-5AAAG CACCAATGCT CTTCGCGCCG GTCTGATCTA TCAGATACCT 114 0 

TTCTTA ^^^^ 



(2) IliFORJ lATlOH FOR SEQ ID NO: 228 



(1) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 666 base pairs 
(B» TYPE: nucleic acid 

(C) STRAWDEDHESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 



(ivj AI^TI-SEMSE: NO 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: POKYPHYROMOHAS GIHGIVALIS 

(ix) FEATURE : 

(A) HAIIE/KE-f: misc feature 

(B) LOCATIOIJ I... 666 

(xl) SEQUENCE DESCRIPTION: SEQ ID HO: 228 

ATGAAACATT TGTTTAAGTC GACATTAGTA errCTTTGTG CTCTTTCTTT TTCCGGTACC 60 

TATACCTTTG CACAAGAAAA TAATACAGAA AAGTCACGAT TTGATTTTTC TGTTAGGCTG 120 

GGACAGGGAT ATATTGCAGG TTCAACTACC AACCTGATGT ATGGGTATAC ATCTGCTAAC 180 

GATAGACTTT TGTCTGGTGC AATTTATCTG GGCTTGACAC CAAGTAAGAA AGAAAATGCA 24 0 

ACCGGCGTAG CATTTCGTTT CTTATCfJCCC TCT'CCGGGTT ATTATGTCGA TATATCCGGC 300 

AAAGAAAATA CCTTGAATTA TGCGTTTTAC GTTGTCGGAG CATATAATAG AATAGCCATT 360 

CCTATACGCC CTATCAAAAA TTTTAATTTC ATCTTCTCTA CAGAACTCGG AATGGCTTGG 4 20 

ATCAGTCGTC ATGAGCAAAT TTACAATTCT ACTTCGCAGA CTTGGGATAA GCAGCGCAAG 4 80 

TCGAGGTCGG GACTGGATTT TGGTCTCGGG ATGCATCTGC AATUCCACAT TAATAAGACC 54 0 

GTTTACTTTA T'SGCAGGAAC CGATCTTACG TCTrGCATGT TCGGAAAAAG GATCAATGAC 600 

TACCAGCAAA AGGATCGAAC CTTCATTGCA CTTATCGACA ACAGTATTGG CATAGGATTA 660 

'^^^ 666 



(21 IHFOWWIOH FOR SEQ ID NO: 229 

ii) SEQUENCE CHARACTERISTICS: 

(A» LEMGTH: 1173 bas- pairs 
(B) TYPE: nuclslc acid 
(C| STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



iP^i :;r=ii -H™ -^P^ ™Hi -tTt^b 

u ■'^-n nrji C3> «iL 5::: c:ii cai 
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(iv) Atm-SFMSZ: NO 



10 



15 



20 



25 



30 



(vi) ORIGINAL SOURCC: 

(A) ORGAWISM: PORYPHYROMOt^AS GlflGIVALIS 

(ixl FEATURE; 

tA) NAHE/KEY: mi ec_ feature 

(B) LOCATION 1...1173 

(xi) SEQUEJ^CE DESCRIPTION: SEQ ID NO: 229 



ATGATTATCA 
GCTCCAAAGG 
CTGAATATCA 
CCGGATGCGT 
GGTATTAGCT 
CAGATGACCG 
CGTTATTTCA 
GCCCATCCCA 
TCCATGGCTG 
AATCAGGCCG 
CTGGGTAATG 
ATCTCCTATG 
GGGCTGTTGT 
CTGCTTGTAC 
CTCAAGAAAT 
CCGCGAGGAC 
TATGACGATA 
TTGCAGTACT 
TACCTGTTGT 
TTCGATATGG 



AGAAAATGCT 
CTATGAAGGC 
GTCCGGATGC 
ATTCACAGTA 
TCTCATATAC 
GTTTCTACAA 
CATTAGGAAA 
ATGAATTl'GC 
TTGCACTGCG 
GAAATGCCTT 
CGGAGAGCTT 
ACGGAGGTGT 
ATCCGATCGA 
CCACTCCTCC 
ATCAGGAAAC 
TCAAGGAAGA 
AATTTTTTGT 
TCACGGCCGG 
CTACGATCCA 
ATGGATTGCG 



GAAAAATAAA 
TCAGGAGCAA 
ACGTGCGGCC 
TTGGAATCCG 
ACCCl-GGCTG 
ATTGGGAACA 
GTTG6AGACT 
TGTCGATTTG 
TTACATCCGC 
TGCGGCGGAT 
GTGGTCGTTG 
CACGAGTTTT 
TGACTATAAC 
TATCATGGAT 
TTCTTCGATC 
ATTCCGTGAG 
TCGTGCCGGA 
TGCCGGCTTC 
GAGTAATCCG 
CAATTTGTTC 



TTGGCCCCCT 
CTGAATGTGG 
GCTATGGGGG 
AGTAAATATG 
TCCAAGCTGG 
GACGAGAATC 
TTCGACGAAT 
GGCTATAGCC 
TCAGACCAAA 
ATAGCCGGTT 
GGTTTCAACG 
TTCATCCCTA 
AGCATCAATT 
CAAAACGATC 
AGCGGTATAT 
ATTACATGGG 
TATTCATACC 
AAAATGAACA 
TTGGATCAGA 
CAC 



TGGCCATACT 
TACACACCTC 
ATATAGGTGl* 
CTTTCATGGA 
TCAATGATAT 
AGGCTATTAG 
TGGGCGAATC 
GCCAGTTGTC 
GCRCTCACAA 
ATTTGCAGAA 
TAAAGAATAT 
CTTCGTTGAA 
TCAACCTTGA 
AGGCCGGGTA 
TCTCTTCTTT 
GACn'GGGGC 
TGCACCCCAC 
TATTCCGTAT 
CTCTGCGGTT 



GTTCCTTTTT 
TGTGCCATCG 
GGCAACGACG 
TACGAAAGCC 
TGCCCTGATG 
TGCTTCTCTG 
CATGGGAGAG 
GGAGAACTTC 
CACCGGAGAG 
GTATGTGCTA 
CGGAACGAAG 
TCTCGGGACG 
ACTTAGCAAG 
TGACGCTGCA 
CGGTGATGCC 
TGAATATAGC 
CAAAGGCAAT 
CGATGCTTCC 
TACGCTTGCT 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 

7ao 

040 
900 
960 
1020 
1080 
1140 
1173 



(2) INFORMATION FOR SEQ ID MO: 230 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH! 1338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDCDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 
(ill) HYPOTHETICAL: NO 
(IvJ ANTI-SENSE: NO 



(vil 



ORIGINAL SOURCE: 
iA) ORGANISM: PORY PH YROMONAS GINGIVALIS 



(Ix) FEATURE: 

(A) NAME/KEY: misc_£eature 

(B) LOCATION 1. . . 1338 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230 



ATGAAAACAA 
GGCGGAAACG 
AACGTCAAGT 
GTCACAAAAA 
CGACCATTAG 
GTTGCGGATA 
GATCAAGAGT 
TCTGTGTCTA 
ATAGGTACCA 
TATGAAGACT 
TATCTGGCAT 
GAGTTCGTCG 
AATAAAGAAG 
CTTGCGATCA 
GCGATCGTTC 
ACTCCTAACG 
TTTGAGTGGT 
CCTTGGGACA 



CAGTTCAACA 
CACAGAGCTT 
ATATAATTCC 
GGATTCAGCA 
GTGATAATGA 
TTCAGACAGA 
ATCAACTT TA 
CATATGAGCA 
GGCATGACAT 
CTACACCTAT 
TGATGCATGA 
ATATCTCGGA 
GTGATATAAT 
GAAAAGCAGA 
AAAATATATG 
GAGTAGATGG 
ATGAAGATGA 
AACTTCTTTA 



AATTATTCTG 
TTGGGAAGAA 
CAATATGGGG 
GGGAGCAAAT 
ACAGTGGTGG 
TGCATATGGC 
TATAAACGAG 
TGGTATGGCT 
CTTCGCATCA 
GAGCTGTCGC 
AACCACAATG 
AAGTGAATTG 
AGCTCTTGTT 
TGAAGGCAAA 
CATGATGGAC 
TCGGACAAGG 
AATATACGGA 
TGCGAAATTT 



TGCCTGGCTT 
ATAGCTCCTC 
ATTGATTCAA 
TATACTTCTG 
AAACATGATC 
AGAGTTTATG 
CAGAACGAAT 
GTTTTTCGCT 
GGTGTAAATG 
TTTGCAGACG 
TCTACGACTA 
AGTAACTCGA 
ACTTCATATA 
TGGCAACTTG 
GACAACAAGA 
GTTTGTGTTT 
GGCCTGATAT 
GGTGGGATTA 



TAATGATGTC 
CTTTTATCAG 
AGGGAACAAT 
AGCAATTGGG 
CGTATTTTGA 
TATGTACGAC 
GGAGGTGTAT 
CTTCGACAGG 
ATrrCGAGTT 
CTACGAATAG 
TCCTTACTTA 
TTATTGCATC 
CAGGATTTAT 
TTGGCGGAGA 
TTGCTTGTGA 
CTGACGCATC 
TTSACACTTT 
TGCTCAGGAG 



AGGTGTATTG 
TAATGAGCCT 
CTATGTAACC 
TATGTACTAT 
TGACAAGATA 
TTC-ITCTCGA 
ATTCAAAACT 
GGTGACTTAT 
CAACACTATC 
TGGCACCATC 
TCAAAACGGT 
CATGTGCTCT 
GAGTGGAACC 
TATACAGAAT 
AGTCTTCGGG 
TGTCTTTGAT 
CTTCTATAGC 
TAAAGAGTCT 



60 
120 
100 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1000 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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TTTATAACCT CTTTCATTTC TCCGACAGTT GTACAAGGAG TGGATGTCTA TACTTTGGCC 11 -10 

GGGAAGATAA GGATCGAAAG TGAAACTCCG GTGTCTGAGG TGTTGCTTTT CGACCTGGCT 1200 

GGCAGGATGG TACTTCGGCA AACCATTGAT AATAAAATCT ATTCGGACAT AGATACTAAC 1260 

GGACTAAAGC GAAGCGGTAT TTACGTAGTC TCGGl'GCGGC TCTCTTCCGG ACAGGTATTC 1320 

AGTCATAAGG TGCAGGTA 1338 



10 



15 



20 



25 



(2) IHFORIIATIOM FOR SEQ ID NO: 231 

(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 924 base pairs 

(B) TYPE: nucl-lc acid 

(C) STRAIJDEDUESS: double 
iD) TOPOLOGY: circular 

(ii) HOLECL'LE TYPE: DNA (genomic) 

(jM) HYPOTHETICAL: NO 

(iv) AHTI-SEHSE: NO 

(Vi) ORIGIWAT. SOURCE: 

(A) ORGAinSH: PORYPinfRa-lOHAS GIHGIVALIS 

ii;:) EEATURE:' 

(A) HAftE/KEY: misc_feature 

(B) LOCATION 1, . . 92 4 



30 



35 



40 



45 



(xi) SEQUEMCE DESCRIPTION: SF.Q ID HO: 231 



ATGATAATCC 
GTAGGTCTTT 
GTGCCGAATA 
CCCGCTTCTT 
GCTTTGCTCA 
TTCGCGGCAG 
GTCCTTTCAT 
GGCCATGATT 
: TTGCTGAGTC 
ATACCTATGA 
' ACCACCGGTA 
■CTTAAAGGAA 
TATCCTCTCT 
GGCCCTCAGT 
GGTATGGAGC 
CCTGTATTCT 



GGTGTCTTAT 
TCTCTGCGAT 
GCAGCATGGT 
TTCAGAATAC 
ATGGATTGAA 
AGCGTCGGGA 
GGCGACCGAC 
TATTGACCGG 
AAAATCTTAC 
CGGCTGTCAA 
TCGATTATCG 
GTGCTCGATA 
ACAGCAGTAC 
ACGGTGGATC 
GCGAATACAA 
ATGGCGATAA 



CCGTCGTCCG 
GGCGCAAGAG 
GGAGCAGACC 
CCGTACACTG 
GCCTCATCGC 
TTTCGTrrCT 
CGATAGGATG 
TGTGCGCAAG 
GGCACATGTC 
TGGCCAACTG 
ACAGGTACAG 
CGAAGTGATG 
GCGCTCAGGA 
GCTTGAGCTG 
TATCTGGACT 
GAAG 



AGAACCGTCC 
AAAAAGGATA 
CCTCTTCTCT 
AAAAGGTTTA 
TCATCTTTGC 
CCCCTCTTGC 
CAT'lTTTATA 
GACTTCGGAT 
CAAGGCGGTT 
CGTTGGCAAG 
TGGAATGCTT 
GACAATGTCT 
CTCAATATGG 
AAAGTCTCCG 
CGTCGGTGGG 



TGTTCGGGTT 
GTCTCTCTAC 
CCATTGATCA 
GAGACAAACA 
AATTGAATGA 
AAACTCGCCA 
CATCGGGCAA 
GGAATGCTGG 
GGCAGCAGAA 
CCACCGAGAG 
TCGATAATAG 
TTGTCAATGG 
CTGTTCCGAT 
AGCGATTCGG 
AAACGCATTA 



GATATTCGTG 
GGTTCAGCCA 
CCCCGTCCTG 
TCTTTCCGAT 
GGAACTCAAC 
CGCTGCCGGT 
TATCGGTCTT 
TGCCGACTTC 
TTTCGGCTTT 
ATTGAGTTTT 
AACGTTCTCG 
ATTTGGCAGC 
GCATGGATTC 
CTTTGCCGTC 
CTTTGCTTAT 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
92 4 



50 



55 



60 



65 



70 



75 



(2) IHFORHATIOW POR SEQ ID NO: 232 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 5 base pairs 
(BJ TYPE; nucleic acid 
(C) STRAIIDEDtiESS: double 
tU) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DtJA (qenomic) 

(Hi) HYPO^VHETICAL: HO 

(Iv) AJITT-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAN I SI i: PORYPHYRa-IONAS CTNGIVALIS 

(ix) FEATURE: 

(A) HAHE/KEY: misc_f eature 

(B) LOCATIO^J 1. . .945 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232 

ATGAAAACGA ATAGACGATA CGCATTTGTT TTGCCGCTTC TGCTACTCAC CGGATTGTTG 
GCATGGGGGC AGGATrcrTC CCACGGTAGC AATACAGCGT TTGCAACTGA TTCTTCGAGT 
AGAGAGTTGC CCACGGAGCA GTCCGCCTAC CGCATTCATT CTGCCTATAT GGTCGGTGGT 
GGCGGAAGCA TAACGCGCGA CACCTATrTG TCACCCCTTC GTTATGGACC ATGGACACTG 



60 
120 
180 

240 
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AATTTGTTGG GA'JAGAAGAC GTTCCCTCTC AAAGCCTCCG ATTCCCGTTG GATGATCCGT 300 

ACCGG*3CATG AGCTGGATTT TGCXTCTGATG GACAATCCGG CCAATAATGC TCATTTCTAT 360 

TCCCTGCTGT ATAACGGTTC CGCl'GCGGCT CTTTACCGCC TTGGCGCTAA GCATCTGCGA 420 

GCCG CGT GGA TGGACAATCT GCGCTTGGCA TTCGGCCCGG GCTTGGAAAT CGGGCTTGGA 4 80 

GGAATTTATA GTACACGCAA CGGCAATAAT CCTGCGACAT TGAAGCTCTA CACCAATGCC 54 0 

ATCGCCCAAG CCTCGATAGG ATACTACGTC CCCTCCGAAA CTTTTCCCCT GTATTTTCGG 600 

TTGCTCtCCC AGA TCAATCT CTTCGGTATA GCCTATGGAA ATGGTTTTGG TGAGAGCTAT 660 

TACCiACiAATT TTTTGCTCAA TAACGGCATT GCAGGCTCCC TGCATTTCAC TTATCCGGGC 720 

AACTTTTACTC GGTTCACGAC ACTCATAACG GCGGATATTC CCATTCGGAA CTTCTGTACG 7fi0 

CTTCGTGTCG GTTATCGCTA TTCCCATTTG GGCTCTTCGC TTAACGCATT GGATACTCGA 940 

ATCCACAQTC ATACGGCTTT TATCGGTTTC GTCACGGAGT TTTACCGATT CCGTGGGCGC 900 

AAAGCCA'i'GA ATACCGGT03 GAGAACCAGT CT'rrACTATC ATGAT 945 



(2) IHFORI4ATIOK FOR SEQ ID HO: 233 



(1) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 955 base pairs 
(Bl TYPE: nuclsic acid 

(C) STRAHDEDHESS: double 

(D) TOPOLOGi: circular 

(ii) MOLECULE TYPE: DHA (genomic) 
( 1 i 1 ) IIYPori'HETICAL: NO 



(ivl AIITI-SENSE: UO 



(vi) ORIGIIIAL SOURCE: 

(A» ORGANISIl: P0RYPHYROtlOt4AS GIMGIVALIS 

{ ix ) FEATURE : 

(A) HAME/KEY: niisc_f eature 

(B) LOCATIOIJ 1...855 

(xi) SEQUEMCE DESCRIPTION: SEQ ID NO; 233 



ATGGTAGTGA CGCTGCTCGT AATTGTCGGT ATTGTGGTAG TAGTGCGGTA CTCCTTGCGC 60 

GTCCACGTAC ATAAGACCGG CACTGTTGTA TCGGCGGCCA TATTTGGCTT CATCCTCCTT 120 

GGAAAGACCG TTCCATGCGA TACCCGTAAC TTCTTTTCCT CCGAAAGTGA TGAACCTGAG 180 

AGCCGTGTTG CTACCGAAAT AGCCCACCTG TGCGAAATAG GATTTCAGAT CCACGCTTCC 24 0 

TCTATCCACG TAGCCGTCCG AACCGATTTT GGACAGGCGG GCATCCACTG CCCAATGGCG 300 

ACCGATGCGT CCGCTACCGA GTTTGACCGA TCGGCGGAAT GTGCCGAACG AACCTCCGCT 36C 

CAAATCGACA CGGCCATAAG GAGCCAGTCC CAAATTATCC GTACGCATAT TGACACTTGC 420 

CCCAAAAGCT CCGGCACCAT TGGTGGAAGT ACCCACACCT CGCTGCACCT GAAGGTCTTC 480 

GATGGAAGAG CCGAAGTCGG GCATATTCAC CCAAAACACG GACTGAGATT CGGAGTCGTT 540 

GAGGGGTACT CCATTGGTAG TTATGTTGAT GCGATTGGCA TCGGTGCCAC GCACGCGAAA 600 

GCCGGAATAT CCGATACCCG TACCGGCATC GCTGGTGGCT ACCACGGAGG GAGTCAGCAT 660 

CAGCAGATAG GGQATGTCAC GACCATAATT GGACTTGGAA AGT'1*CGGCCT TGCGAACGTT 720 

GGTGTAAGCG ACAGGGGTTT TCGCCGTGGC GCGAGTAGCT ACGACCTGTA CGGTCTGGAG 780 

CtGCACATTG CTAAGACTAT CTATCTCGCT 6TTGGAGACG GGTGCTTGTC CCGTCAGGCA 840 

GAAAGGCAGG ACGGC 655 



{2) IMFORHATION FOR SEQ ID HO: 234 



(i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH: 17 97 base pairs 
(E) TYPE: nucisic acid 
(C) STRAW DEC^IESS: double 
<D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DUA (genomic) 

(iii) HYPOTHETICAL: HO 



(Iv) AI4TI -SENSE: WO 

(vi) ORIGINAL SOURCE: 

(A> ORGANISM: PORYPIIYROHONAS GINGIVALIS 

(Ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION i...l7 9/ 
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(x1 ) SEQUENCE DESCRIPTIOM: SEQ ID NO:234 



10 



15 



20 



25 



30 



ATGGAAAACT 
GGTGGCGTCC 
AATACCATCA 
GTGGTTATCA 
AATCCCGAAC 
AAGGGCCAGC 
AACGAGGCTC 
GGTATGATCG 
CGCCCCATTC 
ATCAATCAAG 
GAACAACAGA 
GTCAAGAATA 
CATATCACTG 
CAGAAGATCA 
CTCAAACAGC 
AAGGTGAAGG 
GGCGTTGAGG 
CAGGACTTCC 
GAACGCAAAA 
ACTCGTTTCC 
CTTATTCGTrG 
ACGAAGAACA 
GTAATCGA3A 
AATCCTTGGG 
ATCGAAGT'jA 
ACTCCGAACC 
TTCAAGGTTA 
TTCGAAGATG 
AAAGCGGCTC 
GCCACTCTCG 



TAAAGAACAT 

ATGCTGCCGT 

AGGAAAAGGA 

ATGTAGGGTA 

TCAAAGTGGG 

TCGTCTTGTC 

TCGAAAAAGA 

TCGATGTATT 

GCGACrrACGA 

AATATAAGAA 

AGAAAGAAAT 

TTACTTCTTA 

ACCTTl'CATG 

ATGTCGTTAT 

TGATGCCTCA 

GTAAAGTTGT 

GTCTTATCCA 

TCCATGTAGG 

TGTCGCTCGG 

CTGTAGGCTC 

AGATCGAAGA 

TCAAACAOCC 

TCGAC7VAGGA 

ATGTATTCGA 

TGGACAAGGG 

ACATGGTGAA 

TTGAGTTCAA 

AGCAGAAAAT 

AGAAAGAAGC 

GAGACCTCGG 



TCAGCCCAGA 
GAGTCGTCAG 
AGTGGTAATG 
CAAATCGGAA 
AGACGAAGTG 
TCACCGCAAG 
CGAAATCGTA 
CGGTATCGAG 
TGCATTCGTT 
TGTAGTTGTT 
CATCGGCAAG 
CGGAGTATTT 
GGGTCGTGTG 
CCTCGACTTT 
TCCTTGGGAT 
GGTGATGGCA 
CGTAAGCGAA 
CGACGAAGTG 
TCTGAAGCAA 
TCGTCACCAT 
GGGCGTACAT 
CAGCGAGTTT 
AAACCGTCGT 
GACGGTATTC 
TGCTGTCGTT 
GGAAGATGGC 
TAAGGATGCC 
CGCTCAGCCr 
TGCTGCCGAA 
CGAGCTGGCC 



GAGGATTTCA 
GAGCAGGAAG 
GGTAGGGTAA 
GGTGTGGTAC 
GAAGTTTATA 
GGTCGTGCCG 
AAGGGCTATG 
GCTTTCCTCC 
GAGAAGACGA 
TCCCACAAGG 
CTCGAAAAAG 
ATCGACCTCG 
GCTCATCCGG 
GATGAAGATC 
GCTCTCGACA 
GATTACGGTG 
ATGTCATGGA 
GAAGCCGTGA 
CTCAAGCCGG 
GCTCGTGTTC 
GGCCTTATCC 
ACGGAAGTAG 
CTCAGCTTG'3 
ACl'GTAGGAT 
TCTCTGCCTT 
TCACAGGCTG 
AAGCGAATCA 
GAAGCCAATG 
GCTGCCAATC 
GCTTTGAAAG 



ACT6GGAAGA 
CTGCrrTATGA 
CTGCTATCA'V 
CTGCAACAGA 
TCGAGAATCA 
CTCGCTCTTG 
TGAAGTGTCG 
CGGGATCACA 
TGGAGTTCAA 
TGCTCATCGA 
GGCAGGTACT 
GTGGAGTGGA 
AAGAAATCGT 
GCAAGCGTAT 
GCGAGCTTAA 
CTITCGTTGA 
CACAGCACTT 
TCCTGACGCT 
ATCCTTGGGC 
GCAACTTCAC 
ATAITTCCGA 
GTGCTGATAT 
GTCAC7WACA 
CTATCCACGA 
ACX3GTGTGGA 
TACTCGAAGA 
TTGTATCrrcA 
CAGAGCGTAA 
CTGCACAGGC 
AAAAGCTTTC 



GTTTGAGGCC 
CAAAACGCrC 
CAAGCGTGAA 
ATTCCGCTAC 
GGAAGATAAG 
GGAGCGCGTG 
TACCAAGGGT 
GATCGACGTG 
GATTGTGAAA 
AGCAGAGCTC 
CGAAGGTATC 
TGGTCTTATC 
ACAGCTGGAT 
CGCTCTCGGA 
GGTAGGCGAT 
GATTGCACAG 
GCGl'TCTGCT 
CGACCGCGAA 
TGATATCGAA 
CAATTTCGGT 
CCTTTCTTGG 
CGAAGTTCAG 
GTTGGAAGAG 
AGGAACGGTA 
AGGTTTTGCC 
GAAGTTACCT 
TAGCXTGTGTA 
GGCTGAAGCC 
TGTAGAGAAA 
AGAAAAC 



60 
120 

leo 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1797 



35 (2) IHFORMATIOU FOR SEQ ID NO: 23 5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1650 base pairs 
IB) TYPE: nucleic acid 
40 ■ (C) STRAW DEDNESS: double 

(DJ TOPOLOGY: circular 

til) MOLECULE TYPE: DNA (genomic) 

45 (iii) HYPOTHETICAL; NO 



50 



55 



60 



65 



70 



75 



(Iv) ANTI -SENSE: NO 

(vi) ORIGIHAL SOURCE; 

(A) ORGAi>JISfl: PORYPHYROMONAS GINGIVTVLIS 

(iX) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1...16S0 



(xi) SEQUENCE DESCRIPTION: SEQ ID WO: 235 



ATGGGTAGGG 
GAAGGTGTGG 
GTGGAAGTTT 
AAGGGTCGTG 
GTAAAGCGCT 
GAGGCTTTCC 
GTTGAGAAGA 
GTTTCCCACA 
AAGCTCGAAA 
TTTATCGACC 
GTGGCTCATC 
TTTGATGAAG 
GATGCTCTCG 
GCAGATTACG 
GAAATGTCAT 
GTGGAAGCCG 
CAACTCAAGC 
CATGCTCGTG 



TAACTGCTAT 
TACCTGCAAC 
ATATCGAGAA 
CCGCTCGCTC 
A'l'GTGAAGTG 
TCCCGCGATC 
CGATGGAGTT 
AGGTGCTCAT 
AAGGGCAGGT 
TCGGTGGAGT 
CGGAAGAAAT 
ATCGCAAGCG 
ACAGCGAGCT 
GTGCTTTCGT 
GGACACAGCA 
TGATCCTGAC 
CGGATCCTTG 
TTCGCAACTT 



CAACAAGCGT 
AGAATTCCGC 
TCAGGAAGAT 
TTGGGAGCGC 
TCGTACCAAG 
ACAGATCGAC 
CAAGATTGTG 
CGAAGCAGAG 
ACrCGAAGGT 
GGATGGTCTT 
CGTACAGCTG 
TATCGCTCTC 
TAAGGTAGGC 
TGACATTGCA 
CTTGCCTTCT 
GCTCGACCGC 
GGCTGATATC 
CACCAATTTC 



GAAGTGGTTA 
TACAATCCCG 
AAGAAGGGCC 
GTGAACGAGG 
GGTGGTATGA 
GTGCGCCCCA 
AAAATCAATC 
CTCGAACAAC 
ATCGTCAAGA 
ATCGATATCA 
GATCAGAAGA 
GGACTCAAAC 
GATAAGGTGA 
CAGGGCGTTG 
GCTCAGGACT 
GAAGAACGCA 
GAAACTCGTT 
GGTGTATTCC 



TCAATGTAGG 
AACTCAAAGT 
AGCTCGTCTT 
CTCTCGAAAJ^ 
TCGTCGATGT 
TTCGCGACTA 
AAGAATATAA 
AGAAGAAAGA 
ATATTACTTC 
CTGACCTTTC 
TCAATGTCGT 
AGCTGATGCC 
AGGGTAAAGT 
AGGGTCTTAT 
TCCTGCATGT 
AAATGTCGCr 
TCCCTGTAGG 
TTGAGATCGA 



GTACAAATCG 
GGGAGACGAA 
GTCTCACCGC 
AGACGAAATC 
ATTCGGTATC 
CGATGCATTC 
GAATGTAGTT 
AATCATCGGC 
TTACGGAGTA 
ATGGG6TCGT 
TATCCTCGAC 
TCATCCTTGG 
TGTGGTGATG 
CCACGTAAGC 
AGGCGACGAA 
CGGTCTGAAG 
CTCTCGTCAC 
AGAGGGCGTA 



60 
120 
180 
240 
300 
360 
420 
480 
b40 
600 
660 
720 
7 80 
840 
900 
960 
1020 
1080 
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GATGGCCTTA TCCATATTTC CGACCTTTCT TGGACGAAGA AGATCAAACA CCCCAGCGAG 114 0 

TTTACGGAAG TAGGTGCTGA TATCGAAGTT CAGGTAATCG AGATCGACAA GGAAAACCGT 1200 

CGTCTCAGCT TGOGTCACAA ACAGTTGGAA GAGAATCCTT GGGATGTATT CGAGACGGTA 1260 

TTCACTGTAG QATCTATCCA CGAAGGAACG GTAATCGAAG TGATGGACAA GGGTGCTGTC 1320 

5 GTTTCTCTGC CTTACGGTGT GGAAGGTTTT GCCACTCCGA AGCACATGGT GAAGGAAGAT 1390 

GGCTCACAGG CTGTACTCGA AGAGAAGTTA CCTTTCAAGG TTATTGAGIT CTKATAAGGAT 1440 

GCCAAGCGAA TCATTGTATC TC7VTAGCCGT GTATTCGAAG ATGAGCAGAA AATGGCTCAG 1500 

CGTGAAGCCA ATGCAGAGCG TAAGGCTGAA GCCAAAGCGG CTCAGAAA6A AGCTGCTGCC 1560 

GAAGCTGCCA ATCCTGCACA GGCTGTAGAG AAAGCCACi'C TCGGAGACCT CGGCGAGCTG 1620 

10 GCCGCTTTGA AAGAAAAGCT TTCAGAAAAC 1650 



(2) IMFX5RHATION FOR SEQ ID NO: 236 

15 (i) SEOUEHCE CHARACTERISTICS: 

(A» LENGTH: 137 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: double 

(D) TOPOLOGY: circular 

20 

(HI MOLECULE TYPE: DNA (genomic) 

(iiil HYPOTHETICAL: NO 

25 Uv) AJITI-SEHSE: HO 

(vi) ORIGINAL SOURCE: 

(AJ ORGANISM: PORYPHYRa-fOMAS GIHGTVALIS 

30 (Ix) FEATURE: 

(Al NAME/ KEY: mlsc_feature 
(Bl LOCATION 1...1374 



35 



60 



70 



txi) SEQUENCE DESCRIPTION: SEQ ID NO: 236 



ATGATCGTCG ATGTATTCGG TATCGAGGCT TTCCTCCCGG GATCACAGAT CGACGTGCGC 60 

CCCATTCGCG ACTACGATGC ATTCGTTGAG AAGACGATGG AGTTCAAGAT TGTGAAAATC 120 

AATCAAGAAT ATAAGAATGT AGTTGTTTCC CACAAGGTGC TCATCGAAGC AGAGCTCGAA 180 

CAACAGAAGA AAGAAATCAT CGGCAAGCTC GAAAAAGGGC AGGTACTCGA AGGTATCGTC 24 0 

40 AAGAATATTA CTTCTTACGG AGTATTTATC GACCTCGGTG GAGTGGATGG TCTTATCCAT 300 

ATCACTGACC TTTCATGGGG TCGTGTGGCT CATCCGGAAG AAATCGTACA GCTGGATCAG 360 

AAGATCAATG TCGTTATCCT CGACTTTGAT GAAGATCGCA AGCGTATCGC TCTCGGACTC 4 20 

AAACAGCTGA TGCCTCATCC TTGGGATGCT CTCGACAGCG AGCTTAAGGT AGGCGATAAG 4 80 

GTGAAGGGTA AAGTTGTGGT GATGGCAGAT TACGGTGCTT TCGTTGAGAT TGCACAGGGC 54 0 

45 GTTGAGGGTC TTATCCACGT AAGCGAAATG TCATGGACAC AGCACTTGCG TTCTGCTCAG 600 

GACTTCCTGC ATGTAGGCGA CGAAGTGGAA GCCGTGATCC TGACGCTCGA CCGCGAAGAA 660 

CGCAAAATGT CGCTCGGTCT GAAGCAACTC AAGCCGGATC CTTGGGCTGA TATCGAAACT 720 

CGTTTCCCTG TAGGCTCTCG TCACCATGCT CGTGTTCGCA ACTTCACCAA TTTCGGTGTA 7 80 

TTCGTTGA6A TCGAAGAGGG CGTAGATGGC CTTATCCATA TTTCCGACCT TTCTTGGACG 84 0 

50 AAGAAGATCA AACACCCCAG CGAGTl'TACG GAAGTAGGTG CTGATATCGA AGTTCAGGTA 900 

ATCGAGATCG ACAAGGAAAA CCGTCGTCTC AGCTTGGGTC ACAAACAGTT GGAAGAGAAT 960 

CCTTGGGATG TATTCGAGAC GGTATTCACT GTAGGATCTA TCCACGAAGG AACGGTAATC 1020 

GAAGTGATGG ACAAGGGTGC TGTCGTTTCT CTGCCTTACG GTGTGGAAGG TTTTGCCACT 1080 

CCGAAGCACA TGGTGAAGGA AGATGGCTCA CAGGCTGTAC TCGAAGAGAA GTTACCTTTC 114 0 

55 AAGGTTATTG AGTTCAATAA GGATGCCAAG CGAATCAITG TAl-CfCATAG CCGTGTATTC 1200 

GAAGATGAGC AGAAAATGGC TCAGCGTGAA GCCAATGCAG AGCGTAAGGC TGAAGCCAAA 1260 

GCGGCTCAGA AAGAAGCTGC TGCCGAAGCT GCCAATCCTG CACAGGCTGT AGAGAAAGCC 1320 

ACTCTCGGAG ACCTCGGCGA GCTGGCCGCT TTGAAAGAAA AGCTTTCAGA AAAC 1374 



(2) IHEX>RMATIOH FOR SEQ ID NO: 2 37 



(1) SEQUENCE CHARACTERISTICS: 

(A» LENGTH: 1278 base pairs 
65 (B) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: DNA {genomic) 

(ill) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 

75 (vi) ORIGIIJAL SOURCE: 
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(A) ORGAHISII: PORYPHYROMOIIAS GTI-TGIVALIS 

tix) FEATURE: 

(A) (WHB/KEY: ml3f:_f eatur- 

(B) LOCATION 1...1278 

(xi) SEQUEIMCE DESCRIPTION: SEQ ID HO: 237 

ATGGAGTTCA AGATTGTtiAA AATCAATCAA GAATATAAGA ATGTAGTTGT TTCCCACAAG €0 -ri 

GTGCTCATCG AAGCAGAGCT CGAACAACAG AAGAAAGAAA TCATCGGCAA GCTCGAAAAA 120 

GGGCAGGTAC TCGAAGGTAT CGTCAAGAAT ATTACTTCTT ACGGAGTATT TATCGACCTC 180 

GGTGGAGTGG ATGGTCTTAT CCATATCACT GACCTTTCAT GGGGTCGTGT GGCTCATCCG 24 0 

GAAGAAATCG TACAGCTGGA TCAGAAGATC AATGTCGTTA TCCTCGACTT TGATGAAGAT 300 

CGCAAGCGTA TCGCTCTCGG AOrCAAACAG CTGATGCCTC ATCCTTGGGA TGCTCTCGAC 360 

AGCGAGCTTA AGGTAGGCGA TAAGGTGAAG GGTAAAGTTG TGGTGATGGC AGATTACGCT 420 

CCTTTCGTTG AGATTGCACA GGGCGTTCAG GGTCTTATCC ACGTAAGCGA AATGTCATGG 480 

ACACAGCACT TGCGTTCTGC TCAGGACTTC CTGCATGTAG GCGACGAAGT GGAAGCCGTG 540 

ATCCTGACGC TCGACCGCGA AGAACGCAAA ATGTCGCTCG GTCTGAAGCA ACTCAAGCCG 600 

GATCCTTGGG CTGATATCGA AACTCGTTTC CCTGTAGGCT CTCGTCACCA TGCTCGTGTT 660 

CGCAACTl'CA CCAATTTCGG TGTATTCGTT GAGATCGAAG AGGGCGTAGA TGGCCTTATC 720 

CATATTTCCG ACCITTCTTG GACGAAGAAG ATCAAACACC CCAGCGAGTT TACGGAAGTA 780 

GGTGCTGATA TCGAAGTTCA GGTAATCGAG ATCGACAAGG AAAACCGTCG TCTCAGCTTG 840 

GGTCACAAAC AGTTGGAAGA GAATCCTTGG GATGTATTCG AGACGGTATT CACTGTAGGA 900 

TCTATOCACG AAGGAACGGT AATCGAAGTG ATGOACAAGG GTGCTGTCGT TTCTCTGCCT 960 

TACGGTGTGG AAGGTTTTGC CACTCCGAAG CACATGGTGA AGGAAGATGG CTCACAGGCT 1020 

GTACrCGAAG AGAAGTTACC TTTCAAGGTT ATTCAGTTCA ATAAGGATGC CAAGCGAATC 1080 

ATTGTATCTC ATAGCCGTGT ATTCGAAGAT GAGCAGA7VAA TGGCTCAGCG TGAAGCCAAT 1140 

GCAGAGCGTA AGGCTGAAGC CAAAGCGGCT CAGAAAGAAG CTGCTGCCGA AGCTGCCAAT 1200 

CCTGCACAGG CTGTAGA3AA AGCCACTCTC GGAGACCTCG GCGAGCTGGC CGCTTTGAAA 1260 

GAAAAGCTTT CAGAAAAC 127 8 

12) IHFORl-IATIOtJ FOR SEQ ID NO: 238 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEWES3: double 

(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: DNA (genomic) 

(Ail) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(•/i) ORIGINAL SOURCE: 

(A) ORGA^•IJSH: PORYPHYKOIIONAS GINGIVALIS 

ax) FEATURE: 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION 1-..720 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 238 

ATGAAAAAAG CTATTCTTTC CGGAGCGGCC TTGCTCCTCG GCCTATGTGC CAACGCACAA 60 

AACGTGCAGT TGCACTACGA TTTCGGTCAT TCCATCTACG ACGAACTAGA TGGACGTCCC 120 

AAACTGAd'A CCACAGTGGA AAACTTCACA CCCGACAAAT GGGGAAGCAC CTTCTTCTTC 180 

ATCGACATGG ATTACACGGG CAAGGGTATC CAGTCGGCCT ATTGGGAGAT TTCGCGCGAA 240 

CTGAAGTTTT GGCAAGCTCC CGTTTCCATT CATTTGGAGT ACAACGGAGG CCTCTCCACA 300 

AGCTTTACTT TCGGACACGA TGCTCTAATC GGTGCCACCT ACACCTACAA CAACCCCTCC 360 

TTTACACGTG GATTTACGAT CACGCCCATG TACAAGCATC TGGGTGCGCA CGACTTCCAC 420 

ACCTATCAGA TCACCGGCAC TTGGTACATG CACTTTCTGG ACGGTCTGCT TACCTTCAAC 4 80 

GGCTTCCTCG ATCTTTGGGG nTCCCCCAA GAGAACCCAA TCCGGGGCCC TGTGCTCAAA 54 0 

UAAGGGGATA AGTTCGTATT CCTGTCCGAA CCGCAGTTCT GGATCAACCT CAATCGCATC 600 

AAAGGCATCG ACAAGGATTT CAATCTCAGC ATAGGGACAG AGATGGAAAT CAGCAGGAAC 660 

TTCGCTCGCA TGGACAAATT CTCCTGCATC CCTACTCTTG CGGTCAAATG GACTTTCAAC 720 

(2) INFORI-IATION FOR SEQ ID NO: 239 

(1) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 1302 base palr^ 
(B) TYPE: nucleic acid 
(Ci STRANDEDNESS; double 
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10 



15 



(D) TOPOLOGY: circular 

Ui) MOLECULE TYPE: DMA (genomic) 

(ill) HYPOTHETICAL: NO 

( i v) AHTT -SENSE : HO 

(vi) ORIGINAL SOURCE; 

(A) ORGANIStI: PORYPHYROHONAS GINGIVALIS 

(ix) FEATURE: 

(A) rWlE/KEV: iaisc_f eature 

( B ) LOCATIOIJ 1 . . . 1 30;^ 

(Ki) SEQL'EWCE DESCRIPTION: SEQ ID HO: 239 



20 



25 



30 



35 



40 



ATGTATAAAG 
TTGCTATTTG 
ACTCGCTTCG 
GGACTCGGCG 
TCGGCTGTGG 
TATGCCGAGA 
CTTTTTCCTA 
GGGTACCAGT 
TACTTC'SGGA 
AACTTCTCAA 
GTAATCTTCT 
GCTGCCAAGT 
GTTATCCJGTG 
AATCAGGTTC 
GACTATTATA 
CTTCTCTTAG 
GATTGCAAAT 
AATGCCGTTG 
TACCTGAAAG 
TTCGGTATAG 
GACCGATl'GA 
CTCACGTTCA 



ACTATAAGGG 
CACiCAGACGG 
GCCTTGGCCG 
TCG-jTCTGCG 
ATTCGATGAC 
ACGGGAAAAA 
TTTCCAAATC 
TCGGATCCGT 
CAGGCAATCT 
TAGGAGCCAA 
CCACGGACCC 
TCGACTTCGG 
CCGTCTATTC 
AGAACGGTGT 
CCCTGCCTCA 
GAGCAGACGT 
TCCAGGACAG 
GGATGTGGCC 
TGCCGACTAA 
GAATCCCGCT 
TCCCGAAGGA 
ACiSAGTCAl'G 



TTTGTATGCG 
TATACAGGCT 
TCTCGGAGAA 

tcagggcaca 

GTTTATCTTC 

ggacaatagg 
cattgctatg 
tgatcaagtg 
gaacgatctc 
tgcttcatcc 

TCCl'TACAAT 

tatgcagtat 
tccgcgggtg 
agtagtggag 
tacattgggg 
ccaatatagt 
aatacgggta 
taaagttcgc 
aggcggtgta 
caatgacaga 
gggtatgatc 
gtttaaaaag 



tcgcttcggt 
cagaacaaca 

CGGACa\CTA 

tacgtcaatg 
gatttcggtg 
aaaatgccaa 
agtgcgggag 
gaaggaggca 

TATGTCG3TA 
CTTTTTGGGC 
CCCGTACATC 
CACCTTCTTC 
AAGATGCATA 
AGCGAAACCC 
ATAGGTTTTT 
AAATGGAAAG 
TCTCTCGGCG 
TATCGCTTCG 
TATCAAGGAT 

AAA<SAAAATG 
CTGAAACTGA 



GGTATGCCCT 
ACTTTACCGA 
TTACTTGGGCA 
CCGTCAATCC 
CATCTACCGG 
ACATTGAGTA 
TGCTTCCTTA 
GCGTCCAGTA 
TAGGTGCAAC 
GATTCACACA 
TCTCGACGCT 
TCAAATCAGA 
GCGAGCTGAC 
AAGAATATAT 
CTTATGAAAA 
GCGAGAAATT 
GAGAGATCAT 
GTTTACATGG 
ACCATATCGT 
TAAATGTCTC 
CTCTGAAATT 
AC 



GATCATTGGG 
GTCGCCTTAC 
TTCCATGGGA 
TGCTTCATAC 
AATTACGTCG 
TTTCGCCATG 
CTCCGCATCC 
CACCCGTAAA 
CCCGTTCAAA 
CAGCAGGCAG 
GTACTTGAAG 
TCGTTCGCTC 
TCAGATAAAG 
CAAGGGAATG 
GAAAGATAAA 
TTATAAATCC 
ACCGGATATA 
TGAAAATTCT 
AGGTGCTGTA 
TCTTGAATAT 
GACCTTCGGC 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1302 



45 



50 



55 



60 



65 



70 



75 



<2) lUrORMATIOH FOR SEQ ID NO: 24 0 

ii) SEQUEMCF. CHARACTERISTICS: 

(A) LENGTH: 2778 base pairs 

(B) TYPE: nucleic acid 

(C) STRAI4DED1IESS : doubls 

(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: D»A (genomic) 

(111) HYPOTHETICAL: HO 

(iv) ANTI-SEUSF.: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROt'lOI'IAS GIKGIVALIS 

(ix) FEATURE: 

(A) HAIfE/KEY: misc_£eature 

(B) LOCATION i...2778 

(xl) SEQUENCE DESCRIPTION: SEQ IDNO:240 



ATGCGATCGA 
Cl'GGAAGCCC 
ATTCAAGCCA 
ACTAATGAAA 
GTTTCATATG 
ATTACGCTCA 
CAGGCTAAAG 
TATACCGTAG 
ATCGGATCCG 
GGCAAAGAGT 
GTCAATAAAC 



TTTATCAATT 
AACAAGCCGG 
ACGTACAGCT 
AAGGGTTGTT 
TAGGTTACAC 
AAGATATATC 
CGGCAGAGGT 
CACAGGGAGC 
ATGGGAAGAT 
TTTTCTCCAA 
TACAGGTACT 



ACTGTTGTCA 
AGTAGCAGGT 
TGTACAGAGT 
CAGCCTGAAA 
TACCCACGAC 
CATGAACGAA 
CGTGGTACGC 
TTCTATCGAG 
CACCATCAAC 
AGATCCACAG 
GAACAAACTG 



ATACTCCTTG 
AGAGTATTGG 
ACCGGCCAAG 
ACGTCACAGG 
GAAAAAATAT 
GATGCCCGTC 
AACGATACGC 
GAACTGATCA 
GGCAAGGACA 
GTGGCAATAA 
AGCGAGCTGT 



CTTCTCTTGG 
ACGAAGAAGG 
TAGCCGTTGC 
AGGGTGACTA 
CTCTTAGAAA 
TTCTACAGAG 
TCGAATTCAA 
AGAAGCTACC 
TTAGCAAGAT 
AGAATCTTCC 
CGCGGATGAG 



TTTCGTCGGG 
CAACCCCATG 
CGCAGGTGCC 
CATTCTGCGC 
CGGGCAAACC 
TGTGACGGTG 
TGCCGGATCC 
CGGAGCAGAG 
CCTTGTCGAT 
GGCCGATATG 
CGGTTTCGAT 



60 
120 
IBO 
240 
300 
360 
420 
480 
54 0 
600 
660 



SUBS 1 11 U I E SHEET (Rule 26) (RCVAU) 
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GATGGAGAAG 
TTCGGAAC'3C 
AATCGGTTCG 
ATGGGCTTTA 
GGTGGTCGAC 
AACTTCAGTG 
AACGACAAGG 
TATATGGACG 
ATGCAATGGA 
AAGATCGATG 
AACAAAGGTT 
GATATCAGTC 
Cl'GACCGACG 
CAAAAGCAAT 
CCGTTGGGTA 
TCGGATCGTC 
TACGGACTCT 
ATTGCCAAAA 
TATCGGAGCG 
CCGATGCTCC 
GGACGCACGA 
CTATTCtrrGA 
ATGTTCTCGG 
TATACATTCG 
ACTCGTTACG 
CCACTCAAGA 
CAAAGCTTCA 
ACGCTGACCT 
ATGGCGAATA 
TATCAAGTTG 
ACGAACTCCG 
TCATACAGCT 
GGTCAGCGGT 
AATACGATCG 
. GGTGGATCTC 
GGTGGCAGAC 



AGGAGACCGT 

TTCAGGCCGG 

ATGGAAATAA 

GCGAGATGGA 

GCGGCTTCGG 

TCGAATTCl'C 

CCATAGAGAC 

AAAATATATT 

AACCGTCCGA 

GGTTCTTTAA 

CTATCCACCA 

ACAAGCTCAA 

AAGACGGAGA 

TCAACGACAA 

AAAACl'ACTT 

AGGTGTACCG 

CCTACAGTAA 

CGTGGGACTA 

TAGCCGGAGT 

GAATCAACTA 

CACAACCATC 

CGGAAGGCAA 

ACTTCGATGC 

ACGACATCGT 

AAAACGCCTC 

ACAGGGCATT 

TCAATGACGA 

ATCGCAACAA 

ATAGTCTGAG 

CCCTAACGCT 

GTTACAGCGG 

TCCTCCGTGA 

CAAGTATCAG 

GACGCTACX3T 

GCAGCGATCA 

GACCGTCC 



AATCAACCTG 

CTACGGTACC 

GCAATGGACA 

CAGCGAGATG 

CAATAGTGGA 

CTCTGCCCTT 

GACCAAACGC 

GGAACGCTCT 

ACGTACCGAA 

CGACACATAC 

AACTACACAA 

CGACGAAGGC 

TGGCATATAT 

CTCCAACCTG 

CGCACAAGCG 

ACTGGGCGAT 

CGAGTl'CACC 

CACCGTAGGA 

AGAGCAGGAC 

CAAACCGAGC 

CATCAATCAG 

TCCCGGTCTG 

CAAAAGTCAG 

CCCCAATACG 

CGC^TACGTGG 

TTCTTTCAGC 

TAAGAACAAA 

TTGGATCGAT 

CGGACAGAAA 

TCCCTATGGA 

AGGATTCAGT 

CAAGGCCGGT 

CCGTTCTGCT 

GATGGTGGAC 

TCAGCGTGGC 



ACGGTGAAGC 
GACCAACGCT 
TTGATCGGTA 
GGATCCATGA 
GGTGTTACGT 
AATACAGGAG 
GTGGAAAATA 
Ta-CTCTCACA 
GTGGTATTCG 
GAGACGAAAG 
GGAAACAACT 
CGTACGATCA 
CAGGCTGTGC 
CAATATCGGC 
ATTCTGAACA 
GACGGGCAAT 
CAGTATCGCA 
TTCAATGTGG 
AAACTGCCTT 
AGGACTACCA 

atccjctcccg 
aagcx:gagct 
cx;agctttca 
cactacgatc 
caagcgaatc 

ATGTCCTTGT 
GCTCTCTCTT 
ACGAGTATCG 
GATTCTCGCA 
TTCCCTA7CG 
CTGGACGAAT 
ACACTGCGTG 
TCGGCCATCA 
TTTATCTACC 
AATATGAATC 



CCGAAAAAAA 
ATATGGCCGG 
GTGCGAACAA 
CCTTCTTCTC 

cttcx;tcgat 
gcgatgcacg 
tcctcgccga 
atggtcaggc 
agccggatct 
atgccaccgg 
tcagactgaa 

GTGCCTCCGT 
TCCAAAGCGT 
TTCGCCTCTC 
GACGTTTCTC 
ACTCCATATT 
TCGGACTCAA 
ATCCCAACAG 
TCAATCGTGT 
ACCTCCGAGT 
TTCAGGACAT 
ATTCCAACAA 
ACATTGTTTT 
CGTCTACAGG 
TTCATGGGAC 
TCAACAGGTT 
TCCGAACGAG 
GTGGCAATAT 
CATACGATTT 
ACAGCGATGT 
GGCTTTGGAA 
TCAATGGCTA 
Al'ATAGAAGA 
GATTCAACGC 
GTCCGGGCCC 



GAAAGGCCTC 
AGGGAACGTC 
TACGAACAAT 
TCCCCAAGGC 
GCTGGGCGGC 
CTACGGATAC 
AGGGAATACT 
GCGATTTAGG 
TTCGATATCC 
AATCrCTATC 
CGGAGAATTG 
CAGTGGCGGT 
GGAGAC6AAT 
GTATGTGGAA 
CCGTCGCAAT 
AGACAGTCAG 
CCTCAAGAAG 
AACTGTCAGC 
CAATCTCTCC 
GGACTACCGA 
CACGAATCCG 
TGTGATGGCC 
CTTCGGCAAC 
GATCCGTACC 
ACTATCGCTT 
GGCCGAAGGA 
GGAACGCCTG 
CGGATTCTAT 
TGGCGGCAAT 
TGAATACAAT 
TGCTTCGCTT 
TGACATCCTC 
GAGCATGTCC 
CTTCAGTGGT 
ACCTTTCGGC 



770 
780 
B-10 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2S00 
2640 
2700 
27 60 
2778 
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{2J IHFORJIATION FOR SEQ ID NO:241 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs 

(B) TYPE: nucleic acid 

(C) STRAIN DEDHESS: double 

(D) TOPOLOGY J circular 

(ii) MOLECULE TYPE: DUA (genomic) 

UH) HYPOTHETICAL: WO 

tiv) AllTI-SEMSE: NO 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: PORVPHYRa>IONAS GINGP/ALIS 

(ix) FEATURE: 

(A) NAHE/KEY: itd.sc_f ©ature 
<B) LOCATION 1...1200 

Ul) SEQUENCE DESCRIPTION: SEQ ID HO: 241 



ATGGTTATGA 
CTTCCTGCGT 
GAAATACGTA 
GATACGGAAT 
CAGTCCTACG 
GTTCCCGOTC 
TCAGGTGCCG 
TACTATGATG 
CCTTATACGT 
TGGITCTACT 
TGGAGCTACr 
CCTTATGGCT 
CACCACTACC 
GGCCGACCGA 



AGCTGATTAA 
ACTCGCAGAA 
AACAAAACCA 
ATACAGTGGC 
ATGGGAAAAA 
GCTATAGTCG 
ACAATGTATA 
ACGCGTCGTC 
CATGGTATCC 
ACGGTAGCCA 
ACTATGATCC 
GGGGCAGCTA 
CCAAGAAGAC 
ATCGTATCAA 



AAGAAGTTTG 
TGATGACATC 
AGTTAAAGAC 
TTCCGATCGG 
GTTGTCCAAA 
CCGCTTGCCT 
TGTAACTGAT 
TGTAAACATT 
ATCTTTCTCC 
TATCCGATGG 
TTTCTACAAT 
TTACGGTTGG 
CTATTCCAAT 
AGGTGGAACG 



CTCCTGCTTG 
TTCGAAGATG 
TGGCAAAACC 
GACATTGACG 
GACAAGAAAA 
CGATTCTATA 
GATGGTGAGT 
TACATCAACA 
GGCTGGTACA 
GGCGGTTATT 
CCCTATGGAA 
GGTGGCTATC 
GGTCAGCATT 
TCCGGTGCCA 



GAGCGGTACT 
ACATCTATAC 
AAGAGGACGG 
CCTACAATCG 
GAGACTCCAC 
AGCCGAATAC 
ATTTOGTCTA 
GTCCTTGGTG 
ACTATACGTG 
ACCCCGGATA 
TCGGTATGGG 
CGGGAGTGAT 
CCGGAGCTTA 
AACTTGGGAC 



GCTGATTACG 
ATCGCGAAAA 
ATACGGCGAC 
TAGAGATGGC 
TCGTTCTTCT 
GATCGTCATT 
TGGAGACGAA 
CGATCCGTTC 
GAACTATCCA 
TAATTGGTAT 
TTGGGGATAT 
ACATCACTAC 
CTATTCTTAT 
AGGACGCTAC 



60 
120 
180 
2-10 
300 
360 
420 
480 
540 
600 
660 
720 
780 
8-10 
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GATAGAATTC 
AATAATAATC 
ATAGAAACGG 
CAGTCCGGCA 
AACGATAGGA 
TCTCGCTCTT 



AAAATTCGTC 
TGCAAAATGT 
TAACTCCAAA 
ATGACCGACC 
CATTTTCGAC 
CTTCCGGCTC 



TTCGCAAAAA 
CAAGTCGGGA 
CAACGGGCAA 
GACCGGACGG 
TCCTTCTCGT 
TATGAGCGGA 



AATAAGTTC'3 
CGTACCGGCC 
AAGCAGAATC 
AATATCCGCA 
AGCAATAGTA 
GGTGGCX3GAC 



GATTGCACTC 
GAGCCAATAG 
GTCCCGTATT 
GCGAGAGACA 
ACGGTGGCTT 
GTAGTGGCCG 



GAACAAACCC 
AGACCGAAAT 
CCAGCAGAAT 
CCCGGAAAAT 
CTCCACGCCT 
GGGACGCAAT 



900 
960 
1020 
1O80 
1140 
1200 



10 



15 



20 



25 



30 



35 



40 
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50 



(2) INFOWtATia? FOR SEQ ID IIO:242 

(11 SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1194 base pairs 
<B) TYPE: nucl«ic acid 
iC) STRAW DEDHESS: double 
(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic I 

(iii) HYPOTHETICAL: NO 

tiv) AJITI -SENSE: HO 

(vi) ORIGINAL SOURCE: 

(A) ORGAl'flSIt: PORYPHYRaiONAS GIMGTVALIS 

{±K) t-EATURE; 

tA> NAT IE/KEY: misc_f eature 

(B) LOCATION 1 . . .1194 

(xi) SEQUENCE DESCRIPTION: SEQ ID WO:242 



ATGAAGCTGA 
GCGTACTCGC 
CGTAAACAAA 
GAATATACAG 
TACGATGGGA 
GGTCGCTATA 
GCCGACAATG 
GATGACGCGT 
ACGTCATGGT 
TACTACGGTA 
TACTACTATG 
GGCTGGGGCA 
TACCCCAAGA 
CCGAATCGTA 
ATTCAAAATT 
AATCTGCAAA 
ACGGTAACTC 
GGCAATGACC 
AGGACATTTT 
TCTTCTTCCG 



TTAAAAGAAG 
AGAATGATGA 
ACCAAGTTAA 
TGGCITCCGA 
AAAAGTTGTC 
GTCGCCGCTT 
TATATGTAAC 
CCJTCTGTAAA 
ATCCATCTTT 
GCCATATCGG 
ATCCTTTCTA 
GCTATTACGG 
AGACCTATTC 
TCAAAGGTGG 
CGTCTTCGCA 
ATGTCAAGTC 
CAAACAACGG 
GACCGACCGG 
CGACTCCTTC 
GCTCTATGAG 



Tl'TGCTCCTG 
CATCTTCGAA 
AGACTGGCAA 
TCGGGACATT 
CAAAGACAAG 
GGCTCGATTC 
TGATGATGGT 
CATTTACATC 
CTCCGGCTGG 
ATGGGGCGGT 
CAATCCCTAT 
TTGGGGTGGC 
CAATGGTCAG 
AACGTCCGGT 
AAAAAATAAG 
GGGACGTACC 
GCAAAAGCAG 
ACGGAATATC 
TCGTAGCAAT 
CGGAGGTGGC 



CTTGGAGCGG 
GATGACATCT 
AACCAAGAGG 
GACGCCTACA 
AAAAGAGACrr 
TATAAGCCGA 
GAGl'ATTTCG 
AACAGTCCTT 
TACAACTATA 
TATTACCCCG 
GGAATCGGTA 
TATCCGGGAG 
CATTCCGGAG 
GCCAAACTTG 
TTCGGATTGC 
GGCCGAGCCA 
AATCGTCCCG 
CGCAOCCAGA 
AGTAACGCTG 
GGACGTAGTG 



TACTGCTGAT 
ATACATCGCC 
ACGGATACGG 
ATCGTAGAGA 
CCACTCGTl'C 
ATACGATCGT 
TCTATGGAGA 
GGTGCGATCC 
CGTGGAACTA 
GATATAATTG 
TGGGTTGGGG 
TGATACATCA 
CTTACTATTC 
GGACAGGACG 
AtrrCGAACAA 
ATAGAGACCG 
TATTCCAGCA 
GACAGGGGGA 
GCTTCTCCAC 
GCCGGGGACG 



TACGCTTCCT 
AAAAGAAATA 
CGACGATACG 
TGGCCAGTCC 
TTCTGTTCCC 
CATTTCAGGT 
CGAATACTAT 
GTTCCCTTAT 
TCCATGGTTC 
GTATTGGAGC 
ATATCCTTAT 
CTACCACCAC 
TTATGGCCGA 
CTACGATAGA 
ACCCAATAAT 
AAATATAGA^k 
GAATCAGTCC 
AAATAACGAT 
GCCTTCTCGC 
CAAT 



€0 
120 
180 
240 
300 
3G0 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1194 



55 



60 



65 



(2) INFORMATION FOR SEQ ID HO: 243 

(i) SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 174 3 base paire 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 
(D TOPOLOGY: circular 

(ii) MOLECl'LE TYPE: Dtih (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AIITI -SENSE: NO 



70 



75 



(vi) ORIGINAL SOURCE: 

(A) ORGAN ISM: PORYPHYRaiONAS GINGIVAL IS 

{ix} FEATURE: 

(Al MAME/KEY: mlsc_£-ature 

(B) LOCATION 1.-.174 3 

(xi) SEQUF.ircE DESCRIPTION SEQ ID NO: 243 



SUBSimniE SHEET (Rule 26) (RO/AU) 
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WO 99/29870 PCT/AU98A)1023 

166/490 



ATGATCCGCr. AGTTGATCTT ACTGefTGCT CTGATGCCGG TAGCCTCTGT GGCTTTTGCG 
OTACCAACGG ACAGCACGGA ATCGAAAGAC AATCGTATCC TTACAAGCAT GCAATCCTCC 
TCTTTGAATA GGGATGATGC TCCGGATAAA TGGCAACCTA TGCATCCCAA TTTCAGTATT 
S^S!^*^^'^ TGCTGCTTTC TACTGCCCAA AA6TCCAAGA ACACCTGGTT CGGCAACTC- 
GTATAATCAA GAACAATTAT CTGGAGrrTG GTGCCCGTTT CGAGGATCTC 
TATAAGCCCC TGCCCGGACA TGAACCCGAG ATGGGGCGTG GCGTTCCTCA CATGTATGTG 
AAGGGAAGCr ATCATTGGGC GGAGCTGACT ATGGGAGACT TCTACGATCA GTTCGGTAGC 
in ^^Z^I^^ TCCGCACCTA TGAAGAGCGC AACCTCGGTA TAGACAACGC GGTTCGCGGC 
^^f^l^ TACTCACTCC TTTTGATGGA GTGCGTGTCA AGGGTATTGC AGGACAGCAG 
SSIif TCGACCGCAC GGGCAAGGTA TTCAAITCCG GCCGAGGCTA CCTACTGGGT 
l^^Z^l^ AGCTGAATGT AGAGCGTTGG AGCAGTGCCA TGCGCGACAA TGACTATCAT 
GGGGATCGTT CGTTTCCAAA CACGAAGCAG ACGAAGATAT ATTTGTGGGT 
IS ATCGCAAGCG ACTCAACCTG CCGCTCAATG TCCCGATTAT GGGCCTGCGC 

^SJ^JIS '^^^^^^GG TCTCGCCCTC TACGCAGAGl" ATGGATACAA ATACAACGAT 
CCCTCGGCAG ACAATGACTA TATCTACCAC GACGGACAGG CTGCACTCCT CTCTGCCTCA 
TACTCCAAAA AAGGGATGAG TATCCTGTTG CAGGCCAAAC CTTGTGAGAA CTTTGCTTTC 
V GAAGTGCCCA GCTCACACCG CTTATGATCA ACTATATGCC GGCTTTTACC 

7n CTTATACGCT GGCGGCCATC TACCCCTATG CTACTCAGCC TCAGGGAGAA 

- AAGGTGAACT GCGTTACAAC TTTGCTCGCC GGACAGCTCT CGGTGGACGC 

' ' I^^S^J^?^*^ GCTTGCGTAT CAACGTTTCG CATGTGCGTG GTCTGGACAA AAAGATGCTC 
SS^^IS ^SS'^^*^^^ GATCGGAACG GATGGCTACA CCGTTTCTTT CTTCGGCATG 
GGCGACCTCT ATTATTCGGA TATAGATGTG GAGATTACTA AAAAGGTAAG CCCAGGATTC 
7^ TCACCTACTT GAATCAGATC TACAATAACA AGGTACTGCA CGGTGCAGCC 

^^^^ CTGAGAAGAT CTATGCCAAT ATCTTCGTCT ATGATGGTAA GTATAAGCl'G 
TAGCCCTCCG TACCGAACTG CAATATTTGC ACACGAAGCA GGATCA^GGT 
GACTGGATCT ACGGCATGGC CGAGCTCTCT ATCCTGCCl'T CTCTGATGCT TTCCCTCTCG 
GAGCAGTATA ATATCGGAGA GACCAAGAAA CATTATGTCA TGGGGTCTGT CACrTATACT 
30 SaSJJ^S^S ^^5^51?'^'' ITTCTCTGCA GGCAAAACCC GTGCAGGGAT GAACTGCTCG 1680 
GTCGTGTGGT CCCTGAGACT CAGGGATTCT ACCTTTCrTA TAGCACCAAT 11 AO 

1743 



(2) IHFOW>IATION FOR SEQ ID NO: 24 4 



(i) SEQUENCfc: CHARACTERISTICS: 

<A) LENGTH: 717 base pairs 
(Bl TYPE: nucleic acid 
(C) STRANDEDNESS: double 
^ ID) TOPOLOGY: circular 

111) MOLECULE TYPE: DHA (genomic) 

(iil) HYPOTHETICAL: NO 

(iv) AHTI -SENSE: NO 



(vi) ORIGIWAJ. SOURCE: 

(A) ORGAhJIST^: PORYPHYROMOHAS GIHGIVALIS 

iXx) FEATURE: 

(A) NAI'IE/KEY: misc feature 

(B) LOCATION 1. . .717 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 4 

aI^TI^^ ^J^^^™""*" CGCGTTGCGC AGCTCCTCTC TCCATGCTTC AGAGCG.^CGC 
iiSJS^S^ GTTCTTCTGT AGTCATGTCA ATAAGGCAGA AAATAAGGTT ATTCCATCTC 
Rn I^™^^^^ CCCAAACGCA TGATCATCTC ATCGAAATCC ACTTGGTGTG CATCGAATTC 

^S^?'^^ atttcgtctg tcctcccacg cttataogac aagccccJca 
^l^^^^^ ccatccacca taattgtatt gagagaagct atggtcggta tctcgtaacg 
JI^^^*^ agagaaacga acttcatcat cacagcccgc ccgatcgtaa cgcagaggtc 

cgtttgataa cgcittccac tccatccgtt acgaggcctt tcgtcccata 
65 Vz!!'^'^'^'^ tgatcacttc atcgctattg gctcgcattt gttcttcaag 

^IJ?^^? tctttagttc tggcagccaa tacgacaatt acacggttgc ctgctttgtg 

t^^^^^S"^ ACGATCGGGA GCAAAGGAGC CACACCCACA CCGCCTCCGG CACAAACCAC 
l?^^?^ TTTTCGATAT GCGTACTCTG TCCCAGCGGA CCTACCACAT CCGTGATATA 
GTCGCCGACT TCGAGTTCGG CCAATTTCTT GGAAGATTTG CCCACGGCCT GAACCAC 



(2) IHFORJlATlOtJ FOR SEQ ID MO: 24 5 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 633 base pairs 

(B) TYPE: nucleic acid 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(C) STRAf^DCONCSS: double 

(D) TOPOLOGY: circular 

(il) MOLECULE TYPE: ttrA (genomic) 

ail» HYPOTHETICAL: HO 

(iv) AHTI- SENSE: HO 

(Vi) ORIGIIJAL SOURCE: 

fA) ORGAfJISH: PORY PHYROMOMAS GINGIVALIS 

(1x1 FEATURE: 

(A) NAME/ KEY: mis cofeature 

(B) LOCATIori 1...633 

(xil SEQUENCE DESCRIPTION: SEQ ID NO: 245 



ATGTCAATAA 
CATCTCATCG 
CGTCTGTCCT 
TGTATTGAGA 
CATCATCACA 
TTCCACTCCA 
CACTTCATCG 
ACCCAATACG 
AGGAGCCACA 
ACTCTGTCCC 
TTTCTTGGAA 



GGCAGAAAAT 
AA7\TCCACTT 
CCCACGCTTA 
GAAGCTATGG 
GCCGGCCCGA 
TCCGTTACGA 
CTATTCGCTC 
ACAATTACAC 
CCCACACCGC 
AGCGGACCTA 
GATTTGCCCA 



AAGGTTATTC 
GGTGTGCATC 
TACGACAAGC 
TCGGTATCTC 
TCGTAACGCA 
GGCCTTTCGT 
GCATTTGTTC 
G(3TTGCCTGC 
CTCCGGCACA 
CCACATCCGT 
CGGCCTGAAC 



CATCTCTCGG 
GAATTCGGGG 
CCCACACATA 
GTAACGTTTG 
GAGGTCTACC 
CCCATAAGAC 
TTCAAGGATA 
TTTGTGGAAA 
AACCACTGTG 
GATATAGTCG 
CAC 



TATGCGCCCA 
CCATCGACAC 
CCGGTGCCAT 
GTCAGGAGAG 
GTTTCCCGTT 
CCATCGTCTG 
ACCAGATCTT 
GCCTCCACGA 
CCGACCTTTT 
CCGACTTCGA 



AACGCATGAT 
AGACGAATTT 
CCACCATAAT 
AAACGAACTT 
TGATAACGCT 
TCATGATGAT 
TAGTTCTGGC 
TCGGGAGCAA 
CGATATGCGT 
GTTCGGCCAA 



60 
120 
ISO 
240 
300 
360 
470 
480 
^4 0 
60C 
633 
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(2) IIIFORI1ATION FOR SF.Q ID lJO:246 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2343 base pairs 

(B) TYPE: nucl-sic acid 
{C) STRAf-JOEDNESS: double 
(D» TOPOLOGY: circular 

{lil MOTJ-.CULE TYPE: DHA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTX -SENSE: WO 

ivl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYROMOHAS GINGIVALIS 

(Ix) FEATURE: 

(A) r4AME/KEY: mlsc_£eature 

(B) LOCATION 1. . .2343 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 6 



55 



60 



65 



70 



75 



ATGGATCGTC 
GGCAGGCCTT 
AGTTCTTTTG 
AATAGCTGGA 
CACAAGATTC 
CAATTCTCCC 
CTGACTAAAT 
GCGGAGACGG 
GAGATGGCTC 
AGTGGCTCCG 
GTGGATCCTC 
TTCGCCTGTT 
GATACCGGCA 
GAAGTTACCG 
GCGGAAGTCA 
AAGGTGCTCG 
TCCGCACTCT 
AATTTTGTGC 
ACCGGATTTT 
GAAAACATTA 
GAATGGGCCA 



CTAAGCCTTC 
TGTTTGCGCA 
AGCTGAGTGC 
CACCGATGAC 
GCGGATTCAA 
TTCTTCCCCT 
GGTGCAACCA 
CGACGCCATA 
CGACCGAGCG 
GTCGATGGCT 
ACACCGTAGT 
ATTTCATCCT 
AGTCAGACGA 
TCCGGGTGGC 
AAGGGCAGAG 
GACGCATACA 
ATCGCTGTCT 
ATTACAGCCC 
GG-jACACTTT 
AAATTCAGGA 
GTCCGGGCCA 



ATATATTGTT 
GAGCTATGTG 
GGGCAATACC 
CGGTGTACCC 
ACAGACCCAC 
TACGGCACCG 
ACTCTTTTCG 
CTATTATAGT 
TGCAGCTATC 
TCGTCTTGAT 
GGGCATATCT 
GCAGTCCGAT 
AGGCACAAGG 
ATCTTCTTTT 
TTTCGACCGG 
TGTGGAAGGA 
GCTTTTTCCC 
CTACAATGGA 
TCGAGCCCTT 
AGGTCTGCTG 
TCGGGATTGT 



CGAATAGCAG 
GACTACGTCG 
TATCCGGTGA 
GGTGACGGCT 
CAACCCAGTC 
CAGAAGCCAT 
GACGAACAGA 
GTCTATTTGG 
TTTCGCATAC 
GCCTTTACCG 
CGCAAGAATA 
ACrCCTATGG 
GCATGGGCAG 
ATCAGTGTCG 
ATCAGACTTG 
GGAACGAAGG 
CGTCGCITCT 
GAGGTACTTC 
TTCCCCCTGC 
AATGTATATC 
ATGATAGGCA 



CCATTCTCTG 
ATCCGCTGAT 
TCGGTTTACC 
GGCAATATAC 
CTTGGATCAA 
CATCGAACGA 
CCTCGTGGTT 
CCGATTACGA 
GTTATTCCGG 
GTGGTTCGGA 
GCGGAGGTGT 
CCGATGTCCT 
CCTGTCGCTT 
AGCAGGCCGA 
CCGGTCGCGA 
ATGAGCGCAC 
ATGAGGAGGA 
CCGGTTATCT 
TCAATCTGCT 
GCGAGAGTGG 
ACAACTCTGC 



CTTGTTTGTC 
CGGGACGCTA 
GTGGGGAATG 
CTACTCGGCA 
CGACTACGGC 
CTCCATAGCT 
CrCGCACAAA 
CACACGCGTG 
CAATACCGAA 
GATTAGCATC 
GCCGGCTAAC 
GCTTGAGACA 
CGATTCGCAA 
AAGAAATCTT 
AGCTTGGAAT 
TACATTCTAT 
TGCTTCCGGC 
CTATACCGAT 
GTATCCCGAT 
CTTTTTCCCC 
TTCTGTTCTG 



60 
120 
180 
240 
300 
360 
420 
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900 
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GCGGATGCCT 
TTGCATGCTA 
TGGTACAACT 
ACGCTCGAAT 
GATAGAGCTG 
CCGGAAACCA 
CCTTTCAAAT 
TTTC7VTGATG 
CTCGATTCGG 
CACGAAATCA 
ATACAGCATA 
CTACGCGAAG 
GACAACGGAC 
CXrCGCTACCG 
CCCGACGGAC 
CGCTCGATCA 
CGCTCTTCTG 
AAGGAAAGTG 
AAT 



ACCTCAACGG 
CGAAAGCCGT 
CCTTAGGTTA 
ATGCTTATAA 
CATTGGACAC 
AACTCATGCG 
GGGGAGATGT 
TGCAGGGGCT 
TATTCAATAC 
GAGAGATGCA 
TGATATATCT 
TGATGGGGCG 
AGACTTCGGC 
ATCAGTATGT 
ACAAAACGGT 
GCGTAGAAGG 
CATCCATl'CA 
ACAGACCTTA 



TGTTCGGGTA 
CCATCCGAAA 
TGTTCCGGCT 
CGATTGGTGC 
GTTGGCTCAT 
CGGTAGAAAT 
ATTCACGGAG 
TATCGACCTG 
TCCTCCTATG 
AATAGCGGAT 
GTATAATCAT 
GCTCTATCGT 
TTGGTACGTT 
GCTCGGTTCG 
GTTGCATGCT 
AAAAGAATGG 
ATGGATGATG 
TTCCTTCTCX; 



GAAGATACCC 
ATCTCCTCCA 
GATGCAGGCA 
ATCCTCCCAC 
CGTTCGATGA 
CAGGATGGTA 
GGCAATGCCT 
ATGGGAGGAG 
TTCGATGAGA 
ATGGGCAATT 
GCCGGTCATC 
CCTACTCCGG 
TTCTCTGCTT 
CCGATTTTTT 
CCGGCCAACA 
AGCTGCAATT 
GACACGAAAC 
ACGGAGCAAC 



gtacactgat 
cgggtcgcaa 
tcgacgaaag 
tggggcgcac 
actatcgtca 
gtttccggac 
ggcactacac 
atcgccx:gtt 
gctattacgg 
atgctcatgg 

CATGGAAAGC 
ATGGGTATTG 
TAGGCTTCTA 
CCAAGGTAAT 
GTGCCGATAC 
ACCTGACTCA 
CCAATTATAA 
AGCGTCGCGC 



GAACGGACTC 
AGGTTGGGAG 
TGCTGCCCGT 
ATTGGGTTGG 
TCTGTTCGAT 
ACCTTTTTCC 
TTGGTCGGTC 
CGTGTCTATG 
ATTTGTGATC 
CAATCAACXX 
TCAGGAGAGA 
CGGCGATGAA 
TCCT6TTACA 
ACTCTCTTTT 
GCCTTACATC 
CGAACAGCTT 
TCGTGGTATG 
TAATCACACT 



(2) IHFOWIATIOt^ rOR SEQ ID NO: 247 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 813 base pairs 

(B) TYPE: nucleic acid 
{CJ STRAHDEDNESS: double 
(DJ TOPOLOGY: circular 

(11) MOLECULE TYPE: DMA (genomic) 

(iii) HYPOTHETICAL; NO 

(iv) AI-ITT -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGAI4ISM: PORY PHYROr>(OWAS GINGIVALIS 

(Ix) FEATURE: 

(A) NAIIE/KEY: misc feature 
(Bl LOCATlOTi 1. . .813 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247 



ATGATGAAAT 
TTGGGCTGTA 
CTCGAACGCT 
AGCTACGTCT 
GGCGACAGCC 
CTGGGACAAG 
CTTTTGGACT 
TCCACGCGCA 
TTTGCCAACA 
GACTATGCGG 
ACCACACCTG 
GGCAATTTTA 
GCTTATGCCA 
AGGGATGATT 



CCATGCGCAG 
GCAATAACAA 
ACATCCCTCT 
ATCCTTCGGG 
TGATGGATTC 
ACTATCGCTC 
ATATCTACAA 
TCAATACATA 
TCCTTCGCAC 
AAAGGCTGTC 
CCGAATTGGA 
TGATCGATGA 
GAGGTGCTGT 
CCCCACTAAA 



CGTGCTGTTG 
AGCTGCCGAA 
GCGGGCAGAC 
AGATGATATG 
CTCTTCGCCG 
TAACAATGCC 
GCAGGAAAAT 
TACTTACGAA 
CACCGGCAAG 
CGCACTCATC 
TGCAATAGGT 
CAAAGGTOrC 
CTATGTCCGT 
GCGTTACTTG 



CTACTCTTTC 
TCGAAGTCTG 
ATAGATACGC 
CTCACAGAGA 
GAGAATGCCA 
GAAGCCAATC 
ACCATCGCTT 
GGCGGTGCAC 
GTGCTCGAAG 
ATAGGACAAT 
TTCTTCAACG 
ACATACTGTT 
CTCGGATATG 
CCG 



CATTGTCTTT 
TCTCTTTCGA 
CATCGCTGCA 
TTTTCAACGG 
TGGAAGGCTA 
TGCAAGGGCT 
ATTGCGATAC 
ATACGGAGAA 
AGCGAGATAT 
TCGTGCACGA 
CAGAAGAAAT 
TCAATGAGTA 
AC6TATTGGC 



GATCACTGCT 
TTCGGCCTAT 
TGTGATGATC 
TTTGCTCTTC 
TGCACAGATG 
TCCTTCTGAC 
GGGATTGATC 
TACAGTCCGG 
ATTCAAGATC 
TTTCGGCAAG 
ACAGCCCAAT 
TCAGATAGCT 
TCCTTTGCTA 



{2) INEX)RMATION FOR SEQ ID NO: 248 

(1» SEQUEIiCE CHARAC1*ERISTICS: 

(A) LENGTH: 610 baa« pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS: double 

(D) TOPOLOGY: circular 

tii) MOLECULE Ti'PE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(Iv) ANTI -SENSE: NO 

tvi) ORIGIUAL SOURCE: 



1320 
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SUBSTTTUTE SHEET (Rule 26) (RO/AU) 
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WO 99/29870 PCT/AU98/01023 

169/ 490 

(A) ORGANISM: PORYPHYROMONAS GIHGIVALIS 

iix) FEATURE: 

(A) MAIIE/KEY: mlsc feature 
^ (D) LOCATIW 1...810 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248 

in ^SliSI^*^ TGCGCAGCGT GCTGTTGCTA CTCTTTCCAT TCTCTTTGAT CACTGCTTTG en 
ATAACAAAGC TGCCGAATCG AAGTCTGTCT CTTTCGATTC GgSSc 1 
TCCCTCTGCG GGCAGACATA GATACGCCAT CGCTGCAT^f Ga5StS^ llo 
rJ^fJS^Jf CTTCGGGAGA TGATATGCTC ACAGAGATTT TCAACGGTTT 5cT^^ ' I40 
TGGATTCCTC TTCGCCGGAG AATGCCATGG AAGGCTATGC AO^mTOCT^ Inn 
GGAGAAGACT ATCGCTCTAA CAATGCCGAA GCCAATCTGC J^GGGCTTCC tStgI^ f22 
I?^^!;^:^ TCTACAAGCA GGAAAATACC ATCGCTTA?? SS^^S^ JJ^^^ "^^^ 
G^S^JSi JJ^Sil^^'^^ TTACGAAGGC GGTGCACATA CGGAGAAT^C 5^?^^ 
SfSSSS^^^^ TTCGCACCAC CGGCAAGGTG CTCGAAGAGC GAGATATATT CAAGATCGAC 
lilSSS"^ CGCTGTCCGC ACTCATCATA GGACAATTGG TGCACGATTT CGgS^^ 
20 J^^S^ ^215^^ TTCAACGCAG AAGAAAT^ GC^^ 

TCGATGACAA AGGTCTCACA TACTGTTTCA ATGAGTATCA GATAGCTGCT 720 
S?^^2c S^SS C.GAXATGACG .ATTGGCTCC T^^G^^ Vs'o 



30 



45 



420 
480 
54 0 
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660 



810 



25 (2) INFORMATION rOR SEQ ID NO: 24 9 



iil SEOUENCE CHARACTERISTICS: 

(A) LENGTH: 801 base pairs 

(B) TYPK: nucl«ic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOCf: circular 

(11) MOLECULE TYPE: DNA (genomic) 

35 (ill) HYPOTHETICAL: NO 

(Iv) AlvlTI-SEllSE: NO 

(Vl) ORIGINAL SOURCE: 
4U (A) ORGAfJISM: PORYPHYROMONAS GINCTVALIS 



(ix) FEATURE; 

<A) NAME/KEY: misc feature 
(B) LOCATION 1...301 



<Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24 9 

JIS^^S^"^"^*^ TGCTGTTGCT ACTCTTTCCA TTGTCTTTGA TCACTGCTTT GGGCTGTAGC 60 
50 ^l^^^ ^^^l^ fi^S''^*^^ TCTTTCGATT CGGCciS?? ^^^^ 

ilSSJSJ^S GGGCAGACAT AGATACGCCA TCGCTGCATG TQATGATCAG CTACGTCTAT Tfln 
^S^SSS^ ATGATATGCT CACAGAGATT TTCAACGGTT TGCTCTTOGG CGA^OTG 
il^^S^ CrrCGCCGGA GAATGCCATG GAAGGCTATG CACAGATGCT GGG^S^c 
ACAATGCCGA AGCCAATCTG CAAGGGCTTC CTTCTWCCT ???S^a5 

55 aI^I^S^ aggaaaatac catcgcttat tgcgatacgg gattgatctc cacgSw? 
^JS^SI^^^ cttacgaagg cggtgcacat acggagaata cagtccgott tgccScaJc 
^^^^"""^ ccggcaaggt gctcgaagag cgagatatat tcaaStSa ctmgSSI^ 

SS^JSI J^^JI? ?T?5?^^^ TCGGCA^^ 



240 
300 
360 
420 
480 
540 
600 
660 



»jTTs«Tj ^T,^ 5ss^ '.^^^ ssss; 



801 



(2) IHFORMATlOfJ FOR SEQ ID NO: 250 

(1) SEQUE]>ICE CHARACTERISTICS: 

<A) LENGTH: 777 base pairs 
(B) TYPE: nucleic acid 
tC) STRANDEDNESS: double 
(D) TOPOLOGY: circular 

(11) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Uv) AI^ITI -SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROMONAS GINGIVALIS 

( ix ) ttATURE : 

(A) riAHE/KEY: mi sc;_ feature 

(B) LOCATTOM I. . ,777 

(Xl) SEQUtHCE DESCRIPTION: SEQ ID HO: 250 



ATGAAAAAAA 
GCCAATGCCC 
GCCACGGCCG 
AGtSGTACACC 
AGAGCACACT 
ATTGGTATTG 
GCTCTCGGCC 
TGGGGATGGA 

tatgacacct 
gtgagk:aatg 
atcaagagca 
ctatatccgt 

CGCATGAACC 



CTACTTTGAC 
AATCGAAGTT 
GAAATGGTTT 
ACAGCATTCT 
GGGAAGAGGC 
TTGAAAAGGT 
GCTTCCTCAT 
CCAATGACGC 
ACACAAGAGC 
GCTTTTCACC 
AATATGATTT 
ATCGTAATTT 
ACATCGGCCA 



AGGATCGATA 
AAAGATCAAG 
TGGTGGCAAT 
GCCCGAAGGG 
CGGAAGTCCG 
AATAGTCTTC 
CAGTTTGGGG 
CCATATTCTT 
TGGCAGTGAC 
GGGGATCGGA 
TCGCCTCGGT 
CAAGCTGGAT 
TGTGGGCTTC 



TGTGCTTTAC 
AGCATTGAGG 
ATCTTCGGCA 
TTGGATTTTT 
AAGCTCATGT 
GAAGACGCAG 
ATATCCTATA 
TTCTCACCGA 
CTTGTACTTC 
CTCAAAAGTT 
TTCAGCCTGG 
GGAAATAAGC 
AACTTTACCG 



TCCTGTTTTT 
CAGCTACCAC 
TGGACATGAG 
CGGTAGGAAT 
ATACGAATGT 
A^GACTTTTT 
CCAAGCATCT 
TACCCAAGAG 
AGTCCGAAGA 
CTATTTGGTG 
CCTATGAGTA 
CGCTTTCAGC 
TGGGTCTTTG 



GGGTCTCTCG 
TTTCAGTTCG 
CATACGGATG 
ACATGAAAGA 
CCCAAGTATC 
TGACAAAAAA 
GGGAGCGTAT 
CAAGGTCCAC 
TGTTGCCACA 
GAAAATGCCC 
TCTGAACCTG 
ACTATCTCCT 
GACTAAT 



60 
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180 
240 
300 
360 
420 
480 
540 
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(2» INFORMATION FOR SEQ ID NO: 251 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 98 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

( D) TOPOLOGY : circular 

(11) MOLECULE TYPE: DNA (genomic) 

tiii) HYPOTHFTICAL: NO 

(Iv) AJJTI-SENSE: NO 

(Vi) ORIGINAL SOURCE: 

I A) ORGANISM: PORYPHYRa-IONAS GINGIVALIS 

(ix) FEATURE: 

(A) NAJ-IE/KEY: misc_f eature 

( B ) LOCATION 1 ... 37 98 

(Xl) SEQUENCE DESCRIPTlOt-I; SEQ ID NO: 251 



ATGGGCAAGT 
TATACCTTTG 
CAAGCCATTT 
GAAGGGATAA 
GAGCTTTGCT 
CTAAAAAAGC 
ACTTCGTTAA 
AGTCTCACCT 
CTGGAACGTC 
GAGGCTCTGG 
AAGCTGGAGG 
ATCCGTAAGC 
AACCAAATCA 
TCGGGTAACC 
GAACTGTCGG 
AAGCTTCGTC 
CTAACAAAAC 
ACCTCGTTAG 
CGTCTCACCT 
CTGGATAGTC 
GAGGGCCTGG 
AAGCTGGAGG 
ATCAGTAAGC 
AATGATATl'C 
GAAAAACTGA 
TATGATAATC 



ATAAAAGAGC 
AGGGAGTTAC 
TAGACTTAGA 
GCTGGTCGGT 
TGCGAGAATG 
TTGATCTATC 
CAAAACTTCG 
CCCTAACAAA 
TCACCTCGTT 
AACGTCTCAC 
CTCTGGAACG 
TAGAGGGTCT 
GTAAGCTAGA 
AAATCCGTAA 
GTAACCAAAT 
TAAGAAGTAA 
TTTCTCTCTC 
CGGAGCTTTA 
CGTTAACAAA 
TCACCTCGCT 
AACGTCTCAC 
GTCTTGATGG 
TGGAAGGACT 
AATCTATTGA 
GAATCCATGA 
ATTTGCCGGA 



TAAGTACCGC 
TTTTTTATGC 
GAAGGCCTAT 
AAATAGATAT 
TCAGATAGAA 
GTATAACCAA 
TCTAAGAAGT 
ACTTTCTCTC 
AGCGGAGCTT 
GTCCTTAGCA 
TCTCACGTCC 
GGAACGTCTC 
GCGTCTGGAA 
GCTGGAGGGT 
CAGTAAGCTA 
CCAGATCAGT 
CGATAACCAA 
TCTTTTGGAT 
GCTTCGTCTA 
AACAAAACTT 
GTCCTTAGCG 
TCTTGCTTCC 
AGACAGACTA 
TGATATTAAG 
CAATCCATTT 
GATTAAAGCT 



TATTGGCTTT 
GCATCTGACG 
AACATTGAAA 
TTCAAGCAAG 
AGCATGACTT 
ATCAGT/UVGC 
AACCAAATCC 
TCCGATAACC 
TATCTTTTGG 
ACGCTTGAAC 
TTAGCAACGC 
ACTTCGTTAA 
CGTCTCACGT 
CTGGAACGTC 
GAGGGTCTGG 
AAACTAGAGG 
ATCAGTAAGC 
AACCAAATCC 
AGAAGTAACC 
TCTCTCTCCG 
GAGCTTTATC 
TTAACAAGGC 
AAGGTTTTGA 
CTATTGGCTC 
GTTGCATCAT 
CTTCTTGAAA 



TTCCTTTCTG 

ATATGACAAC 

TTCCTGATCT 

ATTCCTCCGG 

GGCTTATTGA 

TAGAGGGTCT 

GTAAACTAGA 

AAATCAGTAA 

ATAACCAAAT 

TATCGGGTAA 

TTGAACTATC 

CAAAGCTTCG 

CCTTAGCAAC 

TCACGTCCTT 

AACGTCTCTC 

GCCTGGAACG 

TAGAGGGTCT 

GTAAGCTGGA 

AAATCAGTAA 

ATAACCAAAT 

TTTTGGATAA 

TTAGTCTAAG 

GAAAACTTGA 

CGATTCTGGA 

CAGGCTTGAT 

AAGAAAAAGA 



TTCGGATTAT 
CAAGAAACCC 
CTCCTCACAA 
TGCAGTCGTT 
TTTTCCTGCT 
AGAACGTCTT 
GGGCCTGGAT 
GCTAGAGGGT 
CAGTAAACTA 
CCAAATCCGT 
GGGTAACCAA 
TCTAAGAAGT 
GCTTGAACTA 
AGCAACGCTT 
TTCGTTAACA 
TCTCACCTCG 
GGAACGTCTC 
GGGCCTGGAA 
ACTAGAGGGC 
CAGTAAACTA 
CCAAATCCGT 
GCGCAACCAA 
TGTTTCGGGC 
GCAAACTTTA 
ACTCTCTCCT 
AAAACAGAAA 



60 
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300 
360 
420 
480 
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600 
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900 
960 
1020 
1080 
1140 
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1380 
1440 
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1560 
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AAGACTTCAG 

GGTAAAACAA 

GTGTTGTCGA 

GACTATTATC 

TGG6ATGCTA 

AATTTCAATC 

GTTGGAGGAA 

CAGACTCATG 

GGAGTATTGG 

CTCAACTATC 

ATCACAGAAA 

CACATCCCTA 

GATCTTTACA 

GTGCT-rTACT 

TTTGTCCAAA 

CCTAAAGACA 

GATGGCCAAA 

GATAAGGACT 

AAAT6GCTTA 

CCCTTTGGCC 

CGGTATTGGC 

GAGCAAGAAG 

ATCAAGCl'CG 

TCAGCTAAGG 

TGGAACAATA 

ACTATTCGTG 

CAAGCGGATA 

AGCAAGACTA 

CGGGTGCGAG 

ACTGCAAAAC 

CTGCAATTTT 

ACTAAGTTTG 

ATAATCGCTT 

CCTGTATTTC 

GAAGACGATG 

GGAAAAGAGG 

GAAAATTGGG 

GAGGTAAATA 



TTGAATATCA 
CATTTCTTAG 
TACATCGAAG 
ATGGGATTTA 
AGAAGGATCG 
GCCCCTATTG 
ATCCTGATGG 
CCGATGAAAC 
AAGAAATCTA 
TGAATGAGCG 
AAGATAAGOG 
TCTCTCTCGA 
CCATAGAGTA 
ATCGTGAGAA 
TGATTCATGG 
TTTTTGAATG 
ATGGTAATAT 
GCTATGTGAT 
CTTTGGGATT 
TGATCAACCA 
GAGATCAGGT 
AAGAGAAAGA 
ACTTTACCGA 
ATATGCAGCG 
TCCCTCCGAG 
AAACAAACAG 
AAGATTTGAC 
CGGCGAGGAT 
AAGTATCGAC 
AGATCTTTAT 
TCAAACCCTT 
AAACACCTAT 
TGATCAGCCA 
GGGAGTAT7VA 
AGTTCCTTCG 
GAAAAACCAT 
TGGCAGTAGT 
CAGATGAA 



CCCATTTTGC 
TCAATACGAT 
CAATAACCCT 
CCAAGCCTTT 
AAACTTTGTG 
GTTAGGACAG 
CAAGGACACA 
GGGCGCTAAG 
TGTATCCTTA 
GGTGCGAGAA 
ATTGTACGAA 
AGCTCTTGCG 
TCTACAGACX: 
TGAGAAGCTG 
AGAAATCCTC 
CAAACTGCAT 
GATCTTGCAG 
ACCGGGCTTAT 
CGAGAGGCCC 
GATTATAGCC 
CATCTTCACA 
GGGTTTGCCC 
CTTGGCCATA 
GAAAGAGGCT 
GGAGCAAATA 
AAAGAAGAGA 
GGAGTCTCAT 
TGCAGCCTAT 
TCGTCCCTAC 
TTCCTATTCC 
GGAGAAGAAT 
TCACCCTGAA 
ACGCTATCTG 
CAAGACCATA 
GGAGAAATAT 
TAAAGCTTAT 
CAGAGAGTTC 



AAAGTAATGC 

ACAAATTATA 

AATGCGATCT 

TTTACCACCC 

AGCGTAGATG 

ATAGCCTATG 

CCACAGACCA 

CAGCAAACCT 

GAGCCCAAGG 

GTTGTCGCAA 

GCTCTTCCCA 

GCTCAATTGA 

GAATTGAACC 

AACAATTATG 

CAAAAAGACA 

AATCTAAGTT 

CTATTATTGG 

CTCCCTTTGC 

AATTTTGTCC 

TACTATGGCC 

GCAGGCCGTG 

AAGACGAATG 

TCCGTATTCA 

ACTATCCTCA 

GGAGATAAGG 

CCCATCCAGG 

TATATCCATT 

CCGTTGAAGA 

AAACATCTTT 

AAAGAGGATC 

GGTCAGATCC 

ATAAGAAACC. 

GCCACGGATT 

GTGCCGATAT 

TTTGCTCAGA 

GATAGTATTA 

AAAGAGAAGA 



TATTGGGAAA 
CGTATCAGAA 
TTTACGACTT 
AATCGTTATA 
ATAAAGAATA 
CCTGCAATCG 
CAGACGATAC 
TAGGCTGTGC 
CGAATAGTGC 
GCAGGAGTAA 
CAATCGCCGG 
ATAAGGGAAG 
AGCTTAGTCT 
TCTGGTTAGA 
ACATCAATAG 
CCGGAAGTAT 
AAGAGCTGAT 
ATTC'CGATGA 
TCAAATTCGA 
GGGAAGAAGG 
AGATGGATAG 
CCGAGGATTA 
TCAAAGAGCA 
GTGATATGl'T 
ATACGGAGCA 
ATCTCTACCT 
TGGGCACGCT 
ACGGCGTTAT 
CCGTCAATAA 
AGACTGAACT 
AGATCTACTA 
GTATTGTCGA 
ACATCCTGGA 
TGATC7UKGCC 
AAGCTCAAAT 
CGGCATCAGC 
TATTAAGAAT 



TCATTCTTCG 
AAATACACAT 
TGGGGGACAG 
CCTTCTCTTT 
TCAGACTCTT 
TTGTATGTCC 
AATTATCATT 
AGCCGAGAAT 
CGTACATGCG 
ATCAATTCAG 
TGATAATAAA 
AGCTGAAAAT 
GCGAGGGGAG 
TCCGGCAGCT 
AGGAACAGTT 
ATTTGAAGAA 
CGTATATGAA 
CGAAGCCTAT 
ACGTTTTATC 
TGCTCTAAAG 
GCAAACGCTT 
TCAGATCTGG 
GAGAAAGACA 
GGATATGTAT 
AACGAGAAGC 
CTCCTGTGCC 
GGACGATGAA 
CGATAAAGAG 
AAATCTGGCT 
GGAGACCTGT 
TGATAAGTTG 
AGCCGACTGT 
TCATGAGTTG 
TTGTACATTC 
AATCAATCTT 
CCATCGTGAT 
AACAAAACAG 



1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2200 

234 0 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3460 

3540 

3600 

3660 

3720 

3780 

3798 



(2) IMFORJ'IATION FOR SEQ ID NO: 252 



45 



50 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 3696 base pairs 

(B) TYPE: nucleic acid 

(C) STRAHDEDNESS : double 

(D) TOPOLOG-^: circular 

(111 MOLECULE TYPE: DTJA (genomic) 
tiii) HYPOTHETICAL: MO 



55 



60 



65 



70 



75 



Uv) AHTI -SENSE: NO 

(vi) ORIGIHTVL SOURCE: 

(A) ORGAIIISM: PORYPHYROMONAS GINGIVALIS 

{ ix ) FEATURE : 

(A) NAME/KEY: mlsc_f eature 

(B) LOCATIOM 1 . . .3696 

(XiJ SEQUENCE DESCRIPTION: SEQ ID NO: 252 



ATGACAACCA 
CCTGATCTCT 
TCCTCCGGTG 
CTTATTGATT 
GAGGGTCTAG 
AAACTAGAGG 
ATCAGTAAGC 
AACCAAATCA 
TCGGGTAACC 
GAACTATCGG 
AAGCTTCGTC 
TTAGCAACGC 



AGAAACCCCA 
CCTCACAAGA 
CAGTCGTTGA 
TTCCTGCTCT 
AACGTCTTAC 
GCCTGGATAG 
TAGAGGGTCT 
GTAAACTAGA 
AAATCCGTAA 
GTAACCAAAT 
TAAGAAGTAA 
TTGAACTATC 



AGCCATTTTA 
AGGGATAAGC 
GCTTTGCTTG 
AAAAAAGCTT 
TTCGTTAACA 
TCTCACCTCG 
GGAACGTCTC 
GGGTCTGGAA 
GCTGGAGGGT 
CCGTAAGCTA 
CCAAATCAGT 
GGGTAACCAA 



GACTTAGAGA 
TGGTCGGTAA 
CGAGAATGTC 
GATCTATCGT 
AAACTTCGTC 
CTAACAAAAC 
ACCTCGTTAG 
CGTCTCACGT 
CTGG.^ACGTC 
GAGGGTCTGG 
AAGCTAGAGG 
ATCCGTAAGC 



AGGCCTATAA 
ATAGATATTT 
AGATAGAAAG 
ATAACCAAAT 
TAAGAAGTAA 
TTTCTCTCTC 
CGGAGCTTTA 
CCTTACCAAC 
TCACGTCCTT 
AACGTCTCAC 
GTCTGGAACG 
TGGAGGGTCT 



CATTGAAATT 
CAAGCAAGAT 
CATGACTTGG 
CAGTAAGCTA 
CCAAATCCGT 
CGATAACCAA 
TCTTTTGGAT 
GCTTGAACTA 
AGCAACGCTT 
TTCGTTAACA 
TCTCACGTCC 
GGAACGTCTC 



60 
120 
180 
240 
300 
360 
420 
480 
540 
6 00 
660 
720 



SUBSTTTUTE SHEET (Rule 26) (RCVAU) 



ip=ii »p=ii ii"™ 4P°h «ii jp«i. .flp tp^i .rtP ip^H." a"*?,' 
■li^i iCJi JB,. iC:; iO ^Oi m U O iC;; O lUJj. ii„,i1 



wo 99/29870 



PCT/AU98/01023 



172/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



ACGTCCTTAG 
CGTCTCrCTT 
CTGGAACGTC 
GAGGGTCTGG 
AAGCTGGAGG 
ATCAGTAAAC 
AACCAAATCA 
TTGGATAACC 
AGTCTAAGGC 
AAACl'TGATG 
ATTCTGGAGC 
GGCTTGATAC 
GAAAAAGAAA 
TTGGGAAATC 
TATCAGAAAA 
TACGACTTTG 
TCGTTATACC 
AAAGAATATC 
TGCAATCGTT 
GACGATACAA 
GGCTGTGCAG 
AATAGTGCCG 
AGGAGTAAAT 
ATCGCCGGTG 
AAGGGAAGAG 
CITAGTCTGC 
TGGTTAGATC 
ATCAATAGAG 
GGAAGTATAT 
GAGCTGATCG 
TCCGATGACG 
AAATTCGAAC 
GAAGAAGGTG 
ATGGATAGGC 
GAGGATTATC 
AAAGAGCAGA 
GATATGTTGG 
' ACGGAGCAAA 
; CTCTACCTCT 
GGCACGCTGG 
GGCGTTATCG 
GTCAATAAAA 
ACTGAACTGG 
ATCTACTATG 
ATTGTCGAAG 
ATCCTGGATC 
ATCAAGCCi'T 
GCrrCAAATAA 
GCATCAGCCC 
TTAAGAATAA 



CAACGCTTGA 
CGITAACAAA 
TCACCTCGCT 
AACGTCTCAC 
GCCTGGAACG 
YAGAGGGCCT 
GTAAACTAGA 
AAATCCGTAA 
GCW\CCAAAT 
nTCGGGCAA 
AAACTTTAGA 
TCTCTCCTTA 
AACAGAAAAA 
ATTCTTCGGG 
ATACACATGT 
GGGGACAGGA 
TTCTCTTTTG 
AGACTCTTAA 
GTATGTCCGT 
TTATCATTCA 
CCGAGAATGG 
TACATGCGCT 
CAATTCAGAT 
ATAATAAACA 
CTGAAAATGA 
GAGGGGAGGT 
CGGCAGCTTT 
GAACAGTTCC 
TTGAAGAAGA 
TATATGAAGA 
AAGCCTATAA 
GTTTTATCCC 
CTCTAAAGCG 
AAACGCTTGA 
AGATCTGGAT 
GAAAGACATC 
ATATGTATTG 
CGAGAAGCAC 
CCTGTGCCCA 
ACGATGAAAG 
ATAAAGAGCG 
ATCTGGCTAC 
AGACCTGTCT 
ATAAGTTGAC 
CCGACTGTAT 
ATGAGTTGCC 
GTACATTCGA 
TCAATCTTGG 
ATCGTGATGA 
CAAAACAGGA 



ACTGTCGGGT 
GCTTCGTCTA 
AACAAAACTT 
CTCGTTAGCG 
TCTCACCTCG 
GGATAGTCTC 
GGGCCTGGAA 
GCTGGAGGGT 
CAGTAAGCTG 
TGATATTCAA 
AAAACTGAGA 
TGATAATCAT 
GACTTCAGTT 
TAAAACAACA 
GTTGTCGATA 
CTATTATCAT 
GGATGCTAAG 
TTTCAATCGC 
TGGAGGAAAT 
GACTCATGCC 
AGTATTGGAA 
CAACTATCTG 
CACAGAAAAA 
CATCCCTATC 
TCTTTACACC 
GCTTTACTAT 
TGTCCAAATG 
TAAAGACATT 
TGGCCAAAAT 
TAAGGACTGC 
ATGGCTTACT 
CTTTGGCCTG 
GTATTGGCGA 
GCAAGAAGAA 
CAAGCTCGAC 
AGCTAAGGAT 
GAACAATATC 
TATTCGTGAA 
AGCGGATAAA 
CAAGACTACG 
GGTGCCAGAA 
TGCAAAACAG 
GCAATTTTTC 
TAAGTTTGAA 
AATCGCTTTG 
TGTATTTCGG 
AGACGATGAG 
AAAAGAGGGA 
AAATTGGGTG 
GGT7\AATACA 



AACCAAATCA 
AGAAGTAACC 
TCTCTCTCCG 
GAGCTTTATC 
TTAACAAAGC 
ACCTCGCTAA 
CGTCTCACGT 
CTTGATGGTC 
GAAGGACTAG 
TCTATTGATG 
ATCCATGACA 
TTGCCGGAGA 
GAATATCACC 
TTTCTTAGTC 
CATCGAAGCA 
GGGATTTACC 
AAGGATCGAA 
CCCTATTGGT 
CCTGATGGCA 
GATGAAACGG 
GAAATCTATG 
AATGAGCGGG 
GATAAGGGAT 
TCTCTCGAAG 
ATAGAGTATC 
CGTGAGAATG 
ATTCATGGAG 
Tl'TGAATGCA 
GGTAATATGA 
TATGTGATAC 
TTGGGATTCG 
ATCAACCAGA 
GATCAGGTCA 
GAGAAAGAGG 
TTTACCGACT 
ATGCAGCGGA 
CCTCCGAGGG 
ACATVACAGAA 
GATTTGACGG 
GCGAGGATTG 
GTATCGACTC 
ATCTTTATTT 
AAACCCTTGG 
ACACCTATTC 
ATCAGCCAAC 
GAGTATAACA 
TTCCTTCGGG 
AAAACCATTA 
GCAGTAGTCA 
CATGAA 



GTAAGCTAGA 
AGATCAGTAA 
Al'AACCAAAT 
TTTTGGATAA 
TTCGTCTAAG 
CAAAACTTTC 
CCTTAGCGGA 
TTGCTTCCTT 
ACAGACTAAA 
ATATTAAGCT 
ATCCATTTGT 
TTAAAGCTCT 
CATTTTGCAA 
AATACGATAC 
ATAACCCTAA 
AAGCCTTTTT 
ACTTTGTGAG 
TAGGACAGAT 
AGGACACACC 
GCGCTAAGCA 
TATCCTTACA 
TGCGAGAAGT 
TGTACGAAGC 
CTCTTGCGGC 
TACAGACCGA 
AGAAGCTGAA 
AAATCCTCCA 
AACrGCATAA 
TCTTGCAGCT 
CGGGCTATCT 
AGAGGCCCAA 
TTATAGCCTA 
TCTTCACAGC 
GTTTGCCCAA 
TGGCCATATC 
AAGAGGCTAC 
AGCAAATAGG 
AGAAGAGACC 
AGTCTCATTA 
CAGCCTATCC 
GTCCCTACAA 
CCTATTCCAA 
AGAAGAATGG 
ACCCTGAAAT 
GCTATCTGGC 
AGACCATAGT 
AGAAATATTT 
AAGCTTATGA 
GAGAGTTCAA 



GGGTCTGGAA 
ACTAGAGGGC 
CAGTAAGCTA 
CCAAATCCGT 
AAGTAACCAA 
TCTCTCCGAT 
GCTTTATCTT 
AACAAGGCTT 
GGTTTTGAGA 
ATTGGCTCCG 
TGCATCATCA 
TCTTGAAAAA 
AGTAATRCTA 
AAATTATACG 
TGCGATCTTT 
TACCACCCAA 
CGTAGATGAT 
AGCCTATGCC 
ACAGACCACA 
GCAAACCl-l'A 
GCCCAAGGCG 
TGTCGCAAGC 
TCTTCCCACA 
TCAATTGAAT 
ATTGAACCAG 
CAATTATGTC 
AAAAGACAAC 
TCTAAGTTCC 
ArrATTGGAA 
CCCTTTGCAT 
TTTTGTCCTC 
CTATGGCCGG 
AGGCCGTGAG 
GACGAATGCC 
CGTATTCATC 
TATCCTCAGT 
AGATAAGGAT 
CATCCAGGAT 
TATCCATTTG 
GTTGAAGAAC 
ACATCTTTCC 
AGAGGATCAG 
TCAGATCGAG 
AAGAAAGCGT 
CACGGATTAC 
GCCGATATTG 
TGCTCAGAAA 
TAGTATTACG 
AGAGAAGATA 



780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1900 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2 94 0 
3000 
3060 
3120 
3180 
324 0 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3696 



55 



60 



65 



70 



75 



(2) 1NFORMATIOI4 FOR SEQ ID NO: 253 

(i) SEQUEtJCE CHARACTERISTICS: 

(A) LENGTH: 3525 base pairs 
(D) TYPE: nucleic acid 

(C) STRAl^DEDIJESS: double 

(D) TOPOLOGY: circular 

(il) MOLECULE TYPE; DNA fgenomict 

(iii) HYPOTHETICAX: NO 

(iv) ATiTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRao^AS GINGTVAMS 

H X > FEATURE : 

(A) NAHE/KEY: misc featur- 
IB) LOCATION 1. . .3525 

SEQUEIJCE DESCRIPTION: SEQ ID HO:253 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



iP-'h W^" iP^i 

U ^S*- ^ lOi JiL 



1 -ti^^n ^if 



""^J iP-ii -if"*!! .ti°ni 

u Q c::: cai u -u 



WO 99/29870 



PCT/AU98/01023 



173/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



ATGACTTGGC 
AGTAAGCTAG 
CAAATCCGTA 
GATAACCAAA 
CTTTTGGATA 
CTTGAACTAT 
GCAACGCTTG 
TCGTTAACAA 
CTCACGTCCT 
GAACGTCTCA 
GGTCTGGAAC 
CTAGAGCGCC 
AGTAAGCTAG 
CAAATCCGTA 
AGTAACCAAA 
CTCTCCGATA 
CTTTATCTTT 
ACAAGGCTTA 
GTTTTGAGAA 
TTGGCTCCGA 
GCATCATCAC 
CTTGAAAAAG 
GTAATGCTAT 
AATTATACGT 
GCGATCTTTT 
ACCACCCAAT 
GTAGATGATA 
GCCTATGCCT 
CAGACCACAG 
CAAACCTTAG 
CCCAAGGCGA 
GTCGCAAGCA 
CTTCCCACAA 
CAATTGAATA 
TTGAACCAGC 
AATTATGTCT 
AAAGACAACA 
CTAAGTTCCG 
TTATTGGAAG 
CCTTTGCATT 
TTTGTCCTCA 
TATGGCCGGG 
GGCCG-rCAGA 
ACGAATGCCG 
GTATTCATCA 
ATCCTCAGTG 
GATAAGGATA 
ATCCAGGATC 
ATCXTATTTGG 
TTGAAGAACG 
CATCTTTOCG 
GAGGATCAGA 
CAGATCGAGA 
AGAAAGCGTA 
ACGGATTACA 
CCGATATTGA 
GCTCAGAAAG 
AGTATTACGG 
GAGAAGATAT 



TTATTGATTT 
AGGGTCTAGA 
AACTAGAGGG 
TCAGTAAGCT 
ACCAAATCAG 
CGGGTAACCA 
AACTATCGGG 
AGCTTCGTCT 
TAGCAACGCT 
CGTCCTTAGC 
GTCTCTCTTC 
TGGAACGTCr 
AGGGTCTGCA 
AGCTGGAGGG 
TCAGTAAACT 
ACCAAATCAG 
TCGATAACCA 
GTCTAAGGCG 
AACTTGATGT 
TTCTGGAGCA 
CCTTGATACT 
AAAAAGAAAA 
TGGGAAATCA 
ATCAGAAAAA 
ACGACTTTGG 
CCTTATACCT 
AAGAATATCA 
GCAATCGTTG 
ACGATACAAT 
GCTGTGCAGC 
ATAGTGCCGT 
GGAGTAAATC 
TCGCCGGTGA 
AGGGAAGAGC 
TTAGTCTGCG 
GGTTAGATCC 
TCAATAGAGG 
CAAGTATATT 
AGCTGATCGT 
CCGATGACGA 
AATTCGAACG 
AAGAAGGTGC 
TGGATAGGCA 
AGGATTATCA 
AAGAGCAGAG 
ATATGTTGGA 
CGGAGCAAAC 
TCTACCTCTC 
GCACGCl'GGA 
GCGTTATCGA 
TCAATAAAAA 
CTGAACTGGA 
TCTACTATGA 
TTGTCGAAGC 
TCCTGGATCA 
TCAAGCCTTG 
CTCAAATAAT 
CATCAGCCCA 
TAAGAATAAC 



TCCTGCTCTA 

ACGTCTTACT 

CCTGGATAGT 

AGAGGGTCTG 

TAAACTAGAG 

AATCCGTAAG 

TAACCAAATC 

AAGAAGTAAC 

TGAACTATCG 

AACGCTTGAA 

GTTAACAAAG 

CACCTCGCTA 

ACGTCrCACC 

CCTGGAACGT 

AGAGGGCCTG 

TAAACTAGAG 

AATCCGTAAG 

CAACCAAATC 

TTCGGCCAAT 

AACTTTAGAA 

CTCTCCTTAT 

ACAGAAAAAG 

TTCTTCGGGT 

TACACATGTG 

GGGACAGGAC 

TCTCTTTTGG 

GACTCTTAAT 

TATGTCCGTT 

TATCATTCAG 

CGAGAATGGA 

ACATGCGCTC 

AATTCAGATC 

TAATAAACAC 

TGAAAATGAT 

AGGGGAGGTG 

GGCAGCTTTT 

AACAGTTCCT 

TGAAGAAGAT 

ATATGAAGAT 

AGCCTATAAA 

TTTTATCCCC 

TCTAAAGCGG 

AACGCTTGAC 

GATCTGGATC 

AAAGACATCA 

TATGTATi'GG 

GAGAAGCACT 

CTGTGCCCAA 

CGATGAAAGC 

TAAAGAGCGG 

TCTGGCTACT 

GACCTGTCTG 

TAAGTTGACT 

CGACTGTATA 

TGAGTTGCCT 

TACATTCGAA 

CAATCTTGGA 

TCGTGATGAA 

AAAACAGGAG 



AAAAAGCTTG 

TCGTTAACAA 

CTCACCTCGC 

GAACGTCTCA 

GGTCTGGAAC 

CTGGAGGGTC 

CGTAAGCTAG 

CAAATCAGTA 

GGTAACCAAA 

CTGTCGGGTA 

CTTCGTCTAA 

ACAAAACTTT 

TCGTTAGCGG 

CTCACCTCGT 

GATAGTCTCA 

GGCCTGGAAC 

CTGGAGGGTC 

AGTAAGei'GG 

GATATTCAAT 

AAACTGAGAA 

GATAATCATT 

ACTTCAGTTG 

AAAACAACAT 

TTGTCGATAC 

TATTATCATG 

GATGCTAAGA 

TTCAATCGCC 

GGAGGAAATC 

ACTCATGCCG 

GTATTGGAAG 

AACTATCTGA 

ACAGAAAAAG 

ATCCCTATCT 

CTl'TACACCA 

CTTTACTATC 

GTCCAAATGA 

AAAGACATTT 

GGCCAAAATG 

AAGGACTGCT 

TGGCTTACTT 

TTTGGCCTGA 

TATTGGCGAG 

CAAGAAGAAG 

AAGCTCGACT 

GCTAAGGATA 

AACAATATCC 

ATTCGTGAAA 

GCCGATAAAG 

AAGACTACGG 

GTGCGAGAAG 

GCAAAACAGA 

CAATTTTTCA 

AAGTTTGAAA 

ATCGCTTTGA 

GTATTTCGGG 

GACGATGAGT 

AAAGAGGGAA 

AATTGGGTGG 

GTAAATACAG 



ATCTATCGTA 
AACrrCGTCT 
TAACAAAACT 
CCTCGTTAGC 
GTCTCACGTC 
TGGAACGl'CT 
AGGGTCTGGA 
AGCTAGAGGG 
TCCGTAAGCT 
ACCAAATCAG 
GAAGTAACCA 
CTCTCTCCGA 
AGCTTTATCT 
TAACAAAGCT 
CCTCGCTAAC 
GTCTCACGTC 
TTGATGGTCT 
AAGGACTAGA 
CTATTGATGA 
TCCATGACAA 
TGCCGGAGAT 
AATATCACCC 
TTCTTAGTCA 
ATCGAAGCAA 
GGATTTACCA 
AGGATCGAAA 
CCTATTGGTT 
CTGATGGCAA 
ATGAAACGGG 
AAATCTATGT 
ATGAGCGGGT 
ATAAGGGATT 
CTCTCGAAGC 
TAGAGTATCT 
6TGAGAATGA 
TTCATGGAGA 
TTGAATGCAA 
GTAATATGAT 
ATGTGATACC 
TGGGATTCGA 
TCAACCAGAT 
ATCAGGTCAT 
AGAAAGAGGG 
TTACCGACTT 
TGCAGCGGAA 
CTCCGAGGGA 
CAAACAGAAA 
ATTTGACGGA 
CGAGGATTGC 
TATCGACTCG 
TCTTTATTTC 
AACCCTTGGA 
CACCTATTCA 
TCAGCCAACG 
AGTATAACAA 
TCCTTCGGGA 
AAACCATTAA 
CAGTAGTCAG 
ATGAA 



TAACCAAATC 
AAGAAGTAAC 
TTCTCTCTCC 
GGAGCTTTAT 
CTTAGCAACG 
CACGTCCTTA 
ACCTCTCACT 
TCTGGAACGT 
GGAGGGTCTG 
TAAGC^AGAG 
GATCAGTAAA 
TAACCAAATC 
TTTGGATAAC 
TCGTCTAAGA 
AAAACTTTCT 
CTTAGCGGAG 
TGCTTCCTTA 
CAGACTAAAG 
TATTAAGCTA 
TCCATTTGTT 
TAAAGCTCTT 
ATTTTGCAAA 
ATACGATACA 
TAACCCTAAT 
AGCCTTTTTT 
CTTTGTGAGC 
AGGACAGATA 
GGACACACCA 
CCCTAAGCAG 
ATCCTTAGAG 
GCGAGAAGTT 
GTACGAAGCT 
TCTTGCGGCT 
ACAGACCGAA 
GAAGCl'GAAC 
AATCCTCCAA 
ACTGCATAAT 
CTTGCAGCTA 
GGGCTATCTC 
GAGGCCCAAT 
TATAGCCTAC 
CTTCACAGCA 
rn^GCCCAAG 
GGCCATATCC 
AGAGGCTACT 
GCAAATAGGA 
GAAGAGACCC 
GTCTCATTAT 
AGCCTATCCG 
TCCCTACAAA 
CTATTCCAAA 
GAAGAATGGT 
CCCTGAAATA 
CTATCTGGCC 
GACCATAGTG 
GAAATATTTT 
AGCTTATGAT 
AGAGTTCAAA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
€00 
660 
720 
780 
840 
900 
960 
1020 
1000 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2S20 
2580 
2640 
27 00 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3525 



65 



70 



75 



(2» INEORl-lATION FOR SEQ ID NO: 254 

(11 SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 
to STRAIIDEDNESS: double 
(D) TOPOLOGY: circular 

(li) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) AlITT-SEJISE: NO 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ip^'i, iH--Mt ,Ai iF=^i r»7==i, t|-»qi- if=3ai ^t^^T'" 

■U.J ^^nis r3i iCji iC: Oi D ur iCP liT; iCU iUl iUi 



WO 99/29870 PCT/AU98A)1023 

174/ 490 

Ivt) ORIGIIIAL SOURCE: 

(A) ORGANISM: PORYPHyRaiONAS GIHGIVALIS 

(Ix) FEATURE; 
> (A) NA/IE/KEY: mlsc feature 

(B) LOCATION 1...687 

ixl} SEQUENCE DESCRIPTION: SEQ ID HO: 254 



?r?S^^^J^ ^t^T^^ TTTCGTACTA CTGGTTTGCC TATTCTCCTC GTTCAGCAGT 60 

CAACGACGAA CAGTAGCCGG AGTTATTTTA CAGGACGAAT CGAGAAGCTG 12C 

^^IITSS??^ TAGGGGTCCC CCCCGTAAGC ACAGAGGTTT GGGGAATGAC CCATGATGCG 180 

nT^I^fi^^ ACCTATCTCT TTCAGTCGTT TCAACAGCCA GGGAGATATA 240 

15 rJl^^^^^ GAATAGCGAG GCAACTTTGA ATGAATGGTG CGACTATGCA 300 

f^^f^f™^ GGTAGAAGGT CGTTTTTGGA AAATGACTTA CAACATACCA 360 

CAOTCTGCAC CCGGATTACA TTCGAAAATC AAGAAATAGA AGGAAC^ATC 420 

^JSJS^I^^: ^iS^^^^^ AGTCTCGCTG CCTCATGTGT CGGAATC6GT GCCTT^S^C 480 

n^^^ CCGGGAGGGA ATTTATCCTT TGCGAAGAAG ACGACACCTT TGTGTCTCA- 5 JO 

^0 G?G^^rr ^S^S^n^I AGGCGGTAAA CCTTTCTTGC TCAATACCAA CGTAAAGAT^ 600 

?G?^^ l^^L^ ^S'^^ GGGGTAGGAG AAATTCGATT CCTGCAGATT 660 



25 



30 



35 



40 



45 



65 



70 



t2) IMFORIIATIOM FOR SEQ ID HO:2SS 

(1) SEQUENCL CHARACTERISTICS: 

(A) LENGTH: 664 base pairs 
f3) TYPE: nucleic acid 

(C) STRAIIDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: PORYPHYRCMOHAS GIHGTVALIS 

(ix) FEATURE: 

(A) NAt-IE/KEY: nVec feature 
{□) LOCATION 1. . ,684 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255 

S^^S^"^^ CATTTG-iTTT CGTACTACTG GTTTGCCTAT TCTCCrrCGTT CAGCAGTTC- 60 

CGACGAACAG TAGCCGGAGT TATTTTACAG GACGAATCGA GAAGGTGAG? ifo 

50 I^S^SS^? S^rlfnSfSS CGTAAGCACA GAGGTTTGGG GAATGACCCA TGATG^GA^C llo 

DU GGTCTCCCTT TCGAAATACC TATCTCTTTC AG7CGTTTCA ACAGCCAGGG AGATATAGCT 2d0 
CcS^ TAGCGAGGCA ACTTTGAATG ^aJ^A^ 

^l?^^?^ TCGTGAGGGT AGAAGGTCGT TTTTGGAAAA TGACTTACAA CATACCAACC 360 

TCTGCACCCG GATTACATTC GAAAATCAAG AAATAGAAGG AACGATCGTC 420 

55 I^^In^^^ AGCCCAAAGT CTCGCTGCCr CATGTGTCGG AATCGGTGCC TTGCATCCGA 480 

r^S?^?-'^ GGAGGGAATT TATCCTTTGC GAAGAAGACG ACACCTTTGT GTCT^^ 540 

li^^f^ATAGG CGGTAAACCT TTCTTGCTCA ATACCAACGT AAAGATTGTG 600 

^?^AG ^A^^C^ GTAGGAGAAA TTCGArrCCT GCAGATTTGT 660 



60 

12) INFOW-rATION FOR SEQ ID NO: 256 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1620 basc pairff 
iB) TfVEi nucleic acid 
{Ct STRAJ^DEDNEflS: double 
(D) TOPOLOGY: circular 

(li) MOLECU'LE TYPE: DNA (genomic;) 

till) H-j-FOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

75 (vi) ORIGINAL SOURCE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



lUi -^f?!' .^3 q^ai ..j;^||j^*£nti 11:31 lui o iir; oj u ■ ■ 



wo 99/29870 PCT/AU98A>1023 

175/490 

iPii ORGASAISH: PORYPHYROHOMAS GTMGIVALIS 

(ix) FEATURE: 

(Al NAUE/KEV: mlsc_f eatuc* 
(B) UXIAriOII 1...1620 



10 



Ixi) SEQUENCE DESCRIPTION: SEQ ID NO: 256 

ATGAAGACAA A.i\GTTTTACG CAAATTCGTG GTGGCGGCTT TCGCCGTCGC AACCCTCTGT 60 

CCTCTCGCCC AAGCGCAGAC GATGGGAGGA GATGATGTCA AGGTG6TCCA GTACAATCAG 120 

GAAAAACTGG TACAAACCAG GATGAGTG-I'G GCGGACAACG GATGGATCTA TGTAATGACC 180 

CACAGTGGAT ACGACACCGG CAATAOCAAT GTGAAGATCT TCCGCTCCAA AGACCAAGGT 2<10 

GCCACATACC AAAAGTTGAG GGATTGGGAT CCATCGGATG ATTATCAGTT TCAAGACTTC 300 

1«; TAACGGGTAA GAATGAATCC GACATCAAGA TTTGGTCGGT AGAGCTCATG 360 

AATAAGCCCG GAGGATATAA GAGTAGAGTT GCGGTCTTCA GTCGCGATGC CAACGCGCAG 420 

AATGCGAAAC TCGTGTATAA GGAAGACTTC TCCAATGTGC AGTTGTACGA TGTGGATATA 4 BO 

GCCTCCAACT ATCGTTCGCC TTCTTCTCTT AACAATGGTG GCAACCCTTT TGCTTTGGCi' SJO 

TTCGCTTACA CCGGCTTCAA CAATACGCAC AAAATAAGTT TTGTGGACTA TGTGTTCTCT 600 

70 fi^lf:^'^ GGCAAAATTT CAATAAAAAC TTACTCTTCA GTCAAGATGG AGAGAAGAAA 660 

ATTGACAAGG TGGATCTCTC ATTGGGTAGC ACCTCTGAAT CCATGGGTCA CAATGCCTGG 720 

CCGCTAATGG GTGTGGTATT CGAAATGAAT AAACAAGGGG GAAAAAGCGA TATCGGTTTC 780 

TTGTCGAACT TTGTCGACAA TGATCCCGAA TTTCAGTGGT CAGGCCCTAT AAAAGTGAGT 840 

t^^^^''^'^-^ TCTCGTTCAG CCCCAAAATC CAAATGTTGC TGGACGAGGA TAACAATACG 900 

ATCAATGGGG AGAGTTGCCA CAACTTCATG ATTACGTACA GCGATTATGA TTCTGAATAT 960 

TCGGATTGGG ACATTCGGTA TGTATATCCC AAGAAATCGT TCAAGTATGA AAAAGGAAAA 10^0 

TGGATGATCT GGTGGAAGCT TTCCTTACAG CTTCGTACCA GAGTGAGACC 1080 

AACTCGGGGC TGGGGTATGA CAAGAACGCC AATCACTACC TGATTACATA TGCCAAAAAA 114 0 

^^^^n^^S 51^?'^^'^'' GCTGAAATAC CGCTGGGCCA ATTATGACAA GATTCATAAC 1200 

qn ^^J^r^^^ GGAGCGACAC ATTTACGTAT ACATCATCTG CCAATGCTCT CTACACACCT 1260 

^^l^'^'' TCAATCCGAC CAAGGGTCTC GTGTGCTGGT CATGGGTGGA ATATCTGCCG 1320 

^^S??"" TCGTTTGGTC TGATACGCAG TGGACCCATG CCAACGGTGT AGAAGACATC 1380 

GTAATGCAAG AAGGCAGCAT GAAGCTCTAC CCGAATCCGG CPCAAGAATA TGCTGTGATT 1440 

^^'^^^^'^ CGGCAGCAAA CTGCAAGGCT GTTGTTTACG ATATGCAGGG CAGAGTAGTC 1500 

35 ^^^^^SI*^ CTTTCTCCGG CAACGAATAC AGGCTGAACG TGCAGCACTT GGCTAAGGGT 1560 

ACGTACATAC TCAAGGTCGT ATCCOATACG GAGCSTTTCG TAGAGAAGCT CATCGTGGAA 162 0 



40 



45 



50 



60 



65 



70 



75 



iP.) INFORMATIOW POR SEQ ID NO: 257 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2313 base pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDHESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DNA {genomic) 
(ill I HYPOTHETICAL: NO 
tiv) AMTI -SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISI1: PORYPHYROr40t>lAS GIHGIVALIS 

55 (ix) FEATURE: 

(A| NAME/KEY: mlsc^feature 

(B) LOCATION 1 . . .2313 



(xi) SEQUENCE DSSCRIPTIOf-l: SEQ ID NO:257 



CGCTGATAGC 


GTTTGCTGCC 


60 


AATATGTCAA 


TCCCTTTATC 


120 


TGCCCAATGC 


GTTGATGAGC 


180 


TCGACAAAGA 


TTCGCGTTGG 


240 


GTTTCAGCCA 


TGTGAATCTG 


300 


TGGCCACTTC 


CGGCACATTC 


360 


AATATGCGCG 


CCCGGGAGAA 


420 


TGACCGTAAC 


CGAGCGGACr 


480 


TCCTGCTGAA 


CCTGGGACAC 


540 


ACGACTCCAC 


AGTCGTCGGC 


600 


TTTTTCGTCA 


GTATTTCGTA 


660 


AGAAGCAGCC 


TCCTATGACA 


720 


AGTACGACGG 


CTACAAGCGT 


780 


ACTGCGATCA 


GGGGGAAAAG 


840 


ATGCGCTCTA 


TAATCTGGAA 


900 



SUBSTITLrre SHEET (Rule 26) (RO/AU) 



« "!r:iJ rjii :c:i^ ^ :c:p to ui, u o ic:: o» O u 



WO 99/29870 



PCT/AU98/0ia23 



176/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



GCGGAGCAAC 
ATACGCTCTC 
GGCACACCGG 
AATATCCTAC 
ACGGCTCACG 
CTGCTCTGCC 
TACCGAGAGA 
GAGGGCGACC 
GATACGGCAA 
CCGATCCGTC 
TACGACAATT 
TTTGCCCACG 
TACAGACACT 
TTCCTCACTC 
GAGGGCAGTG 
CTGATGGGAG 
TATCATfTCGA 
GAAGCATGGC 
CCTAACGGCT 
ATGCTGGGAT 
TTCCCCCGAG 
ACGACCAATA 
AATAAAACAC 
CACCTCCGTT 



AAGAGGTG'i"r 
GC3CTATAGA 
ATGAAAAGAC 
AAGATGCCAA 
ACCGCTACAC 
TCCTCTATCC 
GCGGGTGGCT 
CCTCGCrTAT 
CGGCCTATGA 
CTGACAACGA 
CCGTATCGCA 
CACTTGGGCA 
ATTATAATAA 
CTTTCGATCC 
CTTATAACTA 
GAGCAAAGGT 
CCAACGAGCC 
GAACGCAGAA 
TGCCCGGTAA 
TCTACCCT^SA 
•ITAGGATTCG 
CAGAGAATCA 
TTCCGCATGG 
ACGAACTAAG 



CAAAAGTGTC 
ilCGTTGGGAG 
GATATTCTAT 
TGGAGAATAT 
CGTGTTCTCT 
GGAGAAGCAG 
GCCGAGATGG 
CGTCATCAAT 
AGCCATGAAA 
CGACTATCTC 
TCCGCTGGAA 
TAAAGAACAC 
GGAGTATGGT 
CAAACAGGGT 
TGCCTTTTTC 
TTTTTCGCAA 
GGACRTCGCC 
ATTGACCCGG 
TGACGATGCC 
CTGTCCGGGC 
GCTCAATCCG 
ACCGACAGAT 
AACAAGGCAT 
CAATCCTCCT 



GGAGGGAATC 
GAAGCCCTCG 
ACCGCACTCT 
CCTATrJ\TGG 
CTTTGGGACA 
TTGGATAT-3G 
GAGCTGTACG 
GACACTTGGC 
AAAAATGCTT 
ACCCTCGGCT 
TACTATCT'SG 
GCAGCTCTAT 
ATGCTGTGTC 
GAAAACTTCG 
GTTCCCCAC3 
AGGTTGCAGA 
TATCCTTr*C- 
GAGTTGATAG 
GGTACGATGA 
AGCCCCACCT 
CAGTATTATC 
TCCATTTACA 
ATCAGCCATG 
CGA 



CGGCCAAGCC 
GTACGGTGGA 
ATCACCTGCT 
GCAGTGGCAA 
CGTACCGCAA 
TACGGACACT 
GACAGGAGAC 
AAAGGGGCCT 
CTTCGGCAGG 
TCGTACCGCT 
CCGACTGGAA 
TCGGAAAACG 
CATTGCTGCC 
AGCCTAATCC 
ATATACAAG<3 
AAGTCTTCGA 
TCTTCTCCTA 
ACAAACATTT 
GTGCTTGGCT 
ATACACTGAC 
CTCAGGGGGA 
TCCy\TACGGT 
CCGATTTGGT 



TTTCTCCGCT 
AGTGGAAGGA 
GATACATCCG 
AACGGGTAAT 
TGTACACCCG 
GATCGACATG 
CCT6ACGATG 
TCGTGCTTTC 
AGCGACCCAT 
TCGCGAACAC 
TCTGTCCCGG 
CTCX3TTGGGC 
GGATGGATCA 
CGGTTTCCAC 
GCTTGCCCGG 
TGAAGGATAT 
TTTCCCCAAG 
TTGCAATGCT 
TGTCTATTCC 
CTCGCCGGTA 
GTTGATCATT 
TTCTCTTGGC 
GCGCTGCGGT 



(2) IHFORIIATia-I FOR SEQ ID NO: 258 

(i) SKQUEJJCE CHARACTERISTICS: 

(A) LEHGTH: 2328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDMESS: doubl- 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: DfJA (genomicf 

(111) HYPOTflETlCAL: NO 

(iv) Al^TI -SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGW/ISfl: PORT PHYRa lONAS GINGTVAI.TS 

(ix) FEATURE: 

(A) NAJIE/KEY: misc fsatur- 
(D) LOCATION 1...2328 



(>:1) SEQUENCE DESCRIPTION: SEQ ID NO:258 



ATGTGCAAAA 
TTTTCTCTCC 
GAAAAAGCCG 
GTGGGCAAAG 
ACATCGGTAA 
TCAGGCAATA 
GACAATATCG 
TATTTGGCAG 
ACCTACGCCA 
ATGCTGAAAG 
GCATGGACAG 
GGGGAAGACA 
TTTGAGCCGG 
GG7CTGTTCA 
CAATTTCCTA 
ATAGCATTGG 
GCCATGCCTG 
CACGGACGTC 
ATGGTGAAGC 
CAGITGGGCG 
CCCGATCACT 
GCGATAGCTC 
CGTGTGAAAG 
GGCAAGAACA 
GATGTAGCCA 
GTAA^TATCC 



TAAGATTCAG 
AAGCTCAAGA 
TAGAAACGCC 
AAGCTCCCCA 
GCAGCATAGC 
TCGACATCTT 
ATCTGATAGA 
GAGGATTCGG 
AGGGAACTAA 
AAGGGCAGCT 
CCTTGTCTTT 
TCTGTTTCCT 
AGCTATTGCT 
TCTGTGCCGA 
TAGCCGACGT 
GAGAAGAAGG 
TAGCATTCGA 
TGTATGCAGC 
TATACGACGG 
GCGGATTCAG 
TTTTTGTCGG 
GCTATTCGGG 
CGATTGCCTT 
TCTTCATGTA 
ATCTGGCCTC 
TTCACCGTAG 



CCTCTTGCAG 
GGAAGGTATT 
CAAGAAAGTC 
TGAGGCAAAG 
CTACTCCGAG 
GGACGAAGCA 
CAAAACGCTC 
CCTCTCCGTT 
GGTGACCGAT 
CTTCATCGGA 
GAATTTGCCG 
GCTCGCCGAT 
CTCTTCCTCC 
GAATCGAATT 
CCTTGGTGTC 
TTTGGCTTCA 
CGGACCGGGG 
CAGCGGACTC 
CAACCGATGG 
TTTCAATGAT 
TACATGGGGA 
AAACGAAACT 
TGACAATAAG 
CGATCCGCAG 
CTTCGGCAAT 
TGGCGGATCC 



GCTCTTGTCG 
TCGAATACCC 
TTTGCCGTAG 
ATCTTCGACC 
CAACTAAAAT 
GGCCGTGTGA 
AATCGCCTTT 
CTGGATGTCG 
GTGGCTAAGT 
AAAGAGACCG 
ATGGGCTCG'3 
GGCCGTGTAT 
GCCGATTCAC 
TATTTCATAG 
GGTGCCATGA 
CTTCTTCTC3 
GACAATGAYT 
TGGGGAACAA 
ACTAACTTCG 
GCTATCGA7A 
AACGGTCTGT 
GCTATCGCAG 
GGCAACCT-rr 
AGTAGCACAT 
ATGATTATCC 
ACGCGCAAAG 



TCTGCTTATT 
TCTTGGCl'AT 
CCAACKGAGT 
GTATCAGCGG 
CCTTGGTCAT 
CCAACGTACC 
TGATCGTAGG 
CCGAAGCTCG 
TGGACAATGA 
ATAACCTGCA 
TCACCGGTCT 
ATGTCGCTGC 
GACTGTATGT 
AAAAACGTCG 
ACGAAAGCAA 
CAGAGGGAAG 
TCTACGAGAT 
ACCTGATGGG 
ACAAGAAGAC 
TAGCTGTTTC 
TCGAATTCAA 
AATGTAATCC 
GGGGGACGCT 
GGCATTCTTT 
TACCCAACGG 
GTGTCTTGAT 



GTTCACCTCT 
CCACAAGACG 
ACTTT ACT CG 
ACTCAGCGAT 
ATACTATGCA 
TGCATTGAAA 
CAACAGGGCT 
CATACCGGCT 
TCGCTTGCTG 
AGATCCGGCC 
GGGCATTGTC 
AAACCAATCG 
GACGGATCGT 
CAAAACGACA 
TACGGCATAC 
TACGGCCGAA 
GCGGTTTAGT 
ACATGCCGGT 
CGTACAGGAA 
CAACGGAGAC 
GGATGGCAAA 
CGGAGATGCC 
CGGTGCCGTA 
CAGCTATCCG 
AGACAAAT GG 
CTTCAACGAT 



9eo 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

19G0 

2040 

2100 

2160 

2220 

2280 

2313 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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CGGGGTACAC CGGAAACGAC TTCGGACGAC AGCCATCTTT ACGTCGAGCA GTTTGTCAAT 1620 

CGCCTCGGGG CA'3CCATAGG ACATAAGACT ATCTATGCAA TGGCCGTCGA TCATAACGGC 16B0 

TCTGTCTGGA TGGGATCGGA TATAGGCATT TTCGGCGTCT ACAATGCAGC CGGAGTATTG 17 40 

TCCTCGACTT CTACCCCTAT CGCTGTTCGG CCGGTCGGAG GAGAAGAACC CAATTTGTAC 180O 

5 TATGTGCTGG ACAAGGTGAC GGTGACAGAC ATCGTCGTGG ACAAACTCAA TCACy\AATGG 1860 

GTTGCCACCC AAGGGACAGG ACTCTATCTC CTTTCGGAAG ATTGCAGTAA GATCCTCGCG 1920 

CAATTTACCG TAGAAAACAG CCCTTTGCTT TCTAACAACA TACTATCCCT GGCCTTAAAT 1980 

GACGATAACG GACTGCTGTA CATCGGTACG GCGGACGGAC TGATGACGTT OCAAACGGGT 204 0 

ACGGGGAGTG GATCAGCTTC CGAACTGGAC GGCGTCTATG TATACCCCAA TCCGCTAAGG 2100 

10 CCGGAATATC CCGATGGCGT CACCATTGCC GGACTGCAAG CCGGCTGTAG TGTCAAAATC 2160 

ACCGATACCA CCGGCAGACT GCTATACCAG ACTGAGAGCG TAACCACCGA AGTCAAATG-j 2220 

AATGCTCCAG GTGCCGATGG CAATAGGGTA GCTTCGGGCG TATATGCCGT TGCAGTGTAC 2280 

GATCCGGTAT CGAAAAAGTC CAAACTAA1T CGCTTCGCAG TGATTCGC 2328 

15 

(2) IlirORIlATIOH FOR SEQ ID HO: 259 



11 » SEQUENCE CHARACTF.RISTICS: 

(A) LEHGTH: 3474 base pairs 
20 (B) TYPE: nucleic acid 

{C) STRAIIDEDNESS: double 
tDI TOPOLOGY: circular 



25 



(ii) MOLECULE TYPE: DMA (genomic) 
(iii) HYPOTHETICAL: NO 



(Iv) AIJTI-SEMSt:: NO 

30 (vi) ORIGIHAL SOURCE: 

(A) ORGAWISM: PORYPHYROIIOHAS GTtJGIVALIS 



( Ik ) FEATURE : 

<A) MAME/KEY: ml .?c_f *ature 
35 (B) LOCATIOil 1. . .347 4 

(xij SEQUENCE DESCRIPTION: SEQ ID HO: 259 



ATGAAACGAA TACTTCCAAT AGTCGCATTC 

40 GCGAAAGCGC AACGAGCTAT GGGGAAGACG 
TGGGTCAAGA TACGTGTCGA TGCAAGTGGA 
GCCAATGGCT TCTCCGATCC GTCCAAGGTA 
CCCQAAGATC TGAGCCGGAT CACGACAGAC 
GGCAATGCGC TGTATTTCTA TGCCGTGGGC 

45 ACCACCATGG AGCATACGGT GAATACATAC 
GCTGCCGGAG CACCTTTGCA GATGTCCCAA 
TTGATCGACT ACTACGATGA GCTGATGCTC 
TCGG*JACGAG ATCTGTATGG CGAGTCTTTC 
CCTTTGAGGG GCAACACCCG CTCGTCTGGC 

50 AAGGCCAGAT CGGCCGGTGG CGGCCGTGAG 
TTCAGCGATC CTTTTTCCAT GACATCGAAT 
AAGCGTCGTC TCTATCACAG TACGCCGATG 
GCGAACTATA GCATGACAGG AGATGCGGTC 
AACGACCTCC GGTACGATGG CGCACCCATG 

55 TTGGGGGGCG AGTCCTGCCG GTTCGTTATC 
CAGGCCAATT CTTCCCTGAC AGCATCGCTT 
ATTGAGTTCU TGGCTCCGCC GAAGGGTCAG 
GTGGACTTGT CACAGGCTTC TGCTCCGGAG 
CATGGAGAGG AAATCCCTGA TCTGATCATT 

60 GATCGACTGG CCACCTATCG TAGAGAGAAA 
GAACAGGTGT TCAACGAGTT TTCGGGTGGA 
GCCAAAATGT TCTACGACAG ATGGAAGGCA 
CAAATGCTTC TCTTCGGTGA TCCGGCTCAT 
AAACCGTATC TCCAACAAAC GGAGTTCTTG 

65 GTAAACAGTT ATGTGACGGA TGATTACTTC 
AATATCGGTT GGCGCAATTA TAATATGGCT 
GAAGCTCGCA TCGCAGTGGA CAAGACCATC 
TGGCGTATTC GTGCCTGTTT TGCGGCAGAC 
CGTTTGATCG ATACCGTCAA GCGTTATGCT 

70 GACGTATATC CGCATGTCAT CGAGAACGGG 
ATGCTGGAAA CCCTTCAGTC GGGTATTATC 
GCCGGATG'3T CGGACGAGCA TTTGCTGACG 
CATATGCCCA TTTGGATTAC TGCCACGTGC 
TCGGCAGGGG AGGAGGTTTT CCTCCATGAG 

75 ACGCGTGTCG Tri'ACAATAC GCAGAATGAG 



CTTTCTCTCT TCCTTGCCCT TGCTTTGCCT 69 

GCCGACCGTT CGCTAATGGC TTCGGGACAT 120 

GTGTATCGCC TTACGGACGA ACAGCTCCGT 180 

GGTGTGTTCG GTTATGGTGG AGGGGTGCTT 24 0 

GATTTGCCTC CGGTACCGGT ACTCCGTCAG 300 

CCGGTGACAT GGTTCTACAA TCCGGCCAAA 360 

AGTACGCATG GCTACTACTT CCTGTCGGAT 420 

TATACGGGTG GAGGTGCGTC GGCCGAGGCT 4 BO 

CATGAACA'jG AATTGTATTC GCCCAAAGAA 54 0 

AGTGCAGTCA ATACGCGTAC GGTCAAGTTC 600 

GAACTCGGTA CCGTATTCTC ATACATAGCC 660 

ATGTCGCTCT CGGCGAATGG CATTCTGATC 720 

GAAGTCTCCA ATTCCTATTT GGCCGGCAAG 7 80 

AACAGCTTGG TCAATGAGTT GCGCTTGGAC 84 0 

AATCTGGATT TCATAGAGCT GGCTACACAG 900 

CATATCAGGC GGTTTTCCAA TTTGCCCGTT 960 

AGTGAGGTGC CGGAGTCTCT GGTGGTTTTG 102O 

GTTCCCGTTA AGACTGTCGG GGATAAGACC 1080 

GATCGTAGGA CTATCAATAC GTTTTATGCC 114 0 

ATCCTCGGAG CGGTACCCAA TCAAAACCTG 1200 

GTCTCTACTC AGGCGCTCCT CCTTGAGGCT 1260 

AACGGGCTGA AGGTTTTGGT CGTGTTGCAG 1320 

ACTCCCa=;TG CTACAGCATA CCGCCTCTTT 1360 

AATGCACCTG TGGGAGAGAC CTTCCCGATG 14 40 

GACAACAGGA AGGTCTCCGT AGCTTGGCAG 1500 

CTGACATTCC AAGCCGTCAA TTCGACGAAC 1560 

GGCTTGCTGG ATGATCAGCC GGCCTCGGTC 1620 

GTAGGGCGAT TCCCCGTACG TACTCCGGCC 1680 

CGATATGAGG AGGATCGAGA GAGTGGTGCC 174 0 

AACGGGGACA AGCACGCAAC CGAGACTTCC 18 CO 

CCTGCCATCA TGCCGGTACG CGCCTTTCAG 1860 

TTGCACAGCA TTCCGGGTGC AAAGAAAAAQ 1920 

CTGCTTAATT ATGCTGGTCA TGGCGGTCCT 198 0 

CTCAACGATA TACACAAATT CAATTATAAG 204 0 

GACTTTtSCCA ACTATGACAG TCAGACGACC 2100 

AAGAGT'MCA CTCCGATCAT GTTCTCGACT 2160 

r»AGATCrtATG GTTTTATGCT TCGGCGTATG 2220 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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178/490 



10 



15 



TTCGAGAAAG 
CAGGGGATGC 
GGTGATCCGT 
GGGCAGGATC 
CTGAAGGGTA 
TTCCTGACCG 
CTCTCTCTTG 
GACGGACTCT 
GGCCGGATCA 
TTCTCCATCA 
GAAATCATAA 
ACTCCTCTGT 
GTAGGGCATG 
GCATATTTCA 
GCTTTGGCCG 
GTCCATCATG 
ATTCTATTCC 
GGAAGCGATT 
TTGCCAGTCA 
ACCTCCAAAT 
TCTCCCGGAG 



CTAAGGATGG 
TCAGTACTGT 
CCGTGCGTAT 
CCGAAGGGCA 
AGGTAACCGA 
TCTTCGATGG 
TATATTATGA 
TCGAAACTTC 
ATCTTTATGC 
GAGTCCAACC 
GOrGCTTCCT 
TTATGGCCGA 
ATATTACGCT 
CAAGTTCGGC 
AAGGAGATCA 
ACTTTTCATT 
CGAATCCGGT 
TGAACGTGGC 
AGACCTATTC 
ACGGAGl'GAA 
GACAGACGGC 



GCGTTATCGT 
TTTCCCCGAT 
GAATCTTCCT 
GTATGGAACT 
TGAAAAGGGG 
CAGAAAGAAA 
CTATCCTAAC 
GTTTATCGTA 
TTATAACGAti 
GGGTATTCXrr 
CAATGACAGT 
AGTATTCGAC 
TTGTATCGAT 
TACGGATGCA 
TACTGCCCGA 
CAGAGTGGTA 
ACGCGAGAGT 
CCTGGAGATC 
GTCTTCCTAC 
GATCGGAAAC 
CrCCATGGCC 



ACGATGGGCG 
TCGATCAACC 
ACCCACAAAG 
ATTATGCTCA 
ACATTCGAC6 
ATGACAGCTT 
GTGATGTATG 
CCC3VAGGATG 
AGCACAAAGG 
GATGAGGTAA 
ACATTCCGAT 
TTGAATGGAA 
GGCCGTGCCG 
GGTGT6GGCA 
CTGACXOTTT 
GATGGC7\TTG 
GCTACGTTCC 
TATGACTTCA 
GGAGAACd'A 
GGATTCTACC 
AAGAAAATGA 



AGATTATCCG 
AGTTGAGTTT 
TGCAATTGAC 
AGTCTTTGGA 
AGACATTCAG 
TGiSAAGAGGA 
CCGGTATTGC 
TGAACTATTC 
CGGAAGCCAT 
CGGAAGATAA 
CGGGAGATGA 
TCTIATATCAC 
ACCTGACCTA 
CTATTCTCTT 
GGGACATTTT 
CTCCGGATGT 
GAATCTTCC7V 
CCGGTCGTCT 
TAGAGATCAA 
TCTATCGTrG 
TCGTGGTACG 



ATCGGCCAAA 
CTTTCTGATG 
CGCAATC3\AC 
ACGGGTAGCT 
TGGCAAGGTT 
GGGAAACGAT 
CGAGGTGAAA 
CGAGCACGAA 
GGGGGTAGAC 
TACACCGCCT 
GGTTAATCCT 
GGGTAGCGGA 
CAACCTCAAT 
CATGATACCG 
CAATAATGCC 
GGCTGATGTG 
CAATCGCCCC 
TGTGAACAGT 
GTGGGATCTG 
TGTGGTGAAC 
ACAA 



2280 

2340 

2400 

2460 

2520 

2S80 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

34 7 4 



(2) INFORflATIOtJ FOR SEQ ID NO: 260 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2883 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEWESS: double 
ID) TOPOLOGY: circular 

MOLECULE TYPE: DWA (genomic) 

aii» HYPOTHETICAL: No 

(Iv) AtJTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PORY PHYROI-IONAS GINGIVALIS 

I ) FEATURE : 

(AJ NAME/KEY: misc feature 
IB) LOCATION 1,..2883 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 260 



ATGAAGAAAC 
AJ\AGAAAAAC 
TACGAAAAAG 
AAGAACCTTG 
TCCCAATACA 
AGGGACACCA 
GGTGCTTTGG 
AGCGGAAACC 
AACATCAATC 
CAAGGTGAGA 
CGTATCTATC 
GAGCAGAACA 
GAAGTAGACG 
AAACTGATGG 
CGTATCAATC 
GGGACTGCGA 
GAGAATCTCG 
ACGTTGAGCG 
AACTACGAAC 
AAACAAGAGA 
CTTTTGATTT 
AAAGAGACGG 
CCTATCACCG 
TCGGACAGAA 
GTGAACCAGC 
AATGGCGACA 
GCTCAAGACA 
CCCTCCTACT 
GCCGGAGGGA 



TTTTCCCATT 
ACTCTGTAAC 
AACTCCCCAC 
TGGGACAAAT 
CAGATGCGAT 
TCGTAGCCGC 
CAGAAGCTTC 
AGAATAGGCC 
TTGAGTTAGG 
AAGAACTCGA 
GACAACGCAA 
ATATGGCAGA 
AAGAAAAAGG 
AACAGTTGTC 
TCAAACAAGG 
AAGAAATTAA 
AGCGTAAAAA 
ACAGCATGTC 
GAAACAAGTC 
AATCGAAGTT 
CTGTTCTGAC 
ATAAACTTCG 
TCATACAAGG 
CCGAGCTGCA 
TGTTCGATAT 
TCGTCTCCTT 
TAACCTTGGA 
TGCAAAAAAT 
GAGTGGTCAT 



ATTATTACTC 
TGAAATCCCC 
TGATTCTGTT 
GTTATTCGCG 
GCTATATCAC 
ACAAGCATGG 
GGATTATCAT 
TGCGATCAAA 
ATACCATGAT 
CAGTCCTCTC 
AGAATACGAC 
QAATCTGATG 
GGATTATCAA 
CGATCGATGG 
TAACGAAAGG 
TTCGACTTCA 
AGAATACAAA 
CATTGCGCAC 
CCAAAAA6AG 
TATCCTCTTG 
ATATGCATAC 
CTCCCGCTTT 
TTTGAATGAG 
CAAGATAATA 
TTGCAAGATC 
CGTACAGATT 
GCTACAACCC 
CATATCCAAT 
ATCTCTGGCA 



ATAITGTCGA 
CGAGAGAAAA 
AAACAGCTTT 
ATCGAGGTCG 
CAAAAAGGGT 
AATCATCTCG 
TACAAGGCTC 
GCCAGATCGG 
GAGGCCGAAA 
GGGCAAGCTA 
AAGGCTCGTA 
GGTATCGGAC 
AAGGCTTTGC 
CACTGGCTGA 
CTCTACCAGC 
CATCTGATAG 
CAAGCCCTCG 
AAGGTCAGCA 
CTTGAAGAAA 
AGCACTCTTT 
CGTCAGGGCA 
TTCACCGGTA 
AAAATGAGTT 
GACAGACACA 
AGAAGCGGAG 
CTCATCGATT 
GAGAGCAAAC 
CTTTTGTCCA 
AAAACCAAGA 



TTTTGGTCGG 
AGCGTATTAC 
ACGAGAACAG 
GCAAACGGAT 
TGAACGCTGC 
GAACGGATAG 
TTTCCTTGAT 
CGGCCCTGAA 
AGAATTTCCT 
TCAACTATGC 
CCTACTTCCT 
TCTGTAGCAT 
AAGAGTATGC 
ATTCCTGTAT 
ATITCATTTC 
AAATATACAA 
AAGCATTCTG 
GCATACAAGA 
TACAGCAAGT 
TTGCCCTTTT 
AGAAGCATAA 
TTACACACGA 
CAAGTCCTGA 
GTAGCCAl'AT 
TATCCACGCC 
CGTTTGCACC 
CTATTGTCGT 
ATGCCATCAA 
ATGAAAAAAA 



ATGTGGAAAA 
TGCATTGCTG 
CGTACAGAAC 
CCGTAATATG 
ATTGAATCTA 
CCGTCGTATC 

AGAATcrrrr 

CGGCATCGGC 
GAAAGCACTG 
GAACCTCGGA 
CTTGTCTCTG 
CAATCTCGGA 
CACGGCATAC 
CCCGATGGCA 
TTTGGCCGAA 
TCTTCAATAC 
TCTGAGCAAG 
AACGCGATTC 
AAGCAAGGCA 
CATCTCGATT 
CAAGCTGATC 
ATTTCCTACG 
TCTCCAAGCA 
GCTGAATTTG 
CGAATGGCGC 
ATACGCACAG 
GGACTTCGTC 
GTATTCTTTA 
TCTGATCATA 



60 
120 
180 
240 
300 
360 
420 
480 
S40 
600 
660 
720 
760 
840 
' 900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
IbOO 
1560 
1620 
1680 
1740 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



CGCGTTGCAG 

TTCTATCGAG 

AATATACTGG 

AGTGCCTTCA 

TGGCTACCCT 

ACATCTCCGA 

CTCGTCGAGG 

AATGTGCTAT 

GACATTATCA 

ATGAAGCAAT 

GAACAGGACA 

TCTCCGGAGG 

AAGTTCTATA 

GATGACAGCA 

CAAAATCCGG 

CTCAACAGAA 

AAGATAAAAT 

ATTGAGGCAG 

TGCTCTCCCT 

GTT 



ACAATGGCAT 
GACAGTCCGC 
TCGAAAACCT 
CCATCAGTAT 
CCTCCGATGA 
TGGTAGCAGC 
ACAATAAGGA 
CCGCCGCAAA 
TTACGGATAT 
CGCCTCTGCT 
GATTGGAAGG 
AGCI'TTTGAT 
TGCAAAAACT 
GTATGCAGTT 
ATTT7*TCCGC 
AGCTCACGAG 
TGGCCTGCAA 
GCTTTTCGGA 
CCCTXATATCG 



AGGAATACAT 
TACCGAAAAG 
TCGAGGTACG 
TCCTACACAA 
CATTGTCATG 
TCTGAATCAT 
TATCAACCTG 
CGGAAAAGAG 
TATGATGCCG 
CTGTCAGATT 
AATCAAAAGC 
GCGGATCGAG 
CATGCTGGAT 
TCTCCTTGCT 
TCAAGACTTG 
TGTCGTAGGT 
GCTCCTTGCC 
TCCGGCTTAC 
GCAAAAACTC 



AAAACTGATC 

CATGGATCAG 

ATCAAAGTGG 

AACCA6TCCT 

CCTGTCCACA 

CGCTTCGAGG 

CTCGTCAAAC 

GGTATAGCXrC 

ATAATGGATG 

CCCATTGTCG 

GGTGTAGTCT 

CAGCTTCTGA 

CGGAAGCCGG 

GCCAAAGATG 

GCCGAAAAAA 

TGCTCCACCA 

GATGAGAGCA 

TTCTCTCGCA 

CTTGCCATGC 



AGGCTCATAT 
GCGTCGGACT 
AAAGCCAGCC 
CTTCGGCAGA 
TCGCGCCCGA 
ACGAACGTCC 
TACTCCTTTG 
TOSCTACCGA 
GGATAGAAAT 
CTTTGACGGC 
CTTATCTATG 
AAGACCGTGA 
AGGAGGAGCC 
CAGTGTCCGG 
TGTGC7\TGAG 
TCGGCTACAT 
AAAACATCTC 
CCITCAAACG 
CGGGGAGCGA 



CTTCGACATC 
CTCGTTTACC 
GGGGAAAGGA 
GATTCTTCCT 
TGACTCACCX3 
GACCATACTG 
CGATCGCTAC 
GCATATTCCC 
GACAATCCGG 
CAAGAGTACC 
CAAGCCATTC 
CTTGCTCAAG 
TCAACCGATA 
TGGAATCAAA 
TCCATCCCAA 
ACACCAGATC 
CGACATTAGC 
CTACATGAAC 
CAAGGAGACA 



(2) lUFORI-IATION FOR SEQ ID HO: 261 

( 1 ) SEC2UEUCti CHARACTERISTICS : 

(A) LF.MGTH: 1668 base pairs 
IB) rfPE: nucleic acid 

(C) STRAt4DEDHESS: double 

(D) TOPOLOGY: circular 

(ii) nOLECULE TYPE: DHA (genomic) 

iiii) HYPOTHETICAL: NO 

Uv) Al^TI -SENSE: NO 

(vl) ORIGINAL SOURCE: 

(A) ORGANISH: PORYPHYRa-IOHAS GINGIVALIS 

lix) FEATURE: 

(A) NAME/KEY; miac feature 

(B) LOCATION 1...1668 



(xi) SEQUENCE DESCRIPTION; SEQ ID HO:261 



ATGAAAAAGC 
GCAATCGCCC 
Al'GCAACGAG 
TACGAACGCT 
AAAGAGCTCG 
ACCGTTATAA 
GCGAAAACTG 
AACAAAGTAG 
CCCAAATACG 
GGTGGCAACT 
GAGAACTCCT 
ACACATCATG 
GGCAAGTATT 
CACCAAGCCC 
AAGTACGAGG 
CTGAACAACA 
AACGTCTATA 
ACACCTTCGT 
TATCTCTATA 
GAAGCAGATG 
ATCAATGGTT 
ACTTATAGCT 
GACAATAGTG 
ACGTGTAT6A 
TGGTTCAACG 
TATCGGATAA 
GTACCAGGGA 
CTGGTTGTTG 



TTTTACAGGC 
AAACGCAAAT 
CATTCCAGGA 
CTGCAGCCGT 
CCAAGAACGA 
CCCAGTACAC 
ACTCTTACTG 
6TCTCGTGGA 
AAGCACAATA 
ACATGACGGA 
CTCTGTCTCA 
ATGTGGTACA 
TGGCACCGAA 
TGGAAGATAT 
TATATCGCGC 
GGGTATTTGT 
AGACGGCAAT 
TAGGAACAGA 
TCAAGCACTA 
TCGTCTCATG 
CCGGTAGCTT 
TTACAGGTCT 
GTCGCAAAGA 
ACGAAACCAA 
CCGGTCGTTC 
AGCTTTATAA 
CGAGTGTCT*!' 
AACGAAATGG 



TAAAGCCTTG 
GCAAGCAGAC 
AACGAATCCC 
TTTGGTACGC 
CAAGGTGATT 
CCAAAGCGGT 
GACACGCGAC 
CTTTATTTAT 
TCTGGGCATC 
CGGATATGGA 
AGCTCAAGTA 
AGATCCGAAC 
CAAAATCCTC 
GGCAGCCTAC 
TTTGGCCACC 
TCCTGTCAAT 
GCCCGGTTAC 
TGCCCTGCAT 
CCCGATACTG 
CGCCAATGCT 
TAAGGCTGCT 
TAACAAGAAT 
GACTTATCCC 
TACATGTACT 
AGAACTGGCl' 
CACCGCAGGA 
CAGTATGGAT 
AATCCGTGAG 



ATTCTGGCAT 
CGAACAAACG 
CCTGCAGGTC 
TACCCGTTCG 
ACCATTGTGG 
GTGAATCTCT 
TATACCGGTT 
AACCGCCCTC 
GAGATGl'TCG 
TCCGCTGTGC 
AATCAAAAGA 
GGCGAATATA 
ATCAGGAAAG 
TTCGCAGCAC 
AATGAACAAC 
GGCCCCGCCT 
GAAATTATAG 
TGTCGTACl'C 
GGCGAACAGG 
ACTATCTCGC 
GATATGACGA 
GATAAGGTAG 
TTTATCGGCG 
GTGACCXJGAG 
GTTTCGGTAA 
GAAGAAGTCG 
GTGTATTCTC 
ACAATGAAAA 



TGGGACTCTT 
GTCAATTTGC 
CTGTGCGTGC 
GTATCCCGAT 
CGAGTGAAAG 
CTAATTGCGA 
GGTTCGCAAT 
GTCCTAACGA 
GGATGAAGCT 
AGTCACATAT 
TGAAAGACTA 
TCAACCATGT 
TGCCTGACAA 
AGACCTGCGC 
CGTACACGAA 
CCGTGGACAA 
GTGTCAAAGG 
ACGAGGTAGC 
CAGGCCCTGA 
CGGTACAATG 
TGGAATCAAC 
AATACTATAT 
AACCTGATCC 
CTGCCAAAGC 
GTTTGAATAT 
CTGCAATGAC 
AGGCTCCGGG 
TTCTCAAA 



CCAACTGCCC 
AACAGAAGAG 
TATCGCTGAG 
GGAATTGATC 
CCAAAAAAAC 
TTTCATCATT 
GTACGATACG 
TGATGAATTC 
CAAGCAGACC 
CGCATATACG 
TCTCGGCATC 
GGACTCTTGG 
TCACCCTCAG 
ATGGGGAACG 
CTCTCTGATT 
CGATGCTCl'G 
GGCTTCACGA 
GGATAAGGGC 
TTATAAGATC 
TTACl'ATCGT 
AGGTCACTAT 
CTCTGCCGCT 
TTTCAAGTTT 
TCTTCGTGCA 
TGCCGGCACA 
CAAGGAATTA 
CACATATGTT 



I BOO 

1860 

1920 

19B0 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

264 0 

2700 

2760 

2820 

2880 

2003 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
15O0 
1560 
1620 
1668 
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(2) IlIfORIlATION EX)R SEQ ID HO:262 

(i) SEQUEriCt CHARACTERISTICS: 

(A) LENGTH: 12B4 base paire 
5 (B) TYPE: nucleic acid 

tCJ STRAl IDE Ot-I ESS: do\ible 
(D) TOPOLOGTY; circular 



10 



70 



(ii) MOLECULE TYPE: DMA (genomic] 
llli) IirPOTHETlCAL: HO 
ftv) AlJTl-SEllSE: NO 



3S 

(Ji) MOLECULK TYPE: DNA (genomic) 

(ill) HYPOTHETICAL: HO 

60 tiv) At ITl- SENSE: NO 

tvij ORIGIIIAL SOURCL: 

(A) ORGAIJISM: PORY PHYRaiOtiAS GINGIVAL! S 

65 (Ix) FEATURE: 

(A) HWIE/KEY: mis cofeature 

(B) LOCATION 1, . .930 



Ixi) SEQUEUCE DESCRIPTIOt^: SEQ ID NO: 263 



-J 



PCT/AU98/01023 



15 (Vlj ORIGIHAL SOURCE: 

(A) ORGANISM: PORYPHYROHONAS GINGTVALIS 

lix) FEATURE: 

(A) NAIIE/KEY: misc featur*r 
20 (B) LOCATION 1...1284 

(xi) SEQLIEUCE DESCRITTlOIl: SEQ ID NO: 262 

ATGAAACTTT CATCl'AAGAA AATCTTAGCA ATCATTGCAT TGCTGACGAT GGGACATGCT 60 

25 GTGCAG<3CAC AGTTTGTTCC GGCTCCCACC ACAGGGATTC GCATGTCTCT CACTACAACC 120 

AAGGCCGTAG GCGAAAAAAT CGAATTGTTG GITCATTCCA TAGAGAAG/"\A ACGCATCTGG IBO 

ATCGATCTCA ATGGGGATGC CACTTACCAA CAAGGAGAGG AAATAACCGT ATTCGATGAC 2-10 

GCATACCACG AATACACGAT CGGGACGCAA ACCCTCACTA TCTATGGTAA TACGACCCGA 300 

TTGGGCTGTC GATCTACCGG TGCAACGGCT GTCGATGTAA CGAAAAACCC TAATCTGACC 360 

30 TATCTCGCAT GCCCGAAAAA TAATCTGAAA TCATTGGACT TGACGCAAAA CCCAAAGCTG 420 

CTGCGAGTTT GGTGCGACTC TAACGAAATA GAAAGTTTGG ACCTGAGTGG CAATCCGGCT 4 BO 

TTGATCATCC TCGGCTGTGA CAGGAATAAG CTGACTGAGC TGAAGACCGA TAACAACCCC 54 0 

AAGTTGGCCT CTCTTTGGTG TTCTGATAAT AACCTGACGG AGTTGGAACT CAGTGCCAAT 600 

CCTCGTCTCA ATGATCTTTG GTGCTTCGGT AATCGGATCA CGAAACTCGA TCTGAGTGCC 660 

35 AATCCTCTAT TGGTAACACT TTGGTGCAGT GACAATGAGC TTTCGACCTT GGATCTTTCC 720 

AAGAATTCGG ACGTrGCTTA CCTTTGGTGT TCATCGAACA AACTTACATC CTTGAATCTG 780 

TCGGGGGTGA AGGGACTGAG TGTTTTGGTT TGTCATTCCA ATCAGATCGC AGGTGAAGAA 84 0 

^ ATGACGAAAG TGGTGAATGC TTTGCCCACA CTATCTCCCG GCGCAGGCGC TCAGAGCAAG 900 

TTCGTCGITG TAGACCTCAA GGACACTCAT GAGAAGAATA TCTGTACCGT AAAGGATGTG 960 

40 • GAAAAAGCTA AAAGTAAGAA CTGGCGAGTA TTTGACTTCA ACGGTGATTC TGACAATATG 102 0 

CTTCCATACG AAGGAACTCC GACATCGAAC TTGGCAGTAG ATGCTCCCAC TSTCAGGATA 1080 

• TATCCCAATC CGGTAGGAAG ATATGCGCfC GTCGAGATCC CCGAGTCTCT TTTAG'SGCAG 114 0 

GAAGCTGCTT TATACGATAT GAATGGGGTA AAAGTCTATA GTTTCGCGGT AGAGTCTCTT 1200 

CGTCAGAACA TTGACCTGAC ACATCTTCCC GACGGCACTT ATTrCTTCCG TCTCGATAAC 1260 

45 TATACCACTA AGCTCATCAA ACAG 1284 

(2) INFORi-lATIOn FOR SEQ ID HO: 263 

50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 base pairs 

(B) TYPE: nucleic acid 
to STRAJIDEDNESS: double 
(DJ TOPOLOGY: circular 



ATGAGAAAAA CAATAATTTT CTGCTTGTTG CTCGCCCTAT TTGCCTGTTC TTGGGCACAA 60 

GAAAGAGTCG ATGAAAAAGT ATTCTCCGCA GGAACAAGTA TTTTTAGG&S CATCCTTGAA 120 

AAGGTGAAAG CACCGCTTAT GTATGGAGAT CGTGAGGTAT GGGGTATGGC TCGTGCGAGC 180 

GAGGAT'ITCT TTTTTATACT TCCCGTTACG GATGACCTCA CTCCCGTGCT TTTCTATAAC 24 0 

75 CGTCTTACAA ACGAACCCTG CTTTGTCTCA GACCAAGGAA TAACTGAGTA TTTCAAATTC 300 
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GCTCAAGAAG 
TACTATCGTT 
AGCAAGACGG 
GAAATCTCAA 
GACGGGATGG 
CjTGTATGGTC 
CCAACCAATT 
AAACAACAAG 
TACGATCTTG 
TCCTTTAGAG 
ATCAATAAAA 



GTGATTACAT 
TTTTCCCGAC 
GAAATCCTGC 
ACAACCGCAA 
AACCTCCGAT 
ATGTCTCACA 
GCTATCCGCT 
GCAGGCAAAT 
AGGGGAAAAG 
CACCCATGCT 
AACTTAATGT 



TGAAGTCGAA 
AAGAATTACC 
TTTTACAATC 
AGTCTTTCTG 
TATTGCCGGA 
GCGGTGGGAC 
CAGCACGAAA 
AGAGATCGAT 
TGTTTTTCGC 
CGGCTTTATG 
TACACAGCTA 



GGAAGCTCTG 
TCCTATAATG 
CCGATGCTCC 
ACCAATCAAT 
GTCTCTGCTT 
ATCATAGGCC 
CCCGTTGCAG 
AGCAACAGCC 
AAAAGAATGA 
ACCTVTCATGA 



TATTCATGGC 
CTCCCATTGA 
CGGGGGTTTC 
TAGGGGTTGT 
CCTATGGATC 
ATTGCTATTT 
GAGACGATGA 
CCATAGTCCA 
CCGAAAACGC 
TCGAAACACA 



GAATCTTTTC 
AGGTGTTGTG 
TGATTGCATA 
AAACATCACT 
TTCCGTCCGG 
GGATATCTAC 
GGTTTTTGTC 
AGTGGTCGTA 
TTATACCCTA 
AAATTCGATT 



360 
420 
480 
540 
600 
660 
720 
7 80 
840 
900 
930 



15 



20 



25 



30 



35 



40 



45 



50 



55 



(2) IHEX3RIIATI0N i'OR SZQ ID NO: 264 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 1215 base pairs 
(B» TYPE: nucleic acid 
(C) STRAW DEDIIESS: double 
(D» TOPOLOGY: circular 

til} MOLECULE TYFE: DIIA (genomic) 

UiiJ HYPOTHETICAL: liO 

(iv) AJITI-SEIISE: NO 

tvl) ORIGINAT. SOURCE: 

(A) ORGANISM: PORYPHYROMOfJAS GIHGIVALIS 

tlx) FEATURE: 

(A) NAI-tE/KEY: misc_feature 

(B) LOCATION 1...121S 

(xi) SEQUEl'lCE DESCRIPTION: SEQ ID NO: 264 



ATGAAAAAAA 
CAAACCAAGG 
TACTCTCTCC 
ACCGTCTCCA 
ATCGCTCGTG 
GCCAACGGGC 
CTCGACACCC 
TTCATGCTGG 
CAGGGCAGAC 
TATCACTATA 
ACCTATTCAG 
GCCGAGTACT 
ACCTATGAGG 
TACAAGCGAG 
ATGCCGGAAC 
TATTCCTCCT 
GCTCTTACCT 
TCAACGGACC 
CAGGGTAAGC 
TCTTTGACCA 
AAAGTCGTGA 



C7WCCATTAT 
ACAATTCTTC 
CGACTCAAAA 
CJ\AACCAGTC 
ACATAAAAGC 
GAATATCGCA 
GATTCAAGTA 
TAATGGATCC 
TGACCCACTG 
ATGAAAAAGG 
ACACCGGCAA 
TCGTCGTCCA 
ACAATATCTG 
ACATCGAGAG 
AGACCTGGCC 
ACGAAGGAGA 
CAATGGCAAC 
GGTTAGTGAT 
TTATCCGTGA 
AAGGGACATA 
TTCGA 



TTCTTTGATT 
TTACAAACCT 
TCGTGCGCAG 
TGCAGATACT 
CAATGGGGTA 
TGTGGATCTT 
CACCTATGAT 
GAATACACCT 
GATTTCTTTT 
TCTGTTGGTC 
AACGGAATAC 
GCAAGGAAAG 
CATACAATAT 
CGATAAGTCC 
TAATATGTAC 
TGTGGCTACT 
ACCTTCGACA 
TCTGGCCAAC 
TTGTGCCTTG 
CCTGCTTAAA 



GTCTTCGGTG 
TTTTCGAAAG 
AAGGACAATG 
CACTTTATCT 
AGAAAATCCA 
TATATCTCGT 
GACGAGGGAA 
ATCTCACGCT 
GCTTTCGGGG 
AGCGAAGTGC 
AGCTATGACG 
GCATGGCAAG 
TTGGCTATTA 
ATCTCCGCAA 
GGATTCAACG 
CCTATATTCG 
GAAGCTCAGG 
GGCATCACAC 
ACCGGCCATA 
GTGAATACGG 



CTTTCTTTGC 
AAGATATTGC 
CCGAGTGGCT 
TCGATGAGAA 
CGGACTCCAT 
TCAGTGGCGG 
AGATGACCGT 
TGGAATATCA 
CAGAATCCCA 
TGAGCAATGC 
ATGCAGATAA 
TACTCAAAAG 
ACGGTACC6A 
ATGTCATTGA 
CAAAGCGACT 
ACTATATCTA 
TAGCAGTCTA 
ATCTGAGCAT 
AGGTGGAAAT 
ATCAGGGAGC 



AGCCGTGGGC 
CGGAGGAGTT 
TCTTACAGCG 
CAACCGCTAT 
TTACTACGAT 
AGAGCCTGCA 
GAGGGAAGTA 
TTATGATGCA 
AAAGAATACG 
AATGGGGACA 
TATGGTGAAG 
AGAGGAATAC 
CACAAAGGTG 
CATTCCGTCA 
GAAAGAGACT 
TACGTACAAG 
TCTCAATCCG 
GTACGACITG 
GGGTGTCGGA 
CTTTGTGAGA 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
640 
900 
960 
1020 
1080 
1140 
1200 
1215 



60 



65 



70 



75 



(2) IIIFORHATION FOR SEQ ID N0:265 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protsin 

(ill) HYPOTHETICAL; YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAltlSf'I: Porphyromonas gingival is 

(Ix) FEATURE: 

(A) NAME/KEY: mlsc feature 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(B) LOCATION 1. . ,45'1 

(xil SEQOEIICE DESCRIPTION: SEQ ID Ha:265 

Phe Cys Val Met Ala Lys Val lie Lys Thr Lys Lys Gly Leu Ala Leu 

^ S 10 15 

Acn Leu Lys Cly Lya Pro Leu Pro Glu Met Leu Ala Glu Pro Ala Gin 

20 25 30 

Ser Pro Thr Tyr Ala Val Val Pro Asp Asp Phe Glu Gly Val II- Pro 

3S 40 45 

Lys Val Thr Ala Arg Pro Gly Asp Lys Val Arg Ala Gly Ser Ala Leu 

SO 55 60 

Met His His Lys Ala Tyr Pro Glu Met Lys Phe Thr Ser Pro Val s-r 

"JO 75 80 

Gly Glu Val He Ala Val Asn Arg Gly Ala Lys Arg Lys Val Leu Ser 

85 90 95 

He Glu Val Lys Pro Asp Gly Leu Asn Glu Tyr Glu Scr Phe Pro Val 

100 lob lio 

Gly Asp Pro Ser Ala Leu Ser Ala Glu Gin He Lys Glu Leu Leu Leu 

lis 120 125 

Ser Ser Gly Met Trp Gly Phe He Lys Gin Arg Pro Tyr Asp II* Val 

130 135 140 

Ala Thr Pro Asp He Ala Pro Arg Asp He Tyr He Thr Ala Asn Phe 
150 355 160 

Thr Ala Pro Leu Ala Pro Asp Phe Asp ?he He Val Arg Gly Glu <llu 

165 170 175 

Arg Ala Leu Gin Thr Ala He Asp Ala Leu Ala Lys Leu Thr Thr Gly 

IflO 185 190 

Lys Val Tyr Val Gly Leu Lys Pro Gly Ser Ser Leu Gly Leu His Asn 

195 200 205 

Ala Glu He Val Glu Val His Gly Pro His Pro Ala Gly Asn Val Glv 

210 215 220 

Val Leu He Asn His Thr Lys Pro He Asn Arg Gly Glu Thr Val Trp 
225 230 235 240 

Thr Leu Lys Ala Thr Asp Leu He Val He Gly Arg Phe Leu Leu Thr 

245 250 255 

Gly Lys Ala Asp Phe Thr Arg Met He Ala Met Thr Gly Ser Asp Ala 

260 265 270 

Ala Ala His Gly Tyr Val Arg He Met Pro Gly Cys Asn Val Phe Ala 

275 280 285 

Ser Phe Pro Gly Arg Leu Thr He Lys Clu Ser His Glu Arq Val He 

290 295 300 

Asp Gly Asn Val Leu Thr Gly Lys Lys Leu Cys Glu Lys Glu Pro Phe 

? c 320 
Leu Ser Ala Arg Cys Asp Gin He Thr Val He Pro Glu Gly Asp Asp 

32b 330 335 

Vsl Asp olu Leu Phe Gly Trp Ala Ala Pro Arg Leu Asp Gin Tyr Ser 

340 345 350 

Met Ser Arg Ala Tyr Phe Ser Trp Leu Gin Gly Lys Asn Lys Glu Tyr 

355 360 365 

Val Leu Asp Ala Arg He Lys Gly Gly Glu Arg Ala Met He Met Ser 
370 375 3g(j 

Asn Glu Tyr Asp Arg Val Phe Pro Met Asp He Tyr Pro Glu Tyr Leu 
390 395 400 

L-u Lys Ala He He Ala Phe Asp He Asp Lys Met Glu Asp Leu Gly 
405 410 *^ ^ 

He Tyr Glu Val Ala Pro Glu Asp Phe Ala Thr Cys Glu Phe Val Asp 

420 425 430 

Thr Ser Lys He Glu Leu Gin Arg He Val Arg Glu Gly Leu Asp M-t 

435 440 445 

Leu Tyr Lys Glu Met Asn 
450 



(2) IHFORf lATIOH FOR SEQ ID NO: 266 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

{ii} MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
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183/490 

(A) ORGAI-IISM: Porphyromonas glngivalis 

(ix) FEATURE: 

(A) NAUE/KKY: inisc_f eatuxe 

(B) LOCATION 1...201 

(x±) SEQUENCE DESCRIPTION: SEQ ID NO: 266 

Glu Leu Ser r.ys Cyn Tyr Met Asp Lys Val Ser Tyr Ala Leu Gly Leu 

^ 5 10 15 

Ser lie Gly Asn Asn Phe Lys Ser Ser Gly lie Asp Ser Val Val M-^t 

20 25 30 

A£p Asp Ph« Hot Gin Gly Leu Ser Asp Val Leu Glu Glu Lys Ala Pro 

3S 40 4S 

Gin Leu Ser Tyr Asp Glu Ala Lys Arg Glu lie Glu Ala Tyr Phe Met 

50 55 60 

Asp Leu Gin Gin Lys Ala Val Lys Leu Asn Lys Glu Ala Gly Glu Glu 
6S 70 75 80 

Phe Leu Lys He Asn Ala His L^'s Glu Gly Val Thr Thr Leu Pro Ser 

85 90 95 

Gly r^u Gin Tyr Glu Val He Lys Met Gly Glu Gly Pro Lys Pro Thr 

100 105 110 

Leu Ser Asp Thr Val Thr Cys His Tyr His Gly Thr Leu He Asn Gly 

115 120 125 

He Val Phs A^p Ser Ser Met Asp Arg Gly Glu Pro Ala Ser Phe Pro 

130 135 140 

Leu Arg Gly Val He Ala Gly Trp Thr Glu He Leu Gin Leu Met Pro 
150 155 160 

Val Gly Ser Lys Trp Lys Val Thr He Pro Ser Asp Leu Ala Tyr Gly 

165 170 175 

Asp Arg Gly Ala Gly Glu His He Lys Pro Gly Ser Thr Leu He Ph- 

ISO 195 190 

He He Glu Leu Leu Ser He Asn Lys 
195 200 

(2) lUFORTlATIOH FOR SEQ ID NO;267 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 9 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

[vi) ORIGINAL SOURCE: 

<A) ORGANISM: Porphyromonas glngivalis 

(ix) FEATURE: 

(A) NAME/ KEY: roisc_£eaturci 

(B) LOCATION I... 279. 

Ui) SEQUENCE DESCRIPTION: SEQ ID NO: 267 



Glh 


Lys Asn 


Lys 


Arg 


Lys 


Met 


Lys 


Lys 


Ala 


Leu 


Leu 


He 


Gly Ala 


Ala 


1 








5 










10 








15 




Leu 


Leu 


Gly Ala 


Val 


Ser 


Phs 


Ala 


Ser 


Ala 


Gin 


Ser 


Leu 


Ser Thr 


He 


Lys 


Val 




20 










25 










30 


Gin 


Asn 


Asn 


Ser 


Val 


Gin 


Gin 


Pro 


Arg 


Glu 


Glu 


Ala Thr 


He 


Gin 




35 










40 








45 






Val 
50 


Cys 


Gly 


Glu 


Leu 


Ala 
55 


Glu 


Gin 


Val 


Asp 


Cys 
60 


He 


Gly Thr 


Gly 


A«n 
65 


Ser 


Ala 


He 


He 


Ala 
70 


Ala 


Ala 


Ala 


Lys 


Phe 
75 


Glu 


Ser Asp Asp 


Leu 
80 


Glu 


Ser 


Tyr 


Val 


Gly Trp 


Glu 


He 


Met 


Ser 


Vul 


Asp 


Pho 


Phe Pro 


Gly 


Tyr 


Lyif 






es 










90 






95 




Ala 


Cys 


Lys 


Tyr 


Thr 


Ser 


Ala 


Val 


Trp 


Ala 


Asp 


Asp Met 


Thr 


He 






100 










105 










110 




Leu 


Gly Gin 


Ser 


Glu 


Asp 


Ser 


Asp 


Pro 


Glu 


Met 


Gin 


Thr He 


Asn 


Asn 




115 










120 










125 






Lou 
130 


Ala 


Leu 


Lys 


Thr 


Ser 
135 


Val 


Lys 


He 


Glu 


Ala 
140 


Gly 


Lys Asn Tyr 


He 
145 


Val 


Gly 


Tyr 


He 


Ala 
150 


Asn 


Thr 


Ala 


Gly 


Gly 
155 


His 


Pro 


He Gly 


Cys 
160 
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A«p Gin Gly Pro Ala Val A^p Giy Tyr Gly Asp Leu Val s«r He Ser 

Glu A,p Gly Gly Ala Thr Ph« Pro Pro Ph" Glu Ser Leu Hi, g" Ala 
c IOC* 185 1**0 

Val Pro Thr Leu Aan Tyr A^n He Tyr Val Val Val His L^u Lys Ly^ 

200 205 
Gly Glu Gly Val Glu Ala Val Leu Thr A,n A*p Lys Ala A,n Ala Tyr 
21b 220 
10 «s Gly Arg Gin Val 

S«r Leu Phe Asp Met Asn Gly Lys Val Val Tyr Thr Gly Val Ser Glu 



250 255 
<^1" Gly M-t l-yr He Leu Arg Val 

15 Lys ser He Ly^ Leu Ala He 

275 



30 



35 



55 



60 



65 



70 



Thr He Ala Ala Pro Gin Lys Gly Met l-yr He Leu Arg V.l G^y Ala 



(2» INFOWlATlOtJ FOR SEQ ID NO:268 

(i) SEQUENCE CHARACl'ERISTICS: 

(A) LENGTH: 157 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



25 (H) IIOLECUT.F. TYPE: protein 

(iiij HYPOTHETICAL: YES 



(Vij ORIGINAL SOL'RCE; 

lA) ORGy^NISM: Porphyromonas gingivalis 
(ix) FEATURE: 

<A) NAUE/KEY: misc feature 

<B) LOCATIOH X..,15^7 

(Xi) SEQUENCE DESCRIPTIOtJ: SEQ ID IIO:268 



Arg Unk Phe Leu Pro Glu Lys Ala Leu Tyr lie Gly Cys Arg Val Glu 

40 Thr Gin Glu Gly His Ala Val Gly Phe G^y Leu Asp Asp Gly Pro Ala 

Met Lys Gly Lys GJ y Asp Leu Val "y Ser Tyr Leu Pro Giy Ala Ala 

Pro Met Pro Phe Val Pro Leu Ser A^p He Pro Ala Arg Ser Het A^p 

Ala Asn Phe Tyr He Tyr ler Arg He Ser Leu Giy Ser Gly Thr Gin 

A^p Val Leu Gin His Arg Met Lys Val Tyr Pro Asn Pro Aia Thr Thr 

Glu Leu Hi., val Glu Ala Leu Ser Ala ^rp Val Giy Glu Gin ^a Ala 

105 Tin 
Val Tyr A^p Met Arg Gly Arg Arg Val Ser Ala Arg Thr Val A,p Ser 

Glu Ly^ Leu Cys He Asp lie )ll Ser Leu Pre Val III Vol Tyr Het. 

135 1 jn 

L8U Arg He Gly Ser Tyr Ser Ala I.y, Phe Glu Lys Arg 

150 



45 



(2) lNFOR^rAT^OtI for SEQ ID MO: 269 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 562 amino acids 
{RJ TYPE: amino acid 
(D> TOPOLOGY: linear 

(il) MOLECULt; TYPE: protein 

(iii^ HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISII: Porphyromonas gingival 

(ix) FEATURE: 



71- (A) NAME/KEt: misc featur" 

[B) LOCATION 1.,.562 
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10 



15 



20 
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(xi) SEQUEIICE DESCRIPTION: SEQ ID NO: 269 

Thr Arg Asn Val Ser Leu lie Lys Met Pco Arg lie MeL Lys Leu Lys 
^ 5 10 15 

lie Ala Leu Ary Leu Leu Leu Ala Thr Phe Ala He Val Leu Phe Ser 

20 25 30 

Pro Leu Ala Lys Ala CJln Met Asp lie Gly Gly Asp Asp Val Leu Tl- 

35 40 45 

Glu Thr Het Ser Thr Leu Sec Gly Tyr Ser Glu Asp Phe Tyr Tyr Lys 

50 55 60 

Met Ala Val Ala A*p Asn Gly Trp He Tyr Val Met Leu Asp Phe S-r 
€5 70 -75 Qo 

Arg He Tyr Phe Asp Asp Val Arg Leu Tyr Arg Ser Lys Asp G^ y Gly 

85 90 95 

Ala Thr Tyr Gin Lys Leu Gly Ser Leu Gly Ser Leu Val Pro Tyr Asd 

100 105 110 

Phe Asp Val Ser His Cys Asp Phe He Val Thr Gly Lys Asp Glu Asp 

115 120 125 

Asp He Asn Val Trp Thr Val Met Thr Ala Phe Glu Tyr Val Glv Giv 

130 135 140 

Thr lie Gly Asn Gly Val Leu Leu H-t His Arg Jlis Asp Ala Asp H« 
150 155 160 

Asn Asn Thr Glu Cys Val Tyr Lys Lys Asp Phe Pro Asn Asn Arg Leu 

165 170 175 

Met Gly Val Ala Tift Ala Ser Asn Tyr Arg Ala Pro Ser Pro Tyr Gly 

180 105 290 

Leu Gly Gly Asp Pro Phe Ala Leu Ala Val Ala Val Ser Gly Ser Glv 

195 200 205 

Ser Asp His Ser Phe Leu Asp Tyr He the Ser Leu Asp Gly Gly Val 

210 215 220 

His Phe Glu Gin Lys Arg He Tyr Thr Arg Pro Gin Lys Leu Thr II* 
225 230 235 240 

Asn Arg Val Asp Leu Ser Leu Gly Ser Thr Ser Pro Ser Leu Gly Phe 

245 250 255 

Asn Thr Trp Pro Leu Het Gly Val Val Phe Glu Het Asn Lys Asn Leu 

260 265 270 

Asp Gly Ph« Asp He Gly Phe He Ser Asn Phe Val Asp Tyr Asp Pro 

275 280 285 

Arg Tyr Ala Trp Ser Glu Pro He He He Glu Glu Asp Cys Glv Tro 

290 295 300 

Thr Asp Phe Asn Pro Leu Gly Ala Leu Ser He Glu He Gin Het Het 

T . . 320 
Leu Asp Asp Asn Ser Asp Asn Thr Val Gly Gly Glu Arg Ser His Asn 

325 330 335 

Phe Leu He Thr Tyr Pro Gly Jlis Tyr Val Tyr Pro Lys Gin Ser Phe 

3*10 345 350 

A.?n Tvr Ser Pro Gly His Thi Pro Thr Lys Lys Asp Leu Val Phe Ly- 

355 360 365 

His C-ys He Gly Ho Pro Ala Leu Ala Tyr Asp Lys Glu Gly Asp Arg 

370 375 
Tyr Leu Thr Thr Phe Gin Asp His Asn Leu Met Arg Tyr Arg Trp He 
f®^ 390 395 ^ y P 

Lye Tyr Asp Asp He Asn Ser Phe Tyx Gly Trp Ser Trp Pro Tyr Val 

405 410 415 

Tyr Ala Lys Glu Ala Lys Asp Lys Lys Arg Arg Arg Pro Gin Val Ala 

420 425 430 

Leu ^^sn Pro Thr Asn Gly Lys Ala Cys Trp Val Trp His Thr Arg Ly.*. 

435 440 " ' 

Ser Pro Tyr Asp Glu Thr Lys Pro His Pro Thr Pro Val He Tie Lys 

455 460 
His Phe Leu Trp Ser Asp Thr Glu Trp Val His Ala Leu Asp Val Gly 
^''O 475* 480 

Asp Val Leu Gin Lys Glu Gly Ser Met Lys Leu Tyr Pro Asn Pro Ala 

485 490 495 

Lys Glu Tyr Val Leu He Asn Leu Pro Lye Glu Gly Gly His Glu Ala 

500 505 510 

Val Val Tyr Asp Met Gin Gly Arg He Val Glu Lys Val Ser Phe Ser 

515 520 525 

G)y Lys Glu Tyr Lys Leu Asn Val Gin Tyr Leu Ser Lys Gly Thr Tyr 

530 535 540 

Het Leu Lys Val Val Ala Asp Thr Glu Tyr Phe Val Glu Lys He H« 

vIlGlu 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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(2) INFX5R»ATI0N FOR SEQ ID NO: 270 

il) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

fli) MOLECULE TYPE; protein 

(ill) HYPOTHETICAL: YES 



(Vi) ORIGINAL SOURCE: 

ORGA^]ISI■^: Porphyromonas ginglvalis 

(ix) FEATURE: 

(A) NAME/KEY: mlsc £eatur» 

(B) LOCATION 1...391 

20 (XI) SEQUENCE tJESCRIPTIOtI: SEQ ID HO:;?70 

Gin Met Lyc Arg Leu Leu Pro Phe Leu Lsu Leu Ala Gly Leu Val Ala 

10 15 
25 ^.^'^ ^^"^ Pro Arg He Pro Gin Val Asp Va i 

20 p5 30 

His Thr Arg He Ala Arg A^n Ala Aig Tyr Arg Leu Asp Lys Tie s.r 

4 0 4 5 

Val Pro Asp Ser Arg Gin He Phe Asp Tyr Phe Tyr Lys Giu Glu Thr 

iJU lie Pro Thr Lys He Gin Thr Thr Thr Gly Gly Ala He Thr Ser He 

Asp Ser Leu Phe Tyr Glu Asp Asp Arg Leu vfl Gin Val Arg Tyr Phe 
85 90 95 

35 J-^^J Gin Ala Glu Lys Tyr Val Tyr Asp Gly 

105 

Ser Lys Leu Val Leu Arg Glu He Arg Lys Ser Pro Thr Asp Glu Thr 

^, 120 125 

Pro He Lys Lys Val Ser Tyr His Tyr Leu Cys Gly Ser Asp Met Pro 

135 140 
Phe Glu He Thr Thr Glu Met Ser Asp Gly Tyr Phe Glu Ser His Thr 
"0 355 160 

Gly Lys He Ala Arg He Asp He Met Thr 
165 170 JL75 

45 "° "^"^ ^^"^ Thr Gly Arg Met Val 

185 190 



40 



Leu Asn Tyr Leu Asn Gly Lys He Ala Arg He Asp He Met Thr Gin 

Gin Asn Pro Ser Ala Glu Leu lis Glu Thr Gly Arg Met Val Tyr Glu 

185 190 
PI.e Asp Ala A«n Asn Asp Ala Val Leu Leu Arg Asp Ser Val Phe Leu 

1-5 200 206 

Pro Leu Gin Asn Lys Trp Val Glu Met Phe Thr His Arg Tyr Thr Tyr 
Rn ^1^ 220 

OU Asp Asn Lys His Asn Cys He Arg Trp Glu Gin Asp Glu Phe Gly Thr 

235 240 
Leu Thr Leu Ala Asn Asn Phe Glu Tyr Asp Thr Thr He Pro Leu Ser 

245 250 255 

Ser Val L.u Phe Pro Thr His Glu Glu Phe Phe Arg Pro Leu Leu Pro 

Asn Phe Met Lys Ills Met Arg Thr Lys Gin Thr Tyr Phe fin Asn Ser 

2^5 280 285 

Gly Glu Gly Leu Ser Glu Val Cys Asp Tyr Asn Tyr Phe Tyr Thr Asp 
Kfl 295 -^00 

t>U Met Gin Gly Asn Ale Leu Thr Asp Val Ala Val Asn Glu Ser He Lys 

He Tyr Pro Arg Pro Ala Thr Asp Phe Leu Irl He Glu Gly Ser Gin 
32- 330 335 



Leu Leu Arg Leu Ser Leu Phe Asp Het Asn Gly Lys Leu He Arg Ala 

o^U 2AS 

Thr Gly Asp Leu Ala He 
360 

Tyr He Ala Glu II- Thr 
7n 375 30 0 

/U Arg Ala Lys Val Ser Leu Arg 



Thr Glu Leu Thr Gly Asp Leu Ala He He Cly Val Ala Ser Leu Pro 

360 365 
Arg Gly Thr Tyr He Ala Glu 11= Thr Ala Ala Asn Ser Lys Thr II. 

Arg 

385 390 
(2) INFORIIATIW FOR SEQ ID NO: 271 
7^ <i) SEQUENCE CHARACTERISTICS: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(A) LENGTH: 428 amino acl-ds 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) HOLE'JULE TYPt: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginqivaiis 



(ix) FEATURE: 

(A» MAME/KEY: misc_f«ature 
tD» LOCATION i. . .428 

(xij SEQUEMCE DESCRIPTION: SEQ ID HO:271 

Lys Arq Asn Pro Leu Pro Leu Thr Aia Ser Asn Arg Lys lie 
: ^ 10 

Lye Met Arg Gin His Leu Ser Leu Phe Pro Phe He Leu Phe 

^' 25 30 

Leu Ala Phe Ser Tyr Vai Giy Cys Arg Thr Val Arg Gin Thr 

35 40 45 

Gin Ser Glu Arg Tyr Val Val Val Leu Ser Leu Asp Gly Phe 

S5 6Q 
Asp Tyr Thr Asp Arg Ala Arg Thr Pro Ala Leu Asp Arg Met 

70 75 
Glu Gly Leu Ser Gly Ser Leu Gin Pro Cys Phe Pro Ser L-u 

85 90 
Pro Asn His Tyr Ser Met Ala Thr Gly Leu Tyr Pro Asp His 

100 105 
He Val Ala Asn Glu Phe Val Asp Ser Leu Leu Gly ll- Phe 

lli> 120 125 

Ser Asp Arg Lys Ala Val Glu Thr Pro Gly Phe Trp Gly Gly 

135 i/|o 
Val Trp Acn Thr Ala Ala Arg Gin Gly He Arg Thr Gly Val 

150 155 
Trp Val Gly Ser Glu Thr Ala Val Asn Gly Asn Arg Pro Trp 

165 170 
Lys Lys Phe Ser Ser Thr Val Pro Phe Arg Asp Arg Ala Asp 
180 185 loS 

He Ala Trp Leu Gly Leu Pro Glu Lys Glu Arg Pro Arg L-u 

195 200 205 

Trp Tyr lie Glu Glu Pro Asp Met He Gly His Ser Gin Thr 

215 220 
Ser Pro Leu Thr Leu Ala Met Val Glu Arg Leu Asp Ser Val 
225 230 235 

Tyr Phe Arg Lys Arg Leu Asp Ser Leu Pro He Ala Ala Gin 

245 250 
Phe He He Val Ser Asp His Gly Met Ala Thr Tyr Glu Asn 
2^0 265 270 

Cya Val Asn Leu Ser His Tyr Leu Pro Ala Asp Ser Phe Leu 

275 280 285 

Ala Thr Gly Ala Fhe Thr His Leu Tyr Pro Lys Pro Ser Tyr 
. 290 295 300 

Arg Ala Tyr Glu He Leu Arg Ala He Pro His He Ser Val 

310 315 
Lys Gly Glu Val Pro Lys Arg Leu Arg Cys Gly Thr Asn Pro 

325 330 
Gly Glu Leu Val Vai He Pro Asp He Gly Ser Thr Val Ph- 
. ^""^ 345 350 

He Asn Glu Asp Val Arg Pro Gly Ala Ala His Gly Tyr Asp 

-^55 360 355 

Ala Pro Glu Met Arg Ala Leu Leu Arg Ala Val Gly Pro Asp 

" 3*^5 330 

Pro Gly Ser Arg Val Glu Asn Leu Pro Asn He Thr He Tyr 

390 395 
He Cys Arg Leu Leu Gly He Glu Pro Ala Pro Asn Asp Ala 

4 05 410 
Thr Leu Leu Asn Gly Leu He Arg Asp Lys Arg Pro 
420 425 

(2) INFORf lATIOI^ FOR SEQ ID HO: 272 

(i) SEQUEI^CE CHARACTERISTICS: 



Phe He 
15 

Leu Leu 

Pro Lys 

Arg Pro 

Ala Gin 

BO 
Thr Phe 
95 

His Gly 

Arg He 

Glu Pro 

Tyr Phe 
160 
Arg Trp 
175 

Ser Val 

I.CU Her 

Pro Glu 

Val Gly 
240 
Thr Aso 
255 

Glu Lys 

Tyr Met 

Thr Glu 

Tyr Arg 
320 
Arg Leu 
335 

Phe Ala 

Asn Gin 

Phe Arg 

Pro Leu 
4 00 
Asp Glu 
415 
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(A) LENGTH: 282 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) >K)LECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas glnglvalis 

iix] FEATURE: 

(A) HAI'IE/KEY: inisc_f eature 

(B) LOCATIOTJ 1 . . .282 

(xi) SEQUEMCE DESCRIPTIOH: SEQ ID NO:272 

Leu ser Arg Gly Val Phe Pro Leu Meu Tyr Gly Arg Arg Gly Ser II- 

Arg Ala Ser Scr Gly His Arg Asp Lys He Phe Lys Asn Thr lie He 

2S 30 
Arg Phe He Thr Het Lyr Val Gly Leu Phe He Pro Cys Tyr Val Asn 



PCT/AU98/01O23 



10 



15 



25 



45 



60 



65 



70 



75 



Ala Val Ti^r Pro Glu Val Gly He Ala Thr Tyr Lys Leu Leu Lys Ser 
55 60 



40 45 
He Ala Thr Tyr Lys Le 
60 

Leu Asp He Asp Val Asp Tyr Pro Het Asp Gin Thr Cj^s Cys Gly Gin 

Pro liet Asn Ala Gly Phe Glu Gin Lys lia Gin Lys Leu Ala Leu 

Arg Phe Glu Glu Leu Phe Glu Ser Tyr Asp V..1 v«l Val Gly ^ro Ser 

« 105 no 

Ala Ser Cys Val Ala Phe Val Lys Glu Asa Tyr Asp His He Leu Arg 



120 125 

35 I30 ""^^^ ""^^ 1^5 ^^"^ ^'^g cys 

Glu Phe Lea His Asp Asp Leu Lys He Thr Ser Leu Pro Ser Arg Phe 

1^0 155 igQ 

Ala His Lys Val Ser Leu His Asn Ser Cys His Gly Val Arg Glu Leu 
An 165 170 175 

4U His Leu Ser Thr Pro Ser Glv. Val His Arg Pro Tyr His Asn Lys Val 

180 las 190 

Arg Arg Leu Leu Glu Met Val Gin Gly He Glu Val Phe Glu Pro Lys 

195 200 20s 

Arg Tl9 Asp Glu Cys Cys Gly Phe Gly Gly Het Tyr Ser Val Glu Glu 

^■'-^ 215 220 

Pro Glu Val Ser Thr Cys Met Gly His Asp Lys Val Leu Asp His He 
2-^0 235 OA(\ 

Ser Thr Gly Ala Glu Tyr He Thr Gly Pro Asp Ser Ser Cy^ Leu HeL 

245 250 255 

His Met Gin Gly Val He Asp Arg Glu Lys Leu Pro He Lys Thr He 



cn 245 250 oe.e, 

50 His M^i- n^r. Gly Val He Asp Arg Glu " " 

2^0 265 270 

His Ala Val Glu He Leu Ala Ala Asn Leu 
275 280 

55 (2) DIFOKHATION FOR SEQ ID MO: 273 

(il SEQUENCE CHARACTERISTICS: 

(A» LENGTH: 251 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qingivalis 

(Ix) FEATURE: 

(A) NAME/KEY: ml sr:_f eature 

(B) LOCATION 1- . .251 

(>:i) SEQUENCE OESCRIPTION: SEQ ID NO: 273 
Pro Leu I.ys Lys Arg Met Asp He Val Ser Mat Aia Asp Lys Ala Leu 
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^5 10 15 

Val Val Glu Hct Arg Asp Val Thr Leu Cys Gin Glu Glu Asn Val He 
20 25 30 

^ Phe Gin Aan Leu Asn Leu Thr T.eu Ser Ala Gly Asp Phe Val Tyr Lou 

He Gly Scr Val Gly Scr Gly Lys Ser Thr Leu Leu Lys Ala T^u Tyr 

Ala Glu Val Pro He Ser Ala Gly Tyr Ala Arq Val He Asp Tyr A*p 

10 Leu Ala Lye Leu Lys Arg Uys Gin Leu Pro ^yr Leu Arg Arg Asn Lou 

OO 90 gg 

Gly He Val Phe Gin Asp Phe Gin Leu Leu Asn Gly Arg Thr Val Ala 
. 1-0-*i 110 

i.lu A^n Leu Asp Phe Val Leu Arg Ala Thr Asp Trp Lye Asn Arg Ala 

_ ^ lis 120 125 

Arg Glu Gin Arg He Glu Glu Val Leu Thr Arq Val Gly Met Ser 

- 7 , J40 

Aig Lys Ala Tyr Lys Arq Pro His Glu Leu Ser Gly Gly Glu Gin Gin 

20 Arg Val Gly He Ala Arg Ala Leu Leu Ala Lys Pro Ala Leu He Leu 

Ala Asp Glu Pro Thr Gly Asn Leu Asp Ser Val Thr Gly L«u G^n H- 
1.1 o , 190 
Ala Ser Leu Leu Tyr Glu He Ser Lys Gin Gly Thr Ala Val Leu Met 

200 205 
Thr His Asn Ser Ser Leu Leu Ser Hie Leu Pro Ala Arg Thr Leu 
215 220 
Ala Val Arg Lys Asn Gly Asp Ala Ser Ser Leu Val Glu Leu Ser Ala 
on 230 235 -j^n 

»30 Asp Ala Val Ser Arq Lys Asn Thr Glu He Asp 
245 ^ 

(2) INFOWIATION FOR SEQ ID MO: 274 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 23 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



PCT/AU98A)ia23 



40 



50 



lii) MOLECULE TYPE: protein 
(iil) HYPOTHETICAL: YES 



^"'^J ORIGINAL SOURCE: 
^ fA) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) HAIIE/KE^-: misc feature 

(B) LOCATIOiT 1...238 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 274 

Thr Arg His Cys Pro Ala Cy« Arg Ser Ser Phe His Leu He Lys Thr 

55 Ser Lys Thr Met He Glu He Ser Asn Leu Thr Lys Val Phe Thr 
20 25 30 

Glu Glu He Glu Thr Val Ala Leu Asp Gly Val Ser Leu Lys Val Asp 

60 ^^'^ ti^ ^^"^ "^^y P--^ '^^'^ <51y Gly Lys Ser 

Thr Leu Leu Asn He Leu Gly Leu Leu Asp Asn Pro Thr Ser Gly He 

70 75 ^ OQ 

Tyr Lys Leu Asp Gly Ala Glu Val Gly Asn Leu Arg Glu Lys Asp Arg 
cc ®^ 90 05 

" Thr Ala Val Arg Lys Gly A.„ xl. oiy phe Val Phe Gin Ssr Phe Aan 
100 105 110 

Leu He Glu Glu Met Thr Val Ser Glu Asn Val Glu Leu Pro Leu Val 

120 12S 
70 ^^'"^ ^30 "^^^ i3f ^""^ <51u Glu Ala 

Leu Arg Lys Met Ser He Ser His Arg Ala Gly His Phe Pro Asn Gin 
T 150 155 

Leu Ser Gly Gly Gin Gin Gin Arg Val Ala He Ala Arg Ala Val Val 
75 Ala Asn Pro Lys Lcn lie Leu Ala Asp gI! Pro Thr Gly Asn ^lu Asp 
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10 



185 

Ser Lys Asn Gly Ala Asp Val Met Glu Leu Leu Arg Gly Leu Asn Arg 

200 205 

Glu Gly Ala Thr Ilo Val M« v,l Thr HI. S« Glu His Asp Ala Arc, 

215 220 
Ser Ala Gly Arg He He Asn Leu Phe Asp Gly Lye He Arg 
230 235 

i2] INFORIIATION FOR SEQ ID HO: 275 



(1) SEQUEMCE CHARACTER I STICS: 

(A) LENGTH: 604 amino acids 

(B) TYPE: amino acid 
iO) TOPOLOGY: linear 

(H) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

tvl ) ORICIHAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingival! s 

(ix) FEATURE: 
«tr <A) NAME/:<EY: misc feature 

(B) LOCATIOII 1...604 

(xi) SEQUENCE DESCRIPTIOfi; SEQ ID NO: 275 

2^ Ser Thr Glu Thr Asn Ser Lys Ser Glu Met Lys Glu Phe Phe Lys Met 

Phe Phe Ala Ser He Leu Gly Val He ^hr Ala Gly He He Leu Phe 

25 30 

^..e 

Qe; , 40 4s 



Cys lie Phe Leu Pho He Phe Phe Gly He Val Ala Gly He Ala Se. 
Ly. Ala Thr Gly Gly Thr lie Pro Lys He Glu Ala Asn Ser He Leu 



His He Unk A^,,, Ser Ser Phe Pro Glu He Val "r Ala Asn Pro Trp 

ser Met Leu Thr Gly Lys Asp Glu Ser Val Ser Leu Ser Gin Ala Val 

90 oe; 
Glu Ala He Gly Gin Ala Lys Asn Asn Pro Asn He Thr Gly Il« Phe 

no 

45 Arg Arg Ala Leu Gin Asp Phe Jy^ Met Ser Gly Lys Phe Val Val Ser 

1-^^ 140 
Tyr Ala Asp Arg Tyr Thr Gin Lys Gly Tyr Tyr Leu Ser Ser He Ala 

155 1 pn 

Asp Lys Leu Tyr Leu Asn Pro Lys Gly Met Leu Gly L«u He Gly ill 

Ala Thr Gin Tl.r Met Phe Tyr Lys Asp Wa Leu Asp Lys Phe Gly Val 

165 190 
Lys net Glu He Phe Lys Val Gly Thr Tyr Lys Ala Ala Val Glu Pro 
cc 200 2^5 

Phe Met Leu Asn Arg Met Ser Asp Ala Asn Arg Glu oin He Thr Thr 

Tyr He Asn Gly Leu Trp Asp Lys He Thr Ser A^p He Ala Glu Ser 

235 "An 
Arg Lys Thr Ala Met Asp Ser Val Lys Met Phe Ala Asp Lys Gly Olu 

250 9 c 

net Phe Gly Leu A, a Glu Lys Ala Val Glu Met Lys Leu Val Isp Glu 

Leu Ala Tyr Arg Thr Aip Val Glu l" GU. Leu Lys Lys Met Ser Gin 

65 Arg Gly Glu Lys Asp Glu Leu Arg Phe Val Ser Leu Ser Gin Val Leu 

Ala Asn Giy Pro Met Asn Lys Thr Lys Gly Ser ^g He Ala Val Leu 

Phe Ala Glu Gly Glu He Thr Glu Glu He Ho Lys Lys Pro Phe A.'p 

Thr Asp Gly Ser Ser Tie Thr Cln Glu Leu Ala Lys Glu He Lye Ala 

Ala Ala Asp Asp Asp Asp He Lys Ala Val Val Leu Arg vTl Asn Ser 

75 Pro Gly Gly s„r Ala Phe Thr Ser Glu Gin He Trp Lys Gin Val Ala 
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375 380 
Afp Leu Ly3 Ala Lys Lys Pro He Val Val Ser Het Giy Asp Val Ala 
390 395 jnn 

Ala ser Gly Gly Tyr Tyr He Ala Cya Ala Ala Asn Ser He Val Ala 

405 410 415 

Glu His Thr Thr Leu Thr Gly Ser He Gly He Phe Gly Met Phe Pro 

425 430 
Asn Phe Ala Gly Val Ala Lys Lya He Gly Val Asn Met Asp Val Val 



435 440 

Gin Thr Ser Lys Tyr Ala Asp Leu Gly Asn Thr Phe Ala Pro Met Thr 

455 460 
Val Glu Asp Arg Ala Leu He Gin Arg Tyr He Glu Gin Gly Tyr Asp 

Leu Phe Leu Thr Arg Val Ser Glu Gly Arq tin Arg Thr Lys Ala Gin 

405 490 495 

He Asp Sor He Ala Gin Gly Arg Val Trp Leu Gly Asp Lys Ala Leu 

500 505 
Ala Leu Giy Leu Val Asp Glu Leu Gly Gly Leu Asp Thr Ala He Lys 

515 520 525 

Arg Ala Ala Lys Leu Ala Gin Leu Gly Gly Asn Tyr Ser He Glu Tyr 

=>J0 535 540 

Gly Ly^ Thr Lys Arg Asn Phe Phe Glu Glu Leu Leu Ser Ser Ser Ala 
550 555 ego 

Ala Asp Met Lyr Sjr Ala He Leu Ser Thr He Leu Ser Asp Pro Glu 

Sc5 570 
He Glu Val Leu Arg Glu Leu Arg Ser Met Pro Pro Arg Pro Ser Gly 

580 585 590 

He Gin Ala Arg Leu Pro Tyr Tyr Phe Met Pro Tvr 
595 600 

(2) THEX)RMATTOH FOR SEQ ID NO: 27 6 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 324 amino acids 
(3) TYPB: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

40 (ill) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyroroonas gtngivalls 

45 (ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .324 

5Q *^i> SEQUENCE DESCRIPTION: SEQ ID NO: 27 6 

Leu Thr Leu Phe Cys Cys Asn Tyr Phe Lys Gin Met Arg Ala Asn He 
Trp Gin Il« Leu Ser Val Ser Val Leu Phe Phe Phe Gly Thr Ala He 

55 Gly Gin Ala Gin Ser Arg Asn Arg ^hr Tyr Glu Ala Tyr Val Lys Gin 

Tyr Ala Asp Glu Ala He Arg Gin Met Ser Arg Tyr Ifn He Pro Ala 

55 50 
Scr Tie Thr He Ala Gin Ala Leu Val Glu Thr Gly Ala Gly Ala Ser 

Thr Leu Ala Ser Val His Asn Asn His Phe ofy He Lys Cys His Lys 
_ „ ^5 90 05 

Ser Trp Thr Gly Lys Arg Thr Tyr Arg Thr Asp Asp Ala Pro isn Glu 
_ ^-'^ 105 110 

USJ Ci^ Phe Arg Set Tyr Ser Ala Ala Arg Glu Ser Tvr Glu Asp His Ser 
a 120 125 

Arg Phe Leu Leu Gin Pro Arg Tyr Arg Pro Leu Phe Lyr Leu Asp Arg 
135 140 



60 



70 lis ''^''^ '^^'^ ""^y <51'> Arg Ci-s Gly Tyr Ala 

Thr Asn Arg Gly Tyr Ala Asn Leu Leu He iys Met Val Glu Leu iyr 
Glu Leu Tyr Ala Leu Asp Arg Glu Lys ^yr Pro Ser Trp Phe hVs Lys 

75 Ser Tyr Pro Gly Ser Asn Lys Ly.- Ser His Gin Thr Thr ^ys Gin Lys 
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200 205 
L.ln Ser GXy Leu Lys His Glu Ala Tyr Phe Ser Tyr Gly Leu Leu Tyr 

210 215 220 

lie He Ala Lys Gin Gly Asp Thr Phe A^p Ser Leu Ala Glu Glu Phe 
"■^-^ 230 235 240 

Asp M-t Arg Ala Ser Lys Leu Ala Lys Tyr Asn Atp Ala Pro Val Asn 

245 2S0 255 

Phe Pro He Glu Lys Gly Asp Val He Tyr Leu Glu Lys Lys His Ala 
r. ^ 265 270 

Cys Ser He Ser Lys His Thr Gin His Val Val Arg Val Gly Asp Ser ^ 

275 



280 285 



^^^^ oio ^'"^ ^^"^ "^y^ ^^^9 H«t Lys Asn Leu Tyr 

295 300 
Lys Leu Asn Asp Lys Asp Gly Glu Tyr He Pro Gin Glu Gly Asp He 
, 310 315 

Leu Arg Leu Arg 

(2) rilFOKMATION FOR SEQ ID fIO:277 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: 1 In-ar 

(li) MOLECULE rY?E: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) WAIIE/KEY: iiiiEC_f eature 

(B) LOCATION 1...533 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277 

Arg He Pro Asp Glu Gin Thr Gly Arg He Met Asp Gly Arg Arg Tyr 

10 IS 
Sar Asp Gly Leu His Gin Ala He Glu Ala Lys Glu His Val Lys Val 

2° 25 30 

Glu Ala Ala Thr Glri Thr Phe Ala Thr He Thr Leu Gin Asn Tyr Ph- 

35 45 
Arg Met Tyr His Lys Leu Ala Gly Met Thr Gly Thr Ala Glu Thr Glu 

•^^ S5 SO 

Ala Gly Glu Leu Trp Asp He Tyr Lys T.cu Asp Va] Val Val II- Pro 
" "'O 75 " gQ 

Thr Asn Lys Pro He Ala Arg Lys Asp Met Asn Asp Arg He Tyr Lys 

85 go 95 

Thr Ala Arg Glu Lys Tyr Ala Ala Val He Glu Glu He Val Arg Leu 

100 105 110 

Val Glu Glu Gly Arg Pro Val Leu Val Gly Thr Thr Ser Val Glu He 

115 120 125 

Ser Glu Leu Leu Ser Arg Met Leu Arg Leu Arg Gly He Gin His Asn 
135 140 



Val Leu Asn Ala Lys Leu His Gin Lys Glu Ala Glu He Val Ala Gin 

Ala Gly Gin Lys Gly Thr Val Thr He Ala Thr Asn Met Ala Gly l^g 

170 175 
Gly Thr Asp He Lys Leu Ser Ala Glu Val Lye Lys Ala Gly Gly Leu 

185 190 
Ala He He Gly Thr Glu Arq His Glu S<r Arg Arg Val Asp Ary Gin 

200 205 
Leu Arg Gly Arg Ser Gly Arg Gin Gly Asp Pro Gly Ser Ser Tie Ph.; 

215 220 
Tyr Val Ser Leu Glu Asp His Leu Met Arg Leu Phe Ala Thr Glu Lys 

Til o , ^^"^ 235 240 

11- Ala S-r Leu Met Asp Arg Leu Gly Phe Lys Glu Gly Glu Val Leu 

Glu Asn Asn Met Leu Ser Lys Ser Val G^u Arg Ala Gin Lys Lys Val 

265 270 
Glu Glu Asa Asn Phe Gly He Arg Lys His Leu Leu Glu Tyr Asp Asp 

280 285 
\al Met Asn Ser Gin Arg Glu Val H- Tyr Thr Arg Arg Arg His Ala 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



PCT/AU98/01023 



193/490 





290 










295 










300 










Leu 


Het 


G\y 


Glu 


Arg 


He 


Gly 


Met 


Asp 


Val 


Leu 


Asn 


Thr 


lie Tyr 


Asp 


305 










310 










31S 










320 


Val 


Cys 


hys 


Ala 


Leu 


He 


Asp 


Asn 


Tyr 


Ala 


Glu 


Ala 


Asn Asp 


Phe 


Glu 










325 










330 










335 




Gly 


Phe 


LjS 


Glu 


Asp Leu 


Met 


Arg 


Ala 


Leu 


Ala 


lis 


Glu 


Ser 


Pro 


He 








340 










34 5 










350 






Thr 


Cln 


Glu 


He 


Phe 


Arg 


Gly 


Lys 


Lys 


Ala 


Glu 


Glu 


Leu 


Thr Asp Het 






355 










360 










365 








Leu 


Pile 


Asp 


Glu 


Ala 


Tyr 


Lys 


Ser 


Phe 


Gin 


Arg 


Lys 


Met 


Asp 


Leu 


He 




370 










375 










380 








Ala 


Glu 


Val 


Ala 


His 


Pro 


Val 


Val 


His 


Gin 


Val 


Phe 


GJu 


Thr 


Gin 


Ala 


385 










390 










395 










400 


Alci 


Val 


Tyr 


Glu 


Arg 


Tie 


Leu 


He 


Pro 


He 


Thr Asp 


Gly Lys Arg 


Val 










4 05 










410 










415 




Tyr Asn 


He 


Gly 


cys 


Asn 


Leu 


Arg 


Glu 


Ala 


Asp Glu 


Thr 


Gin 


Gly 


Lys 








420 










425 










430 






Ser 


lie 


He 


Lys 


Glu 


Phe 


Glu 


Lys 


Ala 


He 


Val 


Leu 


His 


Thr 


He 


Asp 






435 










440 










445 






Glu 


Ssr 
4 50 


Trp 


Lys 


Glu 


His 


Leu 
4 55 


Arg 


Glu 


Met 


Ajsp 


Glu 
460 


Leu 


Arg 


Astn 


Ser 


Val 


Gin 


Astt 


Ala 


Sor Tyr Glu 


Asn 


Lya 


Asp 


Pro 


Leu 


Leu 


He 


Tyr 


Lys 


46S 










470 










475 








480 


Leu 


Glu 


Ser 


Tyr 


Glu 


Leu 


Phe 


Arg 


Lys 


Met 


Val 


Glu 


Ala 


Met 


Asn 


Arg 










485 










4 90 










4S5 


Ly^ 


Thr 


Val 


Aid 
500 


He 


Leu 


Met 


Arg 


Ala 

505 


Arg 


lie 


Pro 


Val 


Pro 
5X0 


Glu 


Ala 


Pro 


Ser 


Gin 


Glu 


Glu 


Leu 


Glu 


His 


Arg 


Arg 


Gin 


He 


Glu 


He 


Arg 


His 






515 










520 










525 






Ala 


Thr 
530 


Glu 


Gin 


Arg 

























(2) IHFORt-IATTOH FOR SEQ ID HO: 278 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 720 anano acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Uii) HYPOTHRTXCAL: YES 

(vij ORIGIIJAL SOURCE: 

(A) ORGAHISH: Porphyroroonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: mlsc:_feature 

(B) LOCATION 1...720 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278 



Lys 


Ser 


Cys 


Arg 


Val 


He 


Giy 


Gin 


Thr 


Arg 


Arg 


Tyr 


Gly Cys Cys Pro 


1 








5 










10 






15 


Val 


Gly 


Leu 


Arg 


Pro 


Cys 


His 


Gin 


Ser 


Leu 


Ser 


Giy 


Val Cys Arg Cys 








20 










25 








30 


Leu 


Phe 


Gin 


Ser 


Arg 


Pro 


Ala 


Val 


Ala 


Phe 


Ala 


Arg 


Lys Gly Gin Arg 






35 










40 










45 


Arg 


His 


Axg 


Arg 


Ser 


Glu 


Ser 


Gly 


Arg 


Arg 


He 


Arg 


Ala Leu Gin Gly 




50 










55 










60 


Val 


Gin 


His 


His 


Gin 


Thr 


Asn 


Glu 


His 


Glu 


Val 


Met 


He Ser Val Asn 


65 










70 










75 




eo 


Asn 


Leu 


Thr 


Val 


Asp 


Phe 


Gly 


Thr 


Arg 


Leu 


Leiu 


Phe 


Asp Gin Val Ser 










85 










90 






95 


Phe 


Val 


He 


Asn 


Arg 


Arg 


Asp 


Arg 


He 


Ala 


Leu 


Val 


Gly Lys Asn Gly 








100 










105 








110 


Ala 


Gly 


Lys 


Ser 


Thr 


Leu 


Leu 


Lys 


Leu 


He 


Ala 


Gly 


Met Glu Glu Pro 






115 










120 










125 


Thr 


Ser 


Gly 


His 


He 


Ala 


Arg 


Pro 


Lys 


Gly 


He 


Arg 


He Gly T/r Leu 




130 










135 








140 


Pro 


Gin 


Val 


Het 


Arg 


Leu 


Gin 


Asp 


Gly 


His 


Thr 


Val 


Tyr Glu Glu Val 


145 










150 










155 




160 


Glu 


Gin 


Ala 


Phe 


Asn 


Asp 


He 


Arg 


Gin 


He 


Glu 


Glu 


Glu Tie Arg Arg 










165 










170 






175 


Leu 


Ser 


Asp 


Glu 


Mot 


Ala 


Gly 


Arg 


Thr 


Asp 


Tyr 


Glu 


Ser Asp Asp Tyr 
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ISO 185 190 

lie Arg Leu lie Giu His Tyr Thr Asn Met Sec Glu Thr Leu Scr I^u 

200 205 
net Gin Gin Gly Asn Tyr His Aia Aia lie Glu Gin Thr Leu lie Giv 
^ ^ 210 215 220 

Leu Gly Phe Giy Arg Giu Asp Phe His Arg Pro Thr Ala Asp Phe Ser 
^' 230 235 240 

oly Gly Trp Arg Met Arg He Glu Leu Ala Lys Leu Leu Leu Gin Arg ' " 

in 250 255 

lU Pro Glu Val Leu Leu Leu Asp Glu Pro Thr Asn His Leu Asp He Glu 
260 265 270 

S-r He Gly Trp Leu Glu Gin Phe lie Ala Thr Asn Ala Gly Aia Val 
2''5 280 295 

ie: ^ ^t'i ^^-^ ^^'^ "^^^ Ala Phe He A^p Asn Thr The Thr Arq 

290 295 

Thr He Glu He Glu Leu Gly His He Tyr Asp Tyr Lys Thr Asn Tyr 
i 315 320 

Ser His Tyr Val Glu Leu Arg Glu Giu Arg Lsu Arg Gin Gin H-t Arg 
m ^25 330 335 

SCU Ala Tyr Glu Asn Gin Gin Lys riet He Arg Asp Thr Glu Asp Phe 11- 
- ^, , ^"^O 345 350 

Glu Arg Phe Arg Tyr Lys Ala Thr Lys Scr Val Gin Val Gin Ser Arq 

360 365 
25 ^Xf. '^^^ ^9 Glu lie Asp Giu Arg Asp 

375 380 
Arg S'er Ala Phe His Phe Arg Phe He Pro Ala Gin Pro Ser Gly S-r 
^ 395 400 

Tyr Pro Leu He Val Asp Asp Leu Ala Lys Ala Tyr Gly Asp His Gin 
on ^^"^ 410 4 IS 

Vai Phe Ser Gly Ala Thr Tyr Thr He Glu Arq Gly Glu Lys Val Ala 
r>i. , 425 430 

Phe Val Gly Lys Asn Gly Ala Giy Lys Ser Thr Met Val Lys Cys He 
435 

^^"^ ^^"^ '^P "^^^ Glu Leu Gly Hi^ Asn 

450 

Val Gin Leu Gly Tyr Phe Ala Gin Asn Glu Ala Gin Glu Leu Arg Gly 

470 ^ 
Asp Leu Thr Val Phe Asp Thr He Asp Arg Glu Ala Val Gly Asp He 
ATk 485 4 435 

W Arg Leu Arg Leu Asn Asp Leu Leu Gly Ala Phe Leu Phe Gly Giv Glu 
500 505 5X0 

Aia Ser Glu Lys Lys Val Ser Val Leu Ser Gly Gly Giu Arg Ala Aro 
SIS 520 525 

A'^g Leu Leu Leu Gin Pro Ala Asn Phe Leu He Leu 
530 535 

Asp Glu Pro Thr Asn His Leu A^p Her Arg Ser Lys Asp Val Leu Lys 

555 560 
Giu Ala He Lys Asn Phe Asp Gly Thr Val He Val Val Ser His Asp 
Cf\ 565 570 575 

tjU Arg Glu Phe Leu Asp GD y Leu Val Ser Lys Val Tyr Glu Phe Aia Asp 
580 535 590 

Gly Gin Val Asrt Glu His Leu Gly Gly He Tyr Asp Tyr Lou Arg Thr 
595 600 605 

55 ^ ^fn ^^"^ '^^'^ Giu Arg Thr Thr Thr He Glu 

615 620 
Thr Lys Thr Thr Arg Glu Aia He Pro Glu Thr Glu Ala Lys Ala A«r 
630 635 g^^Q 

Tyr Arg Arg G)n Lys Glu Val Ala Lys Gin Leu Arg Thr Leu Glu Arg 
Rn S4S 650 655 

Thr Val Ala Thr Cys Glu Glu Arg He Gly Lys Leu Glu Sor Glu Leu 

660 665 670 

Gin Ala He Glu Met Leu Leu Gin Asp Pro Lys His Ala Thr Asp Ala 
^^5 680 685 

65 son ^^"^ '^^'^ Glu Lys Ala 

695 700 



70 



75 



Mot Glu Asp Trp Glu Gin Aia Ser Giu Ala Leu Ser Glu Ala Gin Gly 

715 720 

(2) IHFORHATION FOR SEQ ID MO: 27 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 386 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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10 



20 



25 



30 



35 



45 



50 



55 



70 



(ii) HOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FFATURE: 

(A) NAHE/KEY: misc featur- 

(B) LOCATION 1. . .386 

(xl > SEQUENCE DESCRIPTION: SEQ ID NO:279 



15 1^'" ^'^^ Asn Phe Lou Lys 

5 10 IS 

Lys Glu Pro Phe Ly^ He Phe Ser Met lie Tyr L-u Leu Leu Asp Thr 

^- 2S 30 

Tie Thr A^n Arg Ala Gly Thr Glu Arg Ala Vai He hsrx Leu Ala Asn 

35 4 0 4 5 

Asn Leu His Ala Asn Gly His Arg Val Ser Leu Val Ser Val Cys Thr 

Lys Glu Gly Glu Pro Ser Phe Gin Val Glu hys Gly He Glu Val His 

70 75 
His Leu Gly He Arg Leu Tyr Gly Asn Ala Lou Ala Arg Lys Thr Val 
■85 ~ 



90 95 



Tyr Phe Lys Ala Tyr Arg Arg He Lys Ala Leu Tyr Lys Lys Arg Glu 

100 105 110 

Pro Val Lou Leu He Gly Thr Asn He Pho He Asn Thr He Leu Ser 

lis 120 l;>c 

130 ^""^ '^^'^ 

Ser Tyr Asp He Ala Arg Pro He Thr Lys Arg He Arg Gly Phe Leu 

150 155 
Tyr ser Gly Leu Asp Ala Val Val Ala Leu Thr Lys Arg Asp Gin Gin 

165 170 175 

ser Phe Glu Ala He Leu Arg Gly Arg Ser Lys Ala Tyr Val He Pro 

180 185 190 

Asn Gin Val Ser Phe Thr Thr Val Gin Arg Asp Ala Thr Thr His Lys 
An 200 205 

210 ^""^ zlt ^^"^ "^^"^ '^^'^ "^^^ ^^'"^ ^^'^ 

Met He Glu Asp Ala Sor Arg Val Leu Arg Glu Arg Pro Asp Trp Lys 
230 235 240 

Leu Ho He Val Gly Asp Gly Glu Asn Glu Ser Met Leu Arg Lys Glu 

245 250 255 

Tie Ala Ser Arg Asn Met Glu Ser Gin He Glu He His Pro Ser Thr 

260 265 270 

Pro Glu He Arg Lys Tyr Tyr Glu Ser Ser Ala He Tyr Leu Met Thr 

2'-' 280 285 

Ser Arg Phe Glu Gly I.eu Pro Met Val Leu Leu Glu Ala Glu Ala Tyr 

290 295 300 

Ala Leu Pro He He Ser Tyr Asp Cys Pro Thr Gly Pro Arg Glu Leu 

He Glu Asn Gly Arg Asn Gly Phe Leu Val Pro Met Glu Ala Mis Glu 

330 335 

Asp Phe Ala Asp Lys Leu Arg Leu Leu Met Asp Asp Glu Thr Leu Arg 

345 350 

Lys Lys Met Gly Gin Glu Ser Glu Leu Met Val Lys Ser Tyr Ser Pro 
«ri , 365 

Ala Asn He Tyr Glu Cys Trp Lys Lys Leu Phe Val Glu He Gly Tyr 

Met Asn 
385 

65 12) INFORllATIOTJ FOR SEQ ID NO:280 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 amino acids 

(B) TYPE: amino acid 
(0) TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

__ till} HYPOTHETICAL: YES 

/O 
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(vi» ORIGINAL SOURCE: 

(A) ORGAfUSH: Porphyromon as gingivalis 

tlx) FEATURE: 

(A) HAJ4E/KEY: misc f eature 
IB) LOCATIOW 1 . . , 655 

(xl) SEQUENCE DESCRIPTION: SEQ ID llO:2B0 



10 



Thr Thr Lys GIu Asn Glu Tlir The Thr Lys Asn Glu Tyr Acq lie He^ 
^ S 10 15 

Gly Lys He II- Gly Tie Asp Leu Gly Thr Thr Asn Ssr Cys Val Ser 

20 25 30 

Val Leu Glu Gly Aan Glu Pro He Val He Thr Asn Ser Glu Gly Lys 
lO 35 40 45 

Arg Thi The Pro Ser Val Val Ala Phe Val Asp Gly Gly Glu Arg Lys 

50 55 60 

Val Gly Asp Pro Ala Lys Arg Gin Ala He Thr Asn Pro Thr Lys Thr 

70 75 80 

He Tyr Ser He Ly- Arq Phe Met Gly Glu Thr Tyr Asp Gin Val Ser 

85 90 95 

Arg Glu Val Glu Arg Val Pro Phe Lys Val Val Arg Gly Asp Asn Asn 

100 105 110 

Thr Pro Arg Val Asp He Aj?p Gly Arg Leu Tyr Thr Pro Gin Glu H-» 

lis 120 125 

Ser Ala Met He Leu Gin Lys MeL Lys Lye Thr Ala Glu Asp Tyr Leu 

130 135 140 

Gly Gin Glu Val Thr Glu Ala Val He Thr Val Pro Ala Tyr Phe Asn 

i**^ , 160 

Asp Ala Gin Arg Gin Ala Thr Lys Glu Ala Gly Glu He Ala Gly Leu 

165 170 175 

Lys Val Arg Arg lie Val Asn Glu Pro Thr Ala Ala Ser Leu Ala Tyr 

180 185 190 

Gly Leu Asp Lys Ser Asn Lye Asp Men Lys He Ala Val Phe Asp Leu 

195 200 205 

Gly Gly Gly Thr Phe Asp He Ser He Leu Glu Leu Gly Asp Gly Val 

210 215 220 

Phe Glu Val Lys Ser Tlir Asn Gly Asp Thr His Leu Gly Gly Asp Asp 
225 230 235 240 

Phe Asp His Val He He Asp Trp Leu Ala Giu Glu Phe Lys Ser Gin 

245 250 255 

Glu Gly Val Asp Leu Arg Gin Asp Pro Met Ala Het Gin Arg Leu Lys 

260 265 270 

Glu Ala Ala Glu Lys Ala Lys lie Glu Leu Ser Ser Thr Ser S»r Thr 

275 280 285 

Glu He Asn Leu Pro Tyr He Met Pro Val Asn Gly He Pro Lys His 

250 29b 300 

Leu Val IJot Thr Leu Thr Arg Ala Lys Phe Glu Gin Leu Ala Asp Ara 
310 315 320 

Leu He Gin Ala Cys Val Ala Pro Cys Glu Thr Ala Leu Lyu Asp Ala 
^, „ 330 335 

G]y Met Ssr Arg Gly Asp He Asp Glu Val He Leu Val Gly Gly Ser 

340 345 350 

Thr Arg He Pro Ala He Gin Glu Tic Val Glu Lyr He Phe Gly Lys 

355 360 365 

Ala Pro Ser Lys Gly Val Asn Pro Asp Glu Val Val Ala Val Gly Ala 

370 375 380 

Ala He Gin Gly Gly Val Leu Thr Gly Glu Val Lys Asp Val Leu Leu 
390 395 400 

Leu Asp Val Thr Pro Leu Ser Leu Gly He Glu Thr Met Gly Gly Val 

405 410 415 

Het Thr Arg Leu He Asp Ala Asn Thr Thr He Pro Thr Lys Lys Ser 

420 425 430 

Glu He Phe Thr Thr Ala Val Asp Asn Gin Pro Scr Val Glu He His 

435 440 445 

Val Lsu Gin Gly Glu Arg Ser Leu Ala Li's Asp Asn Lys Ser He Giv 

450 455 460 

Arg Phe Asn Leu Asp Gly He Ala Pro Ala Pro Arg Gin Thr Pro Cln 

7n ^, ^''^ 480 

/U He Glu 'vol Thr Phe Asp He Asp Ala Asn Gly He Leu Asn Val Thr 
485 490 495 

Ala His Asp Lys Ala Thr Gly Lys Lys Gin Asn He Arg Tie Glu Ala 

500 505 510 

S«r Ser Gly Leu Ser Asp Asp Glu He Lys Arg Met Lys Glu Glu Ala 
515 520 525 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 
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Gin Ala Asn Ala Glu Ala Asp Lys Lys Glu Lys Giu Arg lie Asp Lvs 

530 535 540 

He Asn Gin Ala Asp Ser Hct He Phe Gin Thr Glu Lys Gin L*u Lvs 
5 550 555 560 

Glu Leu Gly A*p Lys Phe Pro Ala Asp Lys Lys Ala Pro He Asp Thx 

565 570 575 

Ala Leu Asp Lys Leu Lys Glu Ala His Lys Ala Gin Asp Val Ala Aia 

SOO 585 590 

He As?p Thr Ala Met Ala Glu Leu Gin Thr Ala Leu Ser Ala Ala Gly 

595 600 605 

Glu Glu Leu Tyr Lys Asn Aia Gly Ala Ala Gin Gly Gly Ala Gin Pro 

€10 615 620 

Gly Pro Asp Phe Gly Gly Ala Gin Gly Pro Ser Ala Gly Asp Gin Pro 
625 630 635 640 

Ser Asp Asp Lys Asn Val Thr Asp Val Asp Phe Glu Glu Val Lys 
645 650 655 

(2) IMFORHATTOH FOR SEQ ID NO:2Sl 

(il SEQUENCE CHARACTERISTICS: 

(A) LEMG-IH: 4 67 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY: linear 

(iij MOLECULE* TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi] ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

Ux) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION I. . .467 

txi) SEQUENCE DESCRIPTION: SEQ ID NO: 281 

Lys Trp Ala Arg Thr Thr Thr Leu Arg lie Ser Asn Ala Thr lie Tyr 

is 10 15 

Lys Thr Met Arg Tyr Asp Leu Ala He He Gly Gly Gly Pro Ala Gly 

20 25 30 

Tyr The Ala Ala Glu Arg Ala Ala Lys Gly Gly Leu Lys Thr Leu Leu 

35 40 45 

He Glu Lys Asn Aia Leu Gly Gly Val Cys Leu Asn Glu Gly Cvs Il» 

50 Sb 60 

Pro Thr Lyflt Thr Leu Leu Tyr Ser Ala Lys Val Leu His Gin He Aia 
65 70 75 80 

Thr Ala Ser Lys Tyr Ala Val Ser Gly Thr Ala Asp Gly Leu Asp Leu 

85 90 95 

Gly T.ys Val He Ala Arg Lys Gly Lys He He Arg Lys Leu Thr Ala 

100 105 110 

Gly He Arg Ser Arg Lau Thr Glu Ala Gly Val Glu Met Val Thr Ala 

lis 120 125 

Glu Ala Thr Val Thr Gly Cys Asp Aia Asp Gly He He Gly He Thr 

130 135 140 

Ala Gly Glu Ala Gin Tyr Lys Ala Ala Asn Leu Leu Leu Cys Thr Glv 
ISO 155 leS 

Ser Glu Thr Phe He Pro Pro He Pro Gly Val Glu Gin Thr Glu T-r 

165 170 175 

Trp Thr Asn Arg Glu Ala Leu Gin Asn Lys Glu He Pro Thr Ser Leu 

180 ifls 190 

Val He He Gly Gly Giy Val He Gly H=t Glu Phe Ala Ser Ph» Ph» 

195 200 205 

Asn Gly He Gly Thr Gin Val His Val Val Glu Met Leu Pro Glu He 

210 215 220 

Leu Asn Gly He Asp Pro Glu His Ala Ala Met Leu Arg Ala Itts Tvr 
230 235 240 

Glu Lys Glu Gly H- Lys Phe Tyr Leu Gly His Lys Val Thr Ser Val 

245 250 255 

Arg Asn Gly Ala Val Thr Val Glu Tyr Glu Gly Glu Ser Lys Glu II- 

260 265 270 

Glu Gly Glu Arg He Leu Met Ser Val Gly Arg Arg Pro Val Leu Gin 

275 280 285 

Gly Phe Glu Ser Leu Gly Leu Val Leu Ala Gly Lys Gly Val Lvs Thr 
290 295 300 
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30 



35 



40 



Asn Glu Arg Het Gin Thr Ser Leu Pro Asn Vnl Tyr Ala Ala Gly A.p 
He Thr Gly Phe Ser I.eu Leu Ala His Thr III Val Arg Glu Ala llu 

C J2D 330 33S 

!> Val Ala val Aap Gin lie Leu Sly Lys Thr Afip Glu Thr Met Ser Tyr 

Atg Ala val Pro Gly Val Val Tyr T^r Asn Pro Glu Val A^a Gly Val 

•^^^ 3e0 3gs 

10 370 '"^'^ ^''^ ^"^g '^l^ Tyr Thr Val 

^ 380 

Arg Ar, Leu Pro Met Ma Phe Ser Gly Arg Phe Val Ala Glu Asn Glu 

Gin Gly A^n Gly Glu C^-^ Lys Leu Leu Leu Glu GJu Asn Arg Leu 

lie Gly Ala His Leu He Gly Asn Pro Ala Gly Glu Leu He Val Thr 

425 A'\f\ 
Ala Ala Met Ala He Glu Thr Gly Met Thr Asp Arg Gin He Glu Arg 

20 Jio ^""^ Lys Glu Thr Leu 

Ala Gly Gly 
465 

2g (2) IHFORIIATIOW FQR SEQ ID NO: 262 

(i) SEOLIENCE CHARACTERISTICS: 

(A) LENGTH: 945 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginqivalis 
{ ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATIOH 1 . . .94 5 



(xi) SEQUENCE DESCRIPTIOII: SEQ ID HO: 282 

Pro Lye He Leu Met Glu Leu Lys Arg Phe Leu Ser Leu Gly Leu Leu 

Leu Val Gly Phe He Pro Met Lys Leu Ser Ala Gin Gin Ala Gin Pro 

25 -an 
Leu Pro Thr A=p Pro Ala Val Arg Val Gly Lys Leu A^p A^n Gly Leu 

50 Thr Tyr Phe He Arg Hi. A|n otu A^n Pro Lys Asp Arg Ala A,p Phe 

Phe He Ala Gin Lys Val Gly Ser He Leu Glu Glu Asp Ser Gin Ser 

Gly Leu Ala His Phe L^u Glu His Met Ala Phe Asn Gly Thr Lys ][^n 

Phe Pro Gly Lys Asn Leu 11, Asn Tyr Leu Glu Thr He Gly Val Arg 

1 05 1 1 n 

l^he Gly Gin Asn Leu A^n Ala Ser Thr Gly Ph. Asp Lys iSr Glu Tyr 

60 Thr lie M.t ASP val Pro Thr Thr A.-g Gin Gly lie He Asp Ser Cys 

Leu Leu He Leu His A|p Trp Ser Asn Asn He Thr Leu Asp Gly His 

Glu He ASP Glu Glu Arg Gly Val He Gin g" Glu Trp Arg A, a irg 

Arg Asp Ala Asn Leu Arg Met Phe Glu Ala He Leu Ala Lys HI He'- 

Pro Gly Asn Lys Tyr Ala Glu Arg Pro He Gly Leu Met Asp VaJ 

7n . , 200 205 

70 val Leu Asn Phe Lys His Asp Glu Leu Arg Asn Tyr Tyr Lys Lys Trp 

Tyr Arg Pro Asp Leu Gin Gly Leu Val He Val III Asp He Asp Val 
Asp Tyr val Glu Asn Lys He Lys Glu Leu Phe Ly3 Asp Val Pro Ma 

250 255 
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Pro Val Asn Pro Ala Glu Arg Tie Tyr Thr Pro Val Glu Aap Asn Ast> 

260 265 270 

Glu Pro Tie Val Ala lie Ala Thr Asp Ala Glu Ala Thr Thr Thr Gin 

275 280 285 

Leu Ser He Ser Phe Lys Ser Asp Pro Thr Pro Gin Glu Val Arq Glv 

290 2S5 300 

Ser lis Phe Giy Leu Val Glu Asp Tyr Met Lys Gin Val He Thr Thr 
305 310 315 320 

Ala Val Asn Glu Arg Leu Ser Glu He Thr His Lys Pro Asn Ala Pro 

325 330 335 

Phe Leu Ssr Ala Gly Ala Phe Phe Ser Asn Phe Met Tyr He Thr Gin 

340 345 350 

Thr Lys Asp Ala Phe Asn Phe Val Ala Thr Val Arg Glu Gly Glu Ala 

355 360 365 

Glu LyB Ala Met Asn Ala Leu Val Ala Glu He Glu Ser Leu Arg Gin 

370 375 380 

Phe Gly He Thr Lys Gly Glu Tyr Asp Arg Ala Arg Thr Asn Val L*u 
385 390 395 4OO 

Lys Arg Tyr Glu Asn Gin Tyr Asn Glu Arg Asp Lys Arg Lys Asn Asn 
405 

Ala Tyr Ala Asn Glu Tyr Ser Thr Tyr Phe Thr Acp Gly Gly Tyr lie 

420 425 430 

Pro Gly Tie Glu Val Glu Tyr Gin Thr Val Asn Ala Phe Ala Pro Gin 

435 440 445 

Val Pro Leu Glu Ala Phe Acn Gin Ala He Ala Cln Met He Asp Pro 

450 455 
Val T.ys Asn Ala Val Val Thr Leu Thr Gly Pro Ser Lys Ala Glu Ala 
465 470 475 480 

Lys He Pro Ssr Glu Ala Asp Phe Leu Ala Ala Phe Lys Ala Ala Arg 

485 490 495 

Gin Gin Lys Val Glu Ala Lys Lys Asp Glu Val Ser Asp Gin Lys L^u 

500 505 510 

Met Glu Lys Ala Pro Lys Ala Gly Lys He Val Ser Glu Lys Lys Asp 

515 520 525 

Gin Lys Phe Gly Thr Thr Glu Leu Thr Leu Ser Asn Giy He Lvs Val 

530 535 540 

Tyr Leu Lys Lys Thr Asp Phe Lys Ser Asn Glu He Leu Met S-c Ala 
550 555 S60 

Leu Scr Pro Gly Gly He Leu Ser Gly Lys His Ala Pro Asn Gin Ser 

565 570 575 

Val Met Asn Ser Phe Met Asn Val Gly Gly Leu Gly Asn Phe Asp Ala 

580 585 590 

He Gin Leti Asp Lys Val Leu Thr Gly Arg Ser Ala Ser Val Ser Pro 

595 600 605 

Ser Leu Ser Leu Leu Ser Glu Gly Leu Ser Gly Lys Thr Thr Val Glu 

615 620 
Asp Met Glu Thr Phe Phe Gin Leu Ht Tyr Leu Gin Met Thr Ala Asn 
f25 630 635 640 

Arg Lys Asp Pro Glu Ala Phe Lys Ala Thr Gin Glu Lys Leu Tyr Asn 

645 650 655 

Asn Leu Lys Asn Gin Glu Ala Asn Pro Met Ala Ala Leu Met Asp Ser 

660 665 670 

He Arg His Thr Met Tyr Gly Asp Asn Pro Met Met Lys Pro Met Lys 

^75 680 685 

Ala Ala Asp Val Glu Lys Val Asn Tyr Asp Gin Val Met Ala Phe Tyr 

690 695 700 

Asn Glu Arg Phe Ala Asp Ala Gly Asp Phe Met Phe Phe Phe He Gly 
3°^ , ■'^^ 715 720 

Asn Leu Asp Glu Ala Lys Met Lye Pro Leu Tie Glu Thr Tyr Leu Ala 

''25 730 735 

ser Leu Pro Asn Leu Lys Arg Gly Asp Lys Met Asn Lys Ala Gin Val 

740 745 750 

Pro Ala Ala Arg Ser Gly Lys He Asp Cye Lye Phe Glu l.yr. Glu M-t 

755 /go 
Asp Thr Pro Ser Thr Thr He Phe Asp Val Val Ser Gly Asn Val Glu 
, ■'''5 780 

Tyr Thr Leu Lys Asn Ser Leu L«u Leu Glu Val Phe Ser Ala Val Met 
®^ ''^O 795 800 

Asp Gin Val Tyr Thr Ala Thr Val Arg Glu Lys Glu Gly Gly Ala Tyr 

805 810 81S 

Ser Val Ala Ali. Phe Gly Gly Leu Glu Gin Tyr Pro Cln Pro Lys Ala 

820 825 830 

Leu Met Gin He Tyr Phe Pro Thr Asp Pro Ala Arg Ala Glu Glu Met 

835 840 84S 

Asn Ala He Val Phe Ala Glu Leu Glu Lys Leu Ala Lys Glu Gly Pro 
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fiso fl«i5 eeo 

Asn Val Glu ryr Phe Lys Lys Thr He Glu Asn Leu Asn Lys Gin Hi* 
863 B70 875 880 

Lys Glu Ser Leu Arg Glu Asn Arg Phe Trp Leu Glu Ala Met Lys Ala 

88b 890 895 

Ser Phe Phe Glu Gly Asn Asp Phe lie Thr Asp Tyr Glu Ser Val Leu 

900 90S 910 

Asn Gly Leu Thr Pro Ala Glu Leu Gin Lys Phe Ala Ala Asp Leu Leu 

915 920 925 

Lys Gin Gin Asn Arg Val Val Val Mot Met Ala Pro Val Ala Lys Ala 
930 935 940 

Gin 
945 



15 { 2] InroRIIATIOH FOR SEQ ID NO: 283 

(1) SRQUENCE CHARACTERISTICS: 

<A) LENGTH: 686 amino acids 
(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

{iij IIOLECL'LE TYPE: protein 

(iii) MVPOTHETICAL: YES 



25 



30 



35 



50 



55 



65 



(vi) ORIGINAL SOURCE: 

(A) ORGAHISII: Porphyromonas gingival is 

(Ix) FEATURE: 

(A) HAHE/KEY: mi5C_f eature 

(B) LOCATION 1. . .686 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283 



Tyr Thr Ilet Ser Lys Lys Gly Thr lie Gly Val Thr Ser Asp Asn He 

15 10 15 

Phe Pro Val 11= Lys Lys Phe Leu Tyr Ser Asp His Glu He Ph** L-u 

20 25 30 

Arg Glu He Val Ser Asn Ala Val Asp Ala Thr Gin Lys Leu Lys Thr 
40 35 40 45 

Leu Thr Ser Val Gly Glu Phe Lys Gly Glu Thr Gly Asp Leu Aro Val 

50 55 60 

Thr Val Ser Val Asp Glu Val Ala Arg Thr He Thr Val Ser Asp Arg 

Gly Va3 Cly Met Thr Glu Glu Glu Val Glu Lys Tyr He Acn Gin He 

85 90 95 

Ala Phe Ser Ser Ala Glu Glu Phe Leu Glu Lys Tyr Lys Asp Asp Lys 

100 105 110 

Ala Ala He He Gly His Phe Gly Leu Gly Phe Tyr Ser Ala Phe H»t 

lis 120 125 

Val Ser Glu Arg Val Asp Val He Thr Arg Sar Phe Arg Glu Asp Ala 

130 135 140 

Thr Ala Val Lys Trp Ser Cys Asp Gly Ser Pro Glu Tyr Thr Leu Glu 

ISO 155 
Pro Ala Asp Lys Ala Asp Arg Gly Thr Asp He Val Met His He Asp 

165 170 175 

Glu Glu Asn Ser Glu Phe Leu Lys Lys Glu Lys He Glu Gly Leu Leu 

180 185 190 

Gly Lys Tyr Cys Lyn Phe Leu Thr Val Pro He He Phe Gly Lys Lys 
60 195 200 20b 

Gin Glu Trp Lys Asp Gly Lys Met Gin Asp Thr Asp Glu Asp Asn Gin 

210 215 220 

He Asn A-p Thr His Pro Ala Trp Thr Lys i.ys Pro Ala Asp Leu Lys 
225 230 235 240 

Asp Glu Asp Tyr Lys Glu Phe Tyr Arg Ser Leu Tyr Pro Met Ser Glu 

245 250 255 

Glu Pro Leu Phe Trp He His Leu Asn Val Asp Tyr Pro Phe Asn L»u 

260 265 270 

Thr Gly He Leu Tyr Phe Pro Lys He Lys Asn Asn Leu Asp Leu Gin 

275 280 285 

Arg Asn Lys He Gin Leu Tyr Cys Asn Gin Val Tyr Val Thr Asp Glu 

290 2?5 300 

Val Gin Gly He Val Pro Asp Phe Leu Thr Leu Leu His Gly Val H** 
310 315 320 

Asp Ser Pro Asp He Pro Leu Asn Val Ser Arg Ser Tyr Leu Gin Ser 



75 
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50 



55 



60 











325 










330 










335 




Asp 


Ala 


Asn 


Val 
340 


Lys 


Lys 


He 


Ser 


Ser 
345 


His 


He 


Thr 


Lys 


Lys 
350 


Val 


Ala 


Asp 


Arg 


Leu 
355 


Glu 


Glu 


He 


Phe 


Lys 
360 


Asn 


Asp 


Arg 


Pro 


Thr 
365 


Phe 


Glu 


Glu 


Lys 


Trp 


Asp 


Ser 


Leu 


Lys 


Leu 


Phe 


Val 


Glu 


Tyr 


Gly Met 


Leu 


Thr Asp 




370 










375 










380 










Glu 


Lys 


Phe 


Tyr 


Glu 


Arg 


Ala 


Ala 


Lys 


Phe 


Phe 


Leu 


Phe 


Thr Asp Met 


3BS 










390 










395 










4 00 


Asp 


Gly 


His 


Lys 


Tyr 


Thr 


Phe 


Asp Glu 


Tyr 


Arg Thr 


Leu 


Val 


Glu 


Gly 










405 










410 










415 


Val 


Gin 


Thr Asp 


Lys 


Asp 


Gly 


Gin 


Val 


Val 


Tyr 


Leu 


Tyr Ala 


Thr Asp 








420 










425 










430 






hys 


Hi3 


Gly 


Cln 


Tyr 


Ser 


His 


Val 


Lys Arg 


Ala 


Ser Asp 


Lys 


Gly Tyr 






435 










440 










445 








Sec 


Val 


Met 


Leu 


Leu 


Asp 


Gly 


Gin 


Leu 


Asp 


Pro 


His 


He 


Val 


Ser 


I,eu 




450 










455 








460 










Lou 


Glu 


Gin 


Lys 


Leu 


Glu 


Lys 


Thr 


His 


Phe 


Val 


Arg 


Val 


Asp 


Ser 


Asp 


465 










470 










475 










480 


Thr 


lie 


Asn 


Asn 


Leu 


IIq 


Arg 


Lys 


Glu 


Glu 


Arg TUa 


Glu 


Val 


Lys 


Leu 










485 










490 










4 95 




Ser 


Asp 


Thr 


Glu 
500 


Arg 


Ala 


Thr 


Leu 


Val 
505 


Lys 


Leu 


Phe 


Glu 


Ala 
510 


Arg 


Leu 


Pro 


Arg 


Asp 
515 


Glu 


Lys 


Lys 


His 


Phe 

520 


Asn 


Val 


Ala 


Ph-% 


Glu 
525 


Ser 


Leu 


Gly 


Al a 


Glu 


Gly Glu 


Ala 


He 


Leu 


He 


Thr 


Gin 


Ala 


Glu 


Phe 


Met 


Arg 


Aiy 




530 










S35 










540 






Met 


Arg 


Asp Met 


Ala 


Gin 


Lsu 


Gin 


Pro 


Gly 


Met 


Ser 


Phe 


Tyr 


Gly 


Glu 


545 










SiiO 










555 








560 


Leu 


Pro 


Asp 


Sftr 


Tyr 
565 


Asn 


Leu 


Val 


Leu 


Asn 
570 


Thr 


Asp 


His 


Pro 


Leu 
575 


He 


A^p 


Arg 


Val 


Leu 

580 


Ser 


Gly 


Glu 


Lys 


Glu 
585 


Ser 


Val 


Glu 


Pro 


Ser 
590 


Leu 


Thr 


Glu 


Leu 


TVrg 


Ala 


Lys 


H e 


Ala 


Glu 


Lsu 


Lys 


Ala 


Glu 


Glu 


Ala 


Lys 


Leu 






595 










600 










605 






Leu 


Asp 
610 


Glu 


Glu 


Lys 


Gly 


Lys 
615 


Lys 


Pro 


Glu 


Glu 


He 
620 


Pro 


Val 


Ala 


Thr 


Lys 


Glu 


Ala 


T.ys 


Glu 


Asn 


Asn 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


l-hr 


Glu 


Gly 


625 










630 










635 








640 


Ser 


He 


Asn 


Asp 


Gin 


Leu 


Thr 


Lys 


Tyr Ala 


Gin 


Asp Asn 


Glu 


Leu 


He 










64b 










650 










655 




Gly 


Gin 


Leu 


He 


Asp 


Leu 


Ala 


Leu 


Leu 


Gly 


Ser 


Gly Leu 


Leu 


Thr 


Gly 








660 










665 










67 0 






Glu 


Ala 


Leu 
675 


Ala 


Glu 


Phe 


He 


Arg 
680 


Arg 


Ser 


Gin 


Arg 


Leu 
685 


Leu 







10 



15 



20 



25 



30 



35 



40 



45 

(2) IHFOW1ATIO(>I FOR SEQ 10 MO: 284 

(i) SEQUEKCE CHARACTERISTICS: 

(A} LENGTH: 4 82 amino acids 
(Dl TYPE: amino acid 
in) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAN X SI 1: Porphyromonas gingiva lis 

iir.) FEATURE: 

(A) NAME/ KEY: miFc_f eaturs 
<B) LOCATION 1...4 82 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284 

Asp Tie Arg Gin Lys Arg Pro Cys Phe Asn Ala Asn Leu T-yr Phe Tyr 

IS 10 15 

Arg Ala Met Glu Lys Leu He Asp He Leu Val Val Asp A^p Asp Val 

70 20 25 30 

Ala Val Cys Ala Ala Leu Arg Leu Val Leu Lyt* Arg Ala Gly Tyr Acn 

35 40 45 

Pro Val He Ala Asn Ser Pro Asp Glu Ala Leu Ser He Met Arg Asn 

50 55 60 

75 Pro Asp Gly Gly Cys Lys Pro Ala Val He Leu Met Asp Met Asn Phe 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ri^Ji -tP«4, 1F«" ir"h ^^i, ipJ^i .rff tRXt, ipsi, 

U Q^L^IC^ O O w %,3 iE3 iCi; 0 :0 €;4r 



wo 99/29870 



202/490 

"'O 75 SO 

Ser Leu Scr Thr Ser GJ y Arq Glu Gly Leu Glu Leu Leu Glu Lys Met 

90 95 
Gin lie Phe Thr Set Cys Pro Val He Leu Met Thr Ala Trp Ala Ser 

105 110 
He Pro Leu Ala Val Glu Gly Met Arg Leu Gly Ala Phe Asp Phe He 

lis 120 125 

Gly Lys Pro Trp Asp Asn Asp Arg Leu Leu Arg Thr lie Asp Thr Ala 

130 135 
Leu His Leu Ala Ala Pro Ser Ala Val Ala Asn Pro Ser Glu Gin Ser 
150 155 160 

Asp Arg Asp Thr Ala Arg Gin Pro Lys Ala Thr Val Gin Glu Asn Asp 

165 170 175 

Pro Cys Ala His He He Gly Arg Ser Asp Ala He Cys Lys He Lv^ 

180 las 190 

Glu Arg lie Arg Arg He Ala Pro Thr His Ala Ser Val Leu H- Thr 

195 200 205 

Gly Glu Ser Gly Thr Gly Lys Glu Leu He Ala Glu Ala Leu His Arn 

^10 215 220 

Gly Ser Lys Arg Ala Ser Ala Pro The Val Lys Val Asn Leu Gly Gly 
225 230 23S 240 

He Pro Glu Ser Leu Phe Glu Ser Glu Leu Phe Gly His Lys t.ya Glv 

250 255 
Ala Phe Thr Asn Ala Phe Ser Asp Arg Lys Gly Arg Phe Glu Leu Ala 

260 265 270 

Asp Gly Gly Thr He Phe Leu Asp Glu He Gly Glu Leu Pro Val GJ v 

275 2B0 2Bb 

Asn Gin Val* Lys Leu Leu Arg Val Lciu Gin Glu Gin Thr Phe Glu Pro 

290 295 300 

Leu Gly Glu Ser Val S-r His Arg Val Asp He Arg Val Val Scr Ala 

310 315 
Thr Asn Ala Ser Leu Glu Arg Met Val Ala Glu Gly Arg Ph- Arg Glu 

325 330 335 

Asp Leu Tyr Tyr Arg He Asn Leu He His Leu His Leu Pro Pro Leu 

^-tO 345 350 

Arg Glu Arg Gin Glu Asp He Gin Leu Leu Val Glu Ala Phe Ser Glu 

355 360 365 

Ala Phe Ala Gin Ser Asn Gly Leu Pro His Ala Val Trp Ser Ala Glu 

370 375 
Ala Met Arg Arg He Cys Ala Met Pro Leu Pro Gly Asn Val Arg Glu 
390 395 400 

Leu Lys Asn Val Val Glu Arg Thr Leu Leu Leu Ser Gly Ser Arg Glu 

405 410 43^5 

He Ser Ala Arg Asp Val Ala Asp Phe Gly Ser Gin Val Thr Ala Ala 

^20 430 
Asp lUs Ser Asp Glu Arg Ala Leu Thr Asp Met Glu Glu Ala Ala H- 

435 440 445 

Arg Glu Thr Leu Thr Lys Tyr Asn Gly Asn Val Ser Arg Ala Ala Arg 

450 455 460 

Ala Leu Gly Leu Ser Arg Ala Ala Leu Tyr Arg Arg Met Glu Lys Tyr 

If^ , ^''^ 475 480 

Gly Leu 



PCT/AU98/01023 



(2) IHFOWtATION FDR SEQ ID NO: 285 

ti) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 263 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGXMAL SOURCE: 

(A) ORGAHISII; Porphyroinonas gingival Is 

t ix ) FEATURE : 

(A) HAIIE/KEY: misc_f eature 
IB) LOCATIOfJ 1. , .263 

(Kl) SEQUEKCE DESCRIPTION: SEQ ID NO: 285 

Arg Sei Leu Gin Ser Phe Gin A.-n Lys Lys His Ser Ser Met Leu Ly: 
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15 10 IS 



He 


Lys Asn 


Leu 


His 


Ala Thr 


Val 


Gin 


Gly 


Lys 


Glu He Leu 


Lys 


Glv 






20 








25 






30 


lie 


Asn Leu 


Glu 


He 


Asn Ala 


Gly Glu 


He 


His 


Ala He Met 


Gly 


Pro 




35 








40 








45 




Asn 


Gly Ser 


Gly 


Lys 


Ser Thr 


Leu 


Ser 


Ser 


Val 


Leu Val Gly His 


Pro 




50 






55 










60 






Ser 


Phe Glu 


Val 


Thr 


Glu Gly 


Glu 


Val 


Thr 


Phe 


Asn Gly He Asp 


Leu 


65 








70 








75 






80 


Leu 


Glu Leu 


Glu 


Pro 


Glu Glu 


Arg 


Ala 


His 


Leu 


Gly Leu Phe 


Leu 


Ser 








65 








90 




95 




Phe 


Gin Tyr 


Pro 


Val 


Glu He 


Pro 


Gly Val 


Ser 


Met Val Asn 


Phe 


Met 






100 








105 






110 






Arg 


Ala Ala 


Val 


Asn 


Glu His 


Arg 


Lys 


Ala 


He 


Gly Ala Glu 


Pro 


Val 




115 
















125 






Ser 


Ala Ser 


Asp 


Phe 


Leu Lys 


Met 


Met 


Arg 


Glu 


Lys Arg Ala 


He 


Val 




130 






135 










140 






GlU 


Leu Asp 


Asn 


Lys 


Leu Ala 


Ser 


Arg 


Ser 


Val 


Asn Glu Gly 


Phe 


Ser 


14S 








150 








155 




160 


Gly Gly Glu 


Lys 


Lys 


Arg Asn 


Glu 


He 


Phe 


Gin 


Met Ala Met 


Leu 


Glu 








165 








no 






175 




Pro 


Lys Leu 


Ala 


He 


Leu Asp 


Glu 


Thr 


Asp 


Ser 


Gly Leu Asp 


He 


Asp 






180 








185 






190 




Ala 


Leu Arg 


He 


Val 


Ala Gly 


Gly Val 


Asn Arg 


Leu Arg Ser 


Pro 


Glu 




195 








200 








205 






A£n 


Ala Ala 


He 


Val 


He Thr 


His 


Tyr 


Gin 


Arg 


Leu Leu Glu 


Tyr 


He 




210 






715 










220 




Lys 


Pro Asp 


Phe 


Val 


His Val 


Leu 


Tyr 


Ly? 


Gly 


Arg He Val 


Lys 


Ser 


225 








230 








235 




24 0 


Gly Gly Ala 


Glu 


Leu 


Al a Leu 


Thr 


Leu 


Glu 


Glu 


Lys Gly Tyr 


Asp 


Trp 








245 








250 






255 


Tie 


Lys Glu 


Glu 


He 


Gly Glu 

















260 

(2) IHFOW4ATIOH FOR SEQ ID NO: 28 6 

(ij SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 4 62 amino acids 

(B) TYPE: amino acid 
(Dl TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

<vi) ORIGIt^AL SOURCE: 

(A) ORGAHISM; Porphyroroonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: misc_teature 

(B) LOCATION 1. . .462 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 286 



Ser 


Met 


Ala 


Lys 


Glu 


Lys 


Thr 


He 


Tyr 


Val 


Cys 


Arg 


Ser Cys Gly Thr 


1 








5 










10 






15 


Lys 


Tyr 


Ala 


Lys 


Trp 


Gin 


Gly 


Asn 


Cys 


Asn 


Ala 


Cys 


Gly Glu Trp Asn 








20 










25 








30 


Ci-s 


He 


Acp 


Glu 


Glu 


i*ys 


Val 


Pro 


Ala 


Pro 


Ala 


Ser 


Gly Lys His Ala 






35 










40 










45 


Ala 


Lys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro Arg Leu Leu 




50 










55 








60 


Gin 


Asp 


Val 


Glu 


Ser 


Gly 


Asp 


Glu 


Glu 


Arg 


He 


Arg 


Leu Gly Asp Glu 


65 










70 










75 




80 


Glu 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


He 


Val 


Lys 


Gly Ala Phe Val 










85 










90 






95 


Leu 


Leu 


Gly 


Gly 


Glu 


Pro 


Gly 


He 


Gly 


Lys 


Ser 


Thr 


Leu He Leu Gin 








100 










105 








110 


Thr 


Val 


Leu 


Arg 


Leu 


Pro 


Gin 


Leu 


Arg 


Thr 


Leu 


Tyr Val Ser Gly Glu 






115 










120 










125 


Glu 


Ser 


Ala 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu Gly Gin Ala 




130 










135 










140 


Met 


Asn 


Gly 


Cys 


Tyr 


Val 


Tyr 


Cys 


Glu 


Thr 


Asn 


He 


Glu Arg He Leu 


145 










150 










155 




160 


Ser 


Arg 


Ala 


Glu 


Glu 


Leu 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


He Asp Ser He 
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WO 99/29870 PCT/AU98A)1023 

204/ 490 

165 170 175 

Gin Thr Vo3 Tyr Thr Giu <ilu Met Glu Ser Sex Ala Gly Ser Val Gly 

180 les 190 

Gin lie Arg Glu Cya Ala Ala I.eu Leu Leu Lys Tyr Cys Lys Thr Thr i. - - l ^ 

> 195 200 205 

Gly lie Pro Val Tie Val lie Gly Hi* He Thr hya Glu Gly Ser He 

210 215 220 

Ala Gly Pro Lys Val Leu Glu His He Val Asp Thr Val Leu Leu Phe 
225 230 2.35 240 

Asp Gly Affp Lys His His Leu Tyr Arg He Leu Arg Gly Gin Lys Asn 

245 250 255 

Arg Tyr Gly Ser Thr Ser Glu Leu Gly He Tyr Glu Met Arg Gin Asp 

260 265 270 

Gly I.eu Arg Gly Val Glu Asn Pro Ser Glu His Leu He Thr Arg A^n 
1^ 275 280 285 

Arg Giu Asp Leu Ser Gly He Ala He Ala Val Ala Met Glu Gly He 

290 295 300 

Arg Pro He Leu He Glu Ala Gin Ala Leu Val Ser Ser Ala He Tyr 
305 310 315 320 

Ala A^n Pro Gin Arg Ser Ala Thr Gly Phe Asp He Arg Arg Met Asn 

325 330 335 

Met Leu Leu Ala Val Leu Glu Lys Arg Ala Gly Phc Lys Leu He Gin 

340 345 350 

Lys Asp Val Phe Leu Asn He Ala Gly Gly He Ly5 He Ala Asp Pro 

355 360 365 

Ala Thr A-?p Leu Ala Val He Ser Ala Val Leu Ala Snr Ser Leu As-p 

370 375 380 

He Val He Pro Pro Ala Val Cys Het Thr Gly Glu Val Gly Leu Ser 
385 390 395 400 

Gly Glu He Arg Pro Val Ser Arg He Glu Gin Arg He Thr Glu Ala 

*0S 410 415 

Arg Arg He Gly Phe Lys Glu He Leu Vai Pro Ala Asp Asn Phe Arg 

420 425 430 

Gin Glu Asp Ala Gly Arg Phe Gly He Arg Leu Val Pro Val Arg Lys 
OO 435 440 445 

Val Glu Glu Ala Phe Arg His Leu Phe Ser Lys Gly Arg Glu 
450 455 460 



20 



25 



30 



40 



45 



50 



55 



60 



65 



70 



75 



(2) IHFORl'lATIOIJ FOR SEQ ID HO: 287 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEUiGTH: 373 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

Ml) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival Is 

(ix) FEATURE: 

(A) NAME/KEY: mlsc_feature 

(B) LOCATION 1...373 



(xi) SEQUENCE DESCRXPTIOH: , 


SEQ ID NO: 287 










Gly Ser 


Cys 


Gly Ser 


Ser Pro 


Ala 


He Gly 


Arg 


Ser 


Gly 


Cys 


Ser 


Arg 


1 




5 






10 










15 


Cys Asp 


Arg 


Arg Ho 


Gly His 


Gly 


Leu Ser 


Thr 


Ser 


Ala 


Phe 


Arg 


Thr 


Tyr Ala 




20 






25 








30 




Gly 


Leu Val 


Cys His 


Ser 


Arg Leu 


Arg 


Asp 


Gin 


Ser 


Ser 


Cys 


Arg Vai 


35 






40 








45 






Cys 


Gly Met 


Arg Pro 


He 


Gly Gin 


Leu 


Phe 


Ptie 


Pro 


Ser 


Cys 


50 






55 






60 








Pro Arg 


Arg 


Ser Ala 


Ser Asp 


Ala 


Gly TVrg 


Thr 


Glu 


Arg 


He 


Gin 


Ser 


65 






70 




75 








8G 


Pro Leu 


Gin 


Asn Leu 


Leu Gin 


Glu 


Met Gin 


Plte 


Ser 


Pro 


Thr 


Asn 


Arg 


Arg Asn 




85 






90 










95 


Pro 


Phe Asp 


Asn Met 


Asn 


Ser Arg 


His 


Leu 


Thr 


lie 


Thr 


He 






100 






105 








110 






He Ala 


Gly 


Leu Ser 


Leu Phe 


Val 


Leu Thr 


Leu 


Gly 


Gly 


Cys 


Ser 


Val 


Ala Gin 


115 






120 








125 








Gin 


Asp Thr 


Gin Trp 


Thr 


Leu Gly 


Gly 


Ly? 


Leu 


Phe 


Thr 


Ser 
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wo 99/29870 PCT/AU98/01023 

205/490 

130 135 140 

Ala Trp Ilo Gin Arg Scr Ala Glu Tyc Gin Ala Leu Cys He Gin Ala 
150 155 160 

Tyr Asn He Ala Thr Glu Arg Val Asp Ala Leu Pro Ala Glu Arq Lvs 
> 165 170 175 

Gin Gly Asp Arg Pro Tyr Ala lie Val Thr Asp Ilo Asp Glu Thr He 

180 185 190 

Leu Asp Asn Thr Pro Asn Ser Val Tyr Gin Ala Leu Arg Gly Lys Asp 

19S 200 205 

Tyr Asp Glu Glu Thr Trp Gly Lya Trp Cys Ala Gin Ala Asp Ala Asp 

210 215 220 

Thr Leu Ala Gly Ala Leu Ser Phe Phe Leu His Ala Ala Asn Lys Gly 
225 230 235 240 

He Glu Val Phe Tyr Val Thr Asn Arg Arg Asp Asn Leu Arg Glu Ala 
15 245 250 255 

Thr Leu Gin Asn Leu Gin Arg Tyr Gly Phe Pro Phe Ala Asp Glu Glu 

260 265 270 

His Leu Leu Thr Thr His Gly Pro Ser Asp Lys Glu Pro Arg Arq Leu 

275 280 285 

I.ys He Gin Glu Gin Tyr Glu He Val Leu Leu tie Gly Asp Asn Leu 

290 295 300 

Gly Asp Phe His His Phe Phe Asn Thr Lys Glu Glu Ser Gly Ara l.vf 
305 310 315 320 

Gin Ala LffU Gly Leu Thr Ala Gly Glu Phe Gly Arg His Phe He Met 
25 325 330 335 

Leu Fro Asn Pro Asn Tyr Gly Ser Trp Glu Pro Ala Trp Tyr Gly Gly 

340 345 350 

Lys Tyr Pro Pro Leu Pro Glu Arg Asp Lys Ala Leu Lys Gin Leu Hi- 
355 360 365 

oU Ser Gin Asn Ser Arg 
37 0 

(2) INFORMATION FOR SEQ ID HO: 288 

35 (i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vil ORIGINAL SOURCE: 
45 (A) ORGAJJISM: Porphyromonas ginglvalis 



50 



60 



65 



70 



75 



(ixl FEATURE: 

(A) HAl-IE/KEY: micc_feature 
(D) LOCATIOI4 1 . . .426 

t;;i) SEQUEIJCE DESCRIPTlOt* : SEQ ID MO: 288 



Arg 


Trp 


Gly 


Phe 


Ser 


Asn 


Phe 


Val 


Ala 


Met 


Ser 


Thr Asn 


He 


Asp Val 


1 








5 










10 








15 


Gin 


Gin 


He 


Lys 


Gin 


Arg 


Phe 


Gly 


He 


He 


Gly 


Ser Ser 


Pro 


Leu Met 


Glu 






20 










25 






30 




His 


Ala 


He 


Arg 


Val 


Ala 


Ala 


Gin 


Val 


Ala 


Pro Thr Asp Met Ser 






35 










40 








45 






Val 


Leu 


Val 


Thr 


Gly Glu 


Ser 


Gly 


Ser 


Gly 


Lys 


Glu Phe 


Phe 


Pro Gin 


He 


50 










55 








60 




He 


His 


Tyr 


Tyr 


Ser 


Ala 


Arg 


Lys 


His 


His 


Ser Tyr 


He 


Ala Val 


65 










70 










75 




80 


Asn 


Cys 


Gly Ala 


He 


Pro 


Glu 


Gly Thr 


He 


Asp 


Ser Glu 


Leu 


Phe Gly 


His 








85 










90 








95 


Arg 


Lys 


Gly 


Ser 


Phe 


Thr 


Gly 


Ala 


Val 


Ser 


Asp Arg 


Lys 


Gly Tyr 








100 










105 








110 


Phe 


Glu 


Glu 


Ala 


Ser 


Gly Gly 


Thr 


He 


Phe 


Leu 


Asp Glu 


Val 


Gly Glu 






115 










120 








125 




Leu 


Pro 


Leu 


Pro 


Thr 


Gin 


Ala 


Arg 


Leu 


Leu 


Arg 


Val Leu 


Glu 


Thr Gly 




130 










135 








140 






Glu 


Phc 


He 


Pro 


Val 


Gly Ala 


Ser 


Gin 


Ser 


GTn 


Lys Thr 


Asp 


Val Arg 


14b 










150 










155 




150 


He 


Val 


Ala 


Ala 


Thr 


Asn 


Val 


Asn 


Leu 




Glu 


Ala Val 


Ala 


Asn Gly 










165 










170 








175 


Lys 


Phe 


Arg 


Glu 


Asp 


Leu 


Phe 


Phci 


Arq 


Leu 


Asn 


Thr Val 


Pro 


He Glu 
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20 



206/490 

180 185 190 

Val Pro AJa Leu Art? Met Arg Pro Asp Asp Val Pro Leu Leu Phe Arg 

195 200 205 

Arg Phe Ala Ala Asp Ser Ala Glu Lys Tyr Arg Met Pro Pro Leu Arg 
5 210 215 220 

Leu Ser Asp Glu Ala Arg Thr lie Leu Met Arg Tyr Arg Trp Pro Gly 
225 230 235 240 

Asn Val Arg Glu Leu Arg Asn lie Thr Asp Arg Leu Ser He Leu Glu 

250 . „ . 255 

lU Glu Glu Arg Thr Val Ser Ala Glu Thr He Thr Arg Tyr Leu Asp Ala 
260 265 270 

Glu Gly Met Gin Asp Leu His Pro Val Val He Arg Arg Asn Glu Thr 

275 2B0 285 

Thr Glu Ala Asp Lys Gin He Pro His Tyc Glu Arg Glu He He Tyr 
15 290 295 300 

Gin Val Leu Tyr Asp Het Lys Lys Glu He Ala Asp Leu Lys Gly Met 
305 310 315 320 

Met Asn Arg Leu Ala His His Glu Gin Pro Ser Trp Pro Val Gly Ser 

325 330 335 

Asp Val Trp Gly Asn Asp Asp Lys Arg Thr Ala Asp Pro Lys Trp Gly 

3-10 345 350 

Vn3 Sor Thr His Lys Ala Pro He Ala Asn Ala Ala Glu Pro Val Glu 

355 360 365 

Pro lie Gin Glu Ala Ser Glu Tyr Thr Glu Asp Pro Val Ser Leu Glu 
^5 370 375 380 

Glu Val Glu I.yr? Lys Met He Ser Leu Ala Leu Glu Arg His Gly Gly 
3B5 390 395 400 

Arg Arg Lys Gin Thr Ala Glu Glu Leu Lys He Sor Glu Arg Thr Leu 

on 415 

OU Tyr Arg Lys He Lys Glu Tyr Gly Leu Glu 
420 425 

(2) INFORJHATION FOR SEQ ID NO: 28 9 

35 (i) SEQUF.MCE CHARACTERISTICS: 

(A) LENGTH: 653 amino acids 

(B) TYPE: amino acid 
iV) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vij ORIGIHAL SOURCE: 
4'? [A) ORGAIJISt'l: Porphyromonas gingival Is 



50 



55 



60 



65 



70 



75 



(ixl FTATURE : 

(A) tWIE/KEY: mi3c_featur- 

(B) LOCATIOIJ 1. . .653 





(Xi) SEQUEHCF. DESCRIPTION: 


SEQ 


ID HO:2B9 








Lys 


Asn Leu 


Pro 


Arg 


Tyr 


Ala 


Pro 


Ala 


Phe 


Gly Arg 


Asn Arg 


Lys 


Arg 


1 


Gin Pro 




5 










10 








15 


Glu 


Asp 


Pro 


Asp 


Asn 


Asp Asp 


Ser 


Arg 


Gly Arg Ser 


His 


Arg 


Gin 




20 










25 








30 




Ser Gly 


Arg 


Ala 


Val 


Thr 


Val 


Val 


Cys 


Arg Arg 


lie Gin 


Thr 


Pro 




35 










40 










45 






Leu 


Pro His 


Met 


He 


Leu 


Leu 


Phe 


Gly Gly 


Thr 


Thr 


Glu Gly Arg Ala 




50 








55 










60 








Ala 


Ala Arg 


Val 


Leu 


Asp 


Glu 


Ala 


Gly 


Ser 


Pro 


Phe 


Phe Tyr 


Ser 


Thr 


65 








70 










75 






80 


Lys 


Gly Asn 


Leu 


Gin 


Glu 


He 


Gin 


Ser 


Ser 


His 


Gly 


His Arg 


Leu 


Thr 


Gly 


Ala Het 




85 










90 






95 




Thr 


Val 


Ala 


Asp 


Met 


Val 


Ser 


Phe 


Cys 


Arg Lys 


Glu 


Glu 


He 




100 










105 








110 






Arg Leu 


He 


Val 


Asp 


Ala 


Ala 


His 


Pro 


Phe 


Ala 


Glu Glu 


Leu 


His 




115 










120 










125 






Ala 


Ser Val 


Ala 


Glu 


Ala 


Ser 


Glu 


Gin 


Thr 


Gly 


He 


Pro Val 


Val 


Arg 




130 








135 










140 






Tyr 


Gl u Arq 


Gin 


Tyr 


Pro 


Pro 


Ary 


Glu 


Glu 


Gly 


He 


Val Trp 


Cys 


Ala 


145 








150 










155 




160 


Asn 


Tyr Asp 


Thr 


Ala 


Ala 


Glu 


Arg 


Het 


Leu 


Gly Asp 


Gly Val 


Gin 


Arg 


Leu 






165 










170 








175 


Leu MeL 


Leu 


Thr 


Gly 


Val 


Asn 


Thr 


He 


Pro 


Lys 


Leu Ala 


Ala 


Phe 
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WO 99/29870 PCT/AU98/01023 

207/ 490 

180 ia5 290 

Trp Lys Glu Arg Thr Thr Phe Cys Arg lie Lsu Lys Arg A^p Glu Ser 

195 200 205 

Val Ala Leu Ala Glu Lys Asn Gly Phe Pro Ala Glu Arg lie Val Phe 
' 210 215 220 

Phe Glu Pro His Ala Asp Glu Glu Leu Met Gin Ala Val Arg Pro Asp 
22S 230 235 240 

Ala lie He Thr Lys Glu Ser Gly Glu Ser Gly Tyr Phe Arg Glu Lys 

245 250 255 

He Glu Ala Ala Arg Arg Met Gly He Arg lie Tyr Ala Val Val Arg 

260 265 270 

Pro Pro Leu Pro Pro Ser Phe He Pro Val Gly Gly Pro Val Gly Leu 

275 280 285 

Arg Arg Ala Val Glu Arg Leu Val Pro Gly Phe Fhe Ser Leu Arg Ser 
15 290 295 300 

Gly Phe Thr Thr Gly Thr Thr Ala The Ala Ala Val Val Ala Ala Het 
305 310 315 320 

Tyr Arg Leu Met Gly Leu Gly Ser Leu Ala Glu Ala Pro Val Glu L°u 

325 330 335 

Pro Ser Gly Glu He Val Ser Leu Pro He Ala Glu He Arg Glu Glu 

340 345 350 

Glu Asp Ala Val Vai Ser Ala Val Leu Lys Asp Ala Gly Asp Asp Pro 

355 360 365 

Asp Val Thr Acn Gly Het Ala Val Cys Ala Thr He Arg Leu Asn Pro 
25 370 375 380 

Glu His Glu Glu Val Arg Phe Leu Gin Gly Glu Gly Val Gly Val Val 
385 390 395 400 

Thr Leu Pro Gly Leu Gly Leu Glu Val Gly Gly Pro Ala He Asn Leu 

405 410 415 

Val Pro Arg Arg Met Met Thr Ala Glu Val Arg Arg Leu Tyr Ala Gin 

420 425 430 

Gly Gly Val Asp He Thr He Ser Val Pro Glu Gly Arg Glu Ala Ala 

435 440 445 

Thr Gin Thr Phe Asn Pro Arg Leu Gly He Arg Asp Gly He Ser II- 

450 455 460 

He Gly Thr Ser Gly Val Val Lys Pro Phe Ser Ala Glu Ala Phe Val 
470 475 480 

Gly Ala He Arg Lys Gin Val Gly He Ala Thr Ala Leu Gly Ala Asn 

485 490 495 

His He Val Leu Asn Ser Gly Ala Lys Ser Glu Arg Tyr Val Lys Gly 

500 505 510 

Ala Tyr Pro Ala Leu He Pro Gin Ala Phe Val Gin Tyr Gly Asn Phe 

S15 520 525 

Val Gly Glu Ser Leu Ser Cys Val Ala Ser Phe Pro Ser Val Arg Ser 
45 530 535 540 

Val Thr Val Gly He Het Leu Gly Lys Ala Val Lys Leu Ala Glu Glv- 
5**^ i>i>0 555 560 

Tyr Leu Asp Thr His Ser Lys Lys Val Val Met Asn Arg Arp Phe I.pn 
cn 57C 575 

OU His Glu Leu Ala Arg Gin Ala Gly Cys Ser Glu Asp He His Ala He 
580 585 590 

He Asp Ser Leu Asn Leu Ala Arg Glu Leu Trp Thr Met Pro Ser Ala 

59S 600 605 

Glu Asp Ser Asp Arg Leu Leu Arg Lys He Ala Glu Arg Ser Trp Glu 
55 610 615 620 

Thr Cys Arg Pro Ser Val Pro Ser Ala Glu Leu Glu Leu Leu Leu H- 
«5 630 635 640 

Asp Glu Ser Gly Ala Tie Arg Phe Arg He Gly Gly Glu 
645 650 



30 



40 



60 



65 



(2) IHFOFU-IATION FOR SEQ ID MO: 290 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 451 amino acid? 
(D) TYPE: amino acid 
(Dl TOPOLOGY; linear 



Ui) MOLECULE TYPE: protein 

70 (ill) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAl-flSM: Porphyromonas gingival is 

75 (ix) FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



■rP-^i iF'^' M-*^j x^i -rtP rf«=fc ™=-h iF=is iF'=4i ^F=ii 




Wd 99/29870 PCT/AU98/01023 
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10 



20 



30 



(A) NAME/ KEY: mi rfc_f eature 
(8) LOCATIW 1. . .451 

SEQUFJIce DESCRIPTION: SEO ID HO: 2 90 

Gin Gin Arg Pro His Arg He He Met r.cu Arg Thr Phe Arg He Gly 
15 10 IS 

Gly lie His Pro Pro Glu Asn Lys Leu Ser Ala Gly Lys Pro Val Glu 

20 25 30 

Val Leu Pro He Pro Ser Gin Val Val He Pro Leu Gly Gin His He 

35 40 45 

Gly Ala Pro Ala Thr Ala Thr Val Lys Lys Gly Asp Glu Val Lys Val 

50 55 60 

Gly Thr He He Ala Gin Ala Gly Gly Phe Val Ser Ala Asn He His 
15 6S 70 75 80 

Sec Ser Val Ser Gly Lys Val Leu Lys He Asp Asn Val Tyr Asp Ser 

85 90 95 

Ser Gly Tyr Pro Lys Pro Ala Val Phe He Ser Val Glu Gly Asp Glu 

100 lOb 110 

Trp Glu Glu Gly He Asp Arg Ser Pro Ala He Val Lys Glu Cys Afin 

115 120 125 

Leu Asp Ala Lys Glu He Val Ala Lys He Ser Ala Ala Gly He Val 

330 135 140 

Gly Leu Gly Gly Ala Thr Phe Pro Thr His Vai Lys Leu Ser Pro Pro 
25 145 ISO 155 160 

Pro Gly Asn Lys Ala Glu He Leu lie He Asn Ala Val Glu Cys Glu 

165 170 175 

Pro Tyr Leu Thr Ser Asp His Val Leu Met Leu Glu His Gly Glu Glu 

180 185 190 

He Met He Gly Val Ser He Leu Met Lys Ala He Gin Val Asn Lys 

195 200 205 

Ala Val He Gly Val Glu Asn Asn Lys Lys Asp Ala He Ala His Leu 

210 215 220 

Thr Lys Leu Ala Thr Ala Tyr Pro Gly He Glu Val Met Pro Leu Lys 
225 230 235 240 

Val Gin Tyr Pro Gin Gly Gly Glu Lys Gin Leu He Asp Ala Val He 

24S 250 255 

Arg Lys Gin Val Lys Ser Gly Ala Leu Pro He Ser Thr Gly Ala Val 

260 265 270 

Val Gin Asn Val Gly Thr Val Phe Ala Val Tyr Glu Ala Val Gin Lys 

275 280 285 

Asn Lys Pro Leu Val Glu Arg He Val Thr Val Thr Gly Lys Lys Leu 

290 295 300 

Ser Arg Pro Ser Asn Leu Leu Val Arg He Gly Thr Pro He Ala Ala 
45 305 310 315 320 

Leu He Glu Ala Ala Gly Gly Leu Pro Glu Asn Thr Gly Lys He He 

325 330 335 

Gly Gly Gly Pro Met Met Gly Arg Ala Leu Leu Ser Pro Asp Val Pro 

340 345 350 

Val Thr Lys Gly Ser Ser Gly Val Leu He Leu Asp Arg Glu Glu Ala 

355 360 365 

Val Arg Lys Pro Met Arg Asp Cys He Arg Cys Ala Ly.-? Cys Val Gly 

370 375 380 

Val Cys Pro Met Gly Leu TVen Pro Ala Phe Leu Met Arg Asp Thr Leu 
385 390 395 4OO 

Tyr Lys Ser Trp Glu Thr Ala Glu Lys Gly Asn Val Val Asp Cys He 

405 410 415 

Glu Cys Gly Ser Cys Ser Phe Thr Cys Pro Ala Asn Arg Pro Leu Leu 

420 425 430 

Asp Tyr He Arg Gin Ala Lys Lys Thr Val Met Gly He Gin Arg Ala 

4 35 440 445 

Arg Lys Gin 
450 

65 (2) INPORHATIOH FOR SEQ ID HO:291 

H) SEOyEHCE CHARACTERISTICS: 

(A) LENGTH: 4 89 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

fill) HYPOTHETICAL: YES 



40 



50 



60 



70 



75 
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209/490 

{vil ORIGINAL SOURCE: 

(AJ OKGATJI SH: Porphyromonas gingival is 

(ix) FEATX'RE: 

(A) UMJE/KEY: mlsc_feature 
IB) LOCATION 1, . . 489 



10 



20 



35 



40 



55 



(xi) SEQUENCE DESCRIPTIOTJ: SEQ ID NO: 291 

Met Asn Ser Gin Lys Lys Glu Ala Phe Asn Met. Ly55 Arg He Gin Leu 
^ S 10 15 

Thr Leu He Ala Leu Ph- Ala Ala Val Ala Gly Leu Val Ala Gin Atn 

20 25 30 

Ala Tyr Glu Gly Val He Ser Tyr Lys He Ser T.eu Asp Lys Thr Glv 
13 3S 4 0 4 5 

Asn Lys Val Val Lew Asn Gly Ala Ala Asp Met Ser Asn Leu Lys Leu 

SO 55 60 

Lys Ser Thr Gin Met He He Val Thr Pro He Leu Arg Ser Glu A^p 

"'O 75 80 

Gly Thr Ser Arg Val Glu The Pro Ser Val Val He Thr Gly Arg Asn 

85 90 95 

Arg Thr Lys Ala Leu Lys Ary Glu He Ala Phe Ser Ser Ala L«»u Pro 

100 105 no 

Gin Ala Lyr Hii? Ala Ala Gin Tyr He Arg Arg His Asn Gly Lys S-r 

115 120 125 

Glu Gin Phe Ala Phe Thr Gly Glu Ills Ala Tyr Ala Ser Trp Met Met 

130 135 
Arp Ala Lys Phe Val ^al Arg Glu Glu Val Arg Gly Cys Ala Lys Cy.^ 

on leo 

Pro Val Gly Leu Ser Ser Asn He VaJ Pro Phe Asp Pro Leu Ph*> Asn 

1«S 170 
Pro Ala Glu Ala Pro Tyr Leu Leu Ala His He Thr Pro Ala Glu Glu 

IBO 1B5 1^90 

Val Glu Lys Gin Arg Glu Ser Ser Phe Asp Ala Tyr lie Asn Phe Lvs 

155 200 205 

Val Affn Lys Ala Asp Val Leu Pro Glu Tyr Arg Asn Asn l.ys Ala Glu 

210 215 220 

Leu Glu Lys He Lys Glu Phe Val Ser Thr Val Lys Ala Asn Pro Asn 
230 235 240 

Tyr Ser Val Asn Lys Met He He Glu Gly Phe Ala Ser Pro Glu Ala 

245 2S0 255 

Ser He Ala His Asn Lys Ala Leu Ser Glu Arq Arg Ala Lys Arg Leu 

2C0 265 270 

Ala Glu Glu Leu Val Arg Lys Tyr Gly Lys Thr Leu Pro Asn He Thr 

27S 280 285 

Thr Glu Phe Gly Gly Glu Asp Trp Lys Gly I.eu T.ys Leu Ala He Glu 

2^>C» 295 300 

Lys Ser Asp He Ala A»p Arg Asp Arg Val Leu Glu He He Aen S-r 
en 315 320 

OU Asp Lys Tyr Ala Asp Asp Asp Ala Arg Glu Gin Ala Leu Lys Gin Leu 
32b 330 335 

Ser Ser Tyr Arg Tyr He Leu Asp Gin He Tyr Pro Asn Leu Arg Arq 

340 345 350 

Asn Thr He Thr Met Gly Tyr He Val Arg Asp Tyr Thr Leu Glu Glu 

3ii5 360 365 

Ala Arg Glu He He Lys Thr Ala Pro Lys Glu Leu Ser Glu Ala Glu 
^■'O 375 380 

3II '^^'^ ^""^ 350 ^"'^ '^^'^ "^"^ ^^"^ ^"'^ 

60 Phe Ala Leu Asn Thr Thr Leu Lys Tyr Ph- Pro Glu Ser Val Thr Glv 

405 410 415 

Arg He Asn Leu Ala Val Ala Ala Phe Asn Gly Gly Aijp Val Gin Gin 

^20 425 430 

Ala He Ala Leu Leu Ser Pro He Gin Thr Glu Lys Gly Val Ser A-n 

00 435 440 445 

He Leu Gly Ala Ala Tyr Ala Arg Thr Gly A^p Phe Ala Arg Ala Gin 

455 460 

Thr Phe Phe Arg Lys Ala Val Ala Glu Gly Asp Ala Asn Ala Gin Arg 

70 1 t . 480 
/U Asn Leu Asp Met Leu Leu Gly Lys Lys 

465 



75 



(2) IlIFORMATlOtJ FOR SZQ ID NO: 292 
(i) SEQUENCE aiARACTERISTICS: 



SUBSTTTUTE SHEET (Rule 26) (KO/AU) 
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WO 99/29870 



15 



45 



50 



210/490 



(A} LEHGTH: 384 amino acids 
(B) TYPE: amino acid 
(D} TOPOLOGY: linear 

5 tii) MOLECULE TYPE: pror.eln 

(iil) HYFOTHBTICAL: YES 

(vi) ORIGINAL SOURCE: 
^0 <A) OKGAMISII: Porphyromonas ginglvalls 



(i^'.t FEATURE: 

(A) NAME/ KEY: miec feature 

(B) LOCATION 1...384 



Ui» SEQUENCE DESCRIPTION: SEQ ID NO:292 

Asp Met Ala Glu Lys Arg Asp Tyr Tyr Glu Val T.eu Gly Val Ser Lys 

20 Asn Ala Thr Asp Asp Glu Leu Lys L/s Ala Tyr Arg Lys Lys Ala He 

25 30 
Gin Tyr His Pro Asp Lys Asn Fro Gly Asp Lys Glu Ala Glu Glu His 
35 40 ^5 

25 '^^'^ ^'V-:^ 

Arg Ser Gin Tyr Asp Gin Phe Gly Hi^ Ala Gly Leu Gly Gly Ala Ala 

Gly Gly Gly Phe Ser Gly Gly Gly Het Ser Met Glu Asp He Phe Ser 

95 

JU Arg Phe Gly Asp Leu Phe Gly Gly Phe Gly Gly Phe Gly Gly Phe Ser 

105 1X0 
Asp Met Gly Gly Gly Ser Arg Arg Arg Val Arg Arg Gly Ser Asp Leu 
lis 120 12b 

35 ^""^ llo ^""^ ^^'^ fl? 

Lys Lys Val Lys Val Lys Lys Gin Val Val Cys Ser Lys Cys Arc Gly 

155 

Asp Gly Thr Glu Glu Ala Asn Gly Lys Thr Thr Cys Gin Thr Cys Hi- 
An -^^^ 170 175 

40 Gly Thr Gly Val Val Thr Arg Val Ser Asn Thr Phe Leu Gly Ala Her 

185 190 
Gin Thr Gin Ser Thr Cys Pro Thr Cys His Gly Glu Gly Glu He He 

200 205 
Thr Lys Pro Cys Ser Lys Cys Lys Gly Glu Gly Val Glu He Gly Glu 

Glu Val He Ser Phe His He Pro Ala Gly Val Ala Glu Gly Het Gin 

Met Ser Val Asn Gly Lys Gly Asn Ala Ala Pro Ary Gly Gly Val Asn 

^''^ 250 255 

Gly Asp Leu He Val Val He /aa Glu Glu Pro Asp Pro Asn Leu He 

260 265 270 

Arg Asn Gly Asn Asp Leu He Tyr Asn Leu Leu He Ser Val Pro Leu 

275 200 285 

55 lin ^^'^ "^'^^ "^^^ ""^^ '^"^ Asp Gly Arg Ala 

295 300 

Lys He Arg He Glu Ala Gly Thr Gin Pro Gly Lys Met Leu Arg Leu 

3X0 315 

Arg Asn Lys Gly Leu Pro Ser Val Asn Gly Tyr Gly Met Gly Asp G^n 
cr\ 325 330 -a-ac 

60 Leu V3l Asn Val A.n Vai Tyr He Pro Glu Ser lie Asp Ala Zy^ A.p 

340 -a.ic ■* ^ 

Ala He Ala Ala Het Glu 

3^5 360 365 



340 3,j5 250 

Glu Gin Ala He Ala Ala Het Glu Asn Ser Asp Ser Pt.e Lys Pro Thr 
-*=*^ 360 365 

65 ""^^ 370 ''''' ^''^ ''^^ lie ''^P ^^'^ '-y"- '^Vr Arg Glu Het Leu Asp 

37^ 380 

(2) IWFORHATIOrJ FOR SEQ ID NO: 293 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 



PCT/AU98/01023 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



*F^i iF^' iP^ii 

u rsi 
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(iiij HYPOTHETICAL: YES 

(vl) ORIGTHAL SOURCE: 
^ (A) ORGANISM: PorphyroiT)onA.s gingival is 

iiX) FEATURE: 

<A) HAHE/KEY: misc_f eature 
«B1 LOCATION i. . .309 

10 (xi) SEQUENCE DESCRXPTIOTJ: SEQ ID NO: 293 

Arg Phe Asp Ser Thr Thr Asn Val Ser Gin He Asn Leu Arg Thr Glu 
^ S 10 15 

Met Ly? Lys Leu He Leu Ala Thr Leu Gly Leu Met Ala He Ala M»L 
13 20 25 30 

Leu Ser Cys Ser Ser Asn Asn Lys Asp Leu Glu Asn Lys Gly Glu Ala 

35 40 45 

Thr Leu Leu Val Thr Phe Gly Ser Ser Tyr Lys Ala Pro Arg Glu Thr 
50 55 60 



20 



25 



40 



45 



50 



55 



60 



65 



70 



75 



Tyr Ala Lys He Gin Lys Thr Phe Ala Ala Ala Tyr Pro Asp Gin Arg 
65 70 



He Ser Trp Thr Tyr Thr Ser Ser Ho He Arg Lyc Lys Leu Ala Gin 

05 90 05 

Gin Gly He Tyr He Asp Ala Pro Asp Glu Ala Leu Glu Lys Leu Ala 

■ 105 110 

Arg Leu Gly Tyr Lys Lys He Asn Val Gin Ser Leu His Val H- Pro 

lis 120 125 

Gly Arg Glu Tyr Asp Glu Met He Asp Phe Val Asn Lys Phe Lys Ala 

on ^-^^ 

Ala His Ser Asp He Thr Val Lys Val Gly Ala Pro Leu Phe Asp Thr 

155 160 
Asp Glu Aep Met Arg Glu Val Ala Glu He Leu His Lys Arg Phe Gin 

165 170 175 

Gin Thr He Glu Lys Gly Glu Ala He Val Phe Met Gly His Gly Thr 
'^^ ISO 185 190 

Glu His Ala Ala Asn Asp Arg Tyr Ala Arg He Asn Lys He Met Lys 

195 200 205 

Asn Tyr Ser Lys Phe Met He Val Gly Thr Val Glu Ser Asp Pro Ser 

210 215 220 

He Asn Asp Val He Ala Glu Leu Lys Glu Thr Gly Ala Thr Ala Val 
230 235 240 

Thr Met Met Pro Leu Met Ser Val Ala Gly Asp Hie Ala Thr Asn Asp 

245 250 255 

Met Ala Gly Aep Glu Asp Asp Ser Trp Lys Thr Leu Leu Thr Asn Ala 

260 265 270 

Gly T^^r Thr Val Ser He Asp Lys Leu Asp Asn Gly Asn Phe Ser Ala 

280 285 

^^"^ Sij; ^^"^ '^"'3 Asn He Trp Leu Lys His Met Lys 

290 295 

Ala Thr Ser Ala Arg 

305 

(2) INFORMATION FOR SEQ ID NO: 294 



ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 91 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

(ix) FEATURE: 

(A) NAI-IE/KEY: misc_feature 
(D) LOCATION 1 . . .491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294 

Arg Arg Asp Ala Arg Arg Gin Leu Val Arg Arg Arg Arg Lys Glu Leu 

Ser Gly Thr Ala Asn Thr Arg Cys Arg Arg Arg Vai Phe Pro Glu Arg 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



!T=h ir—h W™' iF'»ii iF*H) ,jr= — "^i iP^i »F-'^i -31=11 

UJi ™ii :0 -ai.. in:: 43 53 IB :U :0 43^ IUi3 



WO 99/29870 

212/490 

2° 25 30 

Arg Ser Ala lie Ary Trp Cys Phe Pro Arg L-ti Arrj A.-p Gly Leu Gin 

40 ^5 
Pro lie Arg Phe Cly Ser Leu Leu Ala Asn Pro Gin Gin Leu Ser Arg 

50 55 gQ 

Gly Val Arg Leu Arg Ser Phe Glu Ser Arg Arg Gin Pro He Cys Arg 

70 75 
Phe Gin Gly He Gin Gly Lys Arg Arg Asp Il« Leu Gin Pro Glu Gly 

35 90 05 

A^n Strr Leu Leu Arg Thr His Ala His Ala Ser Arg Leu Gly kla Gly 

Arg Ser Tyr Arg Tyr Leu Ala Pro Arg Thr Thr Ser Arg Pio Pro Thr 

120 12s 
Ala Phe Leu Leu Phe Ala Pro Met Thr Sor Val Ser His Leu Arg Thr 

130 135 
lie Ser Val Ala Gly He Leu Ala Ala Leu Gly Gly Ala Val Leu H- 
ISO j^g* 
Leu Phe Gly Val Asn Leu Phe Leu Gly Ser Val Ala Xle Pro M-t S-r 

1G5 170 3^75 

Glu lie Phe Arg His Leu Phe Ser Asp Arg Pro Glu Gly Gly Glu Ala 

l*'^ 105 190 

Leu Val His Tyr Asn He Leu Trp Lys Ser Arg Leu Pro Glu Ala Leu 

200 205 
Thr Ala Ala Phe Ala Gly Ala Gly Leu Ser Val Ser Gly Leu Gin M-t 

2-^0 215 220 

Gin Thr Val Phe Arg A*n Pro Leu Ala Gly Pro Ser Val Leu Gl- 11= 
- 230 23S 240 

Ser Ser Gly Ala Ser Leu Gly Val Ala Leu Val Val Leu Leu Ser Gly 

^'^^ 250 255 

Ser Leu Gly Gly Val Ala Leu Ser Ser Leu Gly Tyr Met Gly Glu Val 

260 265 270 

Ala Met A^n He Ala Ala Ala Val Gly Ser Leu Ala Val Met Gly Lew 

2"5 280 285 

He Val Phe Val Ser Thr Lya Val Arg Ser Ills Val Thr Leu Leu H- 

290 295 300 

He Gly Val Met He Gly Tyr Val Ala Thr Ala Val He Gly Val Ph- 

315 320 
Lys Phe Phe Ser He Glu Glu Asp He Arg Ala Tyr Val He Trp Gly 

325 330 335 

Leu Gly Ser Phe Ser Arg Ala Thr Asp Ser Gin Leu Ser Phe Phe Ala 
340 

He Leu Met Leu He Phe He Pro Ala Gly Met Leu Leu Val Lys Gin 

355 360 365 

Leu Asn Leu Leu Leu Leu Gly Glu Ser Tyr Ala Arg A.'=:n Leu Gly Leu 

^'^ 375 380 

Asn Thr Arg Arg Ala Arg Leu Leu Val He Ser Sei Ala Gly Leu Leu 

390 395 
He Ala Thr Val Thr Ala Tyr Cys Gly Pro He Gly Phe Leu Gly M-t 

^05 415 
Ala Val Pro His I.eu Ala Arg Val He Phe His Thr Ser A«p His Arg 

, 425 
He Leu Hot Pro Ala Thr Cys Leu He Gly Ser Ala Leu Ala Leu Phe 

435 ^40 445 

cys Asn He He Ala Arg Met Pro Gly Fhe Glu Gly Ala Leu Pro Val 
455 460 



Asn Ser Val Thr Ala Leu Val Gly Ala Pro He He Val Thr Val Leu 
Phe Arg Arg Arg Arq Phe Lys Glu Glu Thr Asp * ° 

J o c 



475 

485 490 
(2) TUFOWIATIW FOR SZQ ID NO: 2 95 



(i) SEQUEUCE CHARACFERISTICS: 

(A) LENGTH: 763 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

tiii) iJYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISr-I: Porphyromonas gingivalis 

(ixj FEATURE: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



His 
1 

He 

Ala 

Ala 

Asp 
65 
Thr 

Ala 

Arg 

Tyr 

Asp 
145 
Leu 

Gin 

hys 

Sor 

Set 
225 
Lys 

Lys 

Lys 

Gin 

Val 
305 
Gly 

S-r 

Ser 

Leu 

Asp 
385 
Asp 

Thr 

Thr 

Thr 

Leu 
465 
Asp 

Leu 

Tie 

Gly 

Thr 
£45 



(A) NAIIF./KEY: luisc feature 
IB) LOCATION 1 . . .763 

iKi) SEQUENCE DrsCRIPTIOIi: SEQ ID NO :2 9b 

Phe Leu Val Thr lie Met Arg Thr Lys Thr Tie 

5 10 
Sor Phe lie Ala Leu Leu Ser Ser Ser Leu Ser 

20 25 
Val Leu Thr Gly Ser Val Ser Asp Ala Glii Thr 

^5 4 0 

Gly Ala Arg lie Glu Val Lys His Thr Asn He 

S5 60 
Ala Gly Gly His Phe Glu He Lys Asn T.eii Pre 

70 75 
lie He Cys Ser Leu Gly Gly Tyr Gly Gin Lys 

95 90 
He Giu Ala Gly Gin Thr Lys Thr He Ser Phe 

100 105 
Thr Asn Asn Leu Glu Glu Val Val Val Thr Gly 

115 120 
Arg Leu Val Asp Ala Pro Val Ala Thr Glu Val 
130 135 

He Ala Ser fhe Ser Ala Pro Thr Ser Glu Ali 

- 150 

Ser Pro Ser Phe Asp Phe Gly Pro Asn Leu ll«t 

165 170 
Leu Asn Gly Leu Ser Ser Lys Tyr II" L-^u 11** 

180 1Q5 
Arg Val Tyr Gly Asp Val Gly Gly Gin Ala Asp 

19S 200 
Pro Asp Gin He Glu Arg He Glu Leu Val Lvs 
210 215 220 

Leu Tyr Gly Ser Asp Ala Tie Ala Gly Val H- 

230 235 
Lys Asn Thr Asn Arg Leu Ser Ala Tyr Thr Ser 

245 250 
Tyr Asn Asp Arg Gin Thr Asn Thr Ser Leu Asp 

260 265 
Phe Ser Ser Asn Thr Asn Tyr Phe Phe Tyr His 

275 2S0 
Asn Ser Pro Phe Glu He Ly.<- Lys Lys Lys Glv 
290 295 3oS 

Leu Glu Glu Thr Tyr Lys Lys Thr Phe Arg Ala 
310 

Val Ser Gin Scr Leu Ser Tyr Tyr Ala Thr Asn 
325 

Gly Asn Val Gin Tyr Asn Lys Arg Gin Tl*> Phr: 

340 3.15 
Glu Lys Lys Ala Tyr Asp Met Asp Tyr Arg Ala 

355 360 
Gly Thr Asn Tyr Leu Phe Pro Asn Gly L-u His 
370 375 380 

Ala Val Tyr Asp Arg Phe Arg Phe Gly Tyr L^u 

390 39i, 
Ser Ser Glu Ser Leu He Asn Asn Gin Gly Gin 

405 410 
Phe Phe Pro Gly Gin Leu Arg Asn Lys Asn Asp 

420 425 
Ala Glu Ala Arg Gly Val Phe Thr Leu Pro Tyr 
435 

Gly Gly Leu Glu Tyr Phe Arg Glu Glu Leu He 

n.. "^^^ ^60 

He Thr Asp Lys Ala Asp Ala Ser Thr LftU S'^r 

olu Trp Lys Pro Leu Asp Trp Phe Asn Met Thr 

485 490 

Val His His Gin Glu Phe Gly Thr Arg M»t Thr 

^00 505 

Leu Ala Lys Tyr Gly Pro Leu Asn Phe Arg Ala 

515 520 

Tyr Lys Thr Pro Thr Leu Lys Glu Leu Phe Ala 
535 540 

Thr Met Gly Ser His Asn Leu Tyr Leu Gly Asn 

555 



Phe Phe 

Ala Gin 

30 
Gly Glu 
45 

Val Ala 

Ala Gly 

Glu Glu 

Ala Leu 
HQ 
Thr Gly 
125 

Leu Thr 

Lou Leu 

Gly Ser 

Leu He 
190 
Leu Ser 
205 

Gly Ala 
Asn Val 
His Arg 



He Asn 
270 
Thr Asp 
2a5 

Ser Gly 

Gin Glu 

Asn Leu 

Thr Pro 
350 
Leu Thr 
365 

Thr Leu 

Tyr His 

Thr Glu 

Gin He 
430 
Ala Gin 
445 

Ser Pro 

Ala Tyr 

Ala Gly 

Pro Lys 
510 
Thr Tyr 
525 

Arg Asn 
Ala Asp 



Ala He 
15 

Ser Lys 

Pro Leu 

Gly Ala 

Gin His 

80 
Val Val 
9b 

Arg Leu 
Thr Arg 

Al a Lyt- 

Gin Gly 
160 
Phe Met 
175 

Asp Gly 

Arg He 

Ser Ser 

He Thr 
240 
He Ser 
255 

He Gly 



Gly Trp 

Glu Pro 

Asn Gin 
320 
Ser Phe 
335 

Thr Phe 

Ala Ser 

Ser Phe 

Asp Lys 
400 
Gin Pro 
415 

Arg Tyr 

Lys Leu 

Tyr Asn 

Val Gin 
480 
Phe Arg 
4 95 

Val Ser 

Ala Asn 

Glu LetJ 

Leu Lys 
560 
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Pro Gin Met S-r Asp Tyr Tyr Ala Leu Gly Lsu Glu Tyr Asn Gin Gly 

565 570 575 

Pro lie Ser Phe Ser Ala Thr Val Tyr Asp Asn Glu Leu Arg Asn Leu 
_ 580 585 590 

O lie Ser Phe Met Asp lie Pro Thr Ser Pro Glu His Glu Ala Gin Gly 
S95 600 605 

lie Lys Lyc Thr Lys Gin Tyr Ala Asn lie Gly Lys Ala Arg Ser Arq 

610 615 620 

Gly Leu Asp Val Leu Cys Asp Ala Ser He Gly Trp Gly He Lys Leu 
10 €25 630 635 640 

Gly Ala Gly Tyr Ser Leu Val Glu Ala Lys Asn Leu Gin Thr Asp Glu 

S-^S 650 655 

Trp Leu Glu Gly Ala Ala Arg His Arg Ala Asn Val His Ala Asp Trp 
-c- 665 670 

ID Val His Tyr Trp Gly Gin Tyr Arg Lea Gly Val Ser Leu Phe Gly Arq 
€75 680 685 

He Gin Ser Glu Ary Tyr Tyr I.ys A--;p Gly Asn Ala Pro Asp Tyr Thr 

690 695 700 

Leu Trp Arg Leu Ala Thr Ser His Arg Phe Ala His Phe Arg His He 
20 705 7]C 7X5 720 

He Leu Asp Gly Thr Leu Gly He Asp A.-^n Leu Phe Asp Tyr Val Asp 

725 730 735 

Asp Arg Pro rfet Gly Val Asn Tyr Ala Thr Val Thr Pro Gly Arq Thr 

'^^0 745 750 

Phe Phe Ala Gin He Ala He Arg Phe Acn Arrn 
7 55 7 60 



25 



(2) INFORl-lATION FOR SEQ ID IJO:2 96 

30 (i) Sfc:QUEMCE CHARACTERISTICS: 

(A) LEMGTIl: 36 S amino acids 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

35 (11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 



40 



45 



50 



Ivl) ORIGTrJAL SOURCE: 

(A) ORGAhlISM: Porphyroraonas gingival is 

(ix) FEATURE: 

(A) liAME/KEY: misc_f eature 

(B) LOCATIOt* 1, . .365 

(xi) SEQUEIJCE DESCRIPTION: SEQ TD NO: 2 96 

Cys Cys Ala Arg Lau Arg Asp Thr Ser Pro Arg Lys Lys Ser Phe Arg 
^5 10 15 

Asn Asp Thr Val Phe Pro Tyr Phe Val He His Leu He Lys His H- 

20 25 30 

He He Met Thr Asp Asn Lys Gin Arg Asn He Val Phe Pro Ala Phe 

35 40 45 

Leu Leu Leu Leu Gly Val He Ala Val Val Thr He Val Gly Phe Phe 

50 55 60 

Met Leu Arg Pro Ala Glu Glu lie He Gin Gly Gin He Glu Val Thr 
" 70 75 80 

Glu Tyr Arg Val Ser Sar Lys Val Pro Gly Arg He Lys Glu Leu Arg 

on 50 S5 
DU Val Sc-x Glu Gly Gin Gin Val Gin Ala Gly Asp Thr Leii Ala Val H« 

iOO 105 110 

Glu Ala Pro Asp Val Ala Ala Lys Met Glu Gin Ala Lys Ala Ala Glu 

115 120 125 

Ala Ala Ala Gin Ala Gin Asn Ala Lys Ala Leu Lys Gly Ala Arg Ser 

130 135 

Glu Gin He Gin Ala Ala Tyr Glu Met Trp Gin Lys Ala Gin Ala Gly 

ISO 155 160 

Val Ala He Ala Thr Lys Thr His Gin Arg Val Gin A£.n Leu Tyr Asp 

7n ^''O 175 

/U Gin Gly Val Val Pro Ala Gin Lys Leu Asp Glu Ala Thr Ala Gin Arq 
180 185 190 

Anp Ala Ala He Ala Thr Gin Lys Ala Ala Glu Ala Gin Tyr Asn Met 
195 200 205 

7tz ^f? *21^ Asp Lys Leu Al- Ala Ser Ala Leu 

210 215 220 
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Val 


Asp 


Arc; 


Ala 


Arg 


Gl y 


Ala 


Val 


Ala 


Gl u 


Val 


GllJ 


Ser 


Tyr 


He 


Asn 


225 










230 










235 








240 


Glu 


Thr 


Tyr 


Leu 


He 


Ala 


Pro 


Arg 


Ala 


Gly 


Glu 


Val 


Ser 


Glu 


He 


Phe 










245 










250 










255 


Pro 


LyfiT 


Ala 


Gly 


Glu 


Leu 


Val 


Gly 


Thr 


Gly 


Ala 


Pro 


He 


Met 


Asn 


He 








260 










265 










270 






Ala 


Glu 




Gly 


A&'P 


Met 


Trp 


Ala 


Ser 


Phe 


Ala 


Val 


Arg 


Glu 


Asp 


Phe 






275 










280 










285 






Leu 


Ser 


Ser 


Met 


Thr 


Met 


Gly Ala 


Val 


Leu 


Glu 


Thr 


Val 


Val 


Pro 


Ala 




290 


Glu 








255 










300 










Leu 


Asn 


Glu 


Lys 


Vol 


Arg 


Phe 


Lys 


He 


Thi 


Phe 


lie 


Lys 


Asn 


Met 


305 










310 










315 






320 


Gly Thr 


Tyr 


Ala 


Ala 


Trp 


Lys 


Ala 


Thr 


Lys 


Thr 


Thr 


Gly 


Gin 


Tyr Asp 










325 










330 










335 




Leu 


Lys 


Thr 


Phe 


Glu 


Val 


Lys 


Ala 


Thr 


Leu 


Ala 


Asp 


Lys 


Asp 


Lys 


Ala 








3-10 










345 










350 




Gin 


Lys 


Leu 


Arg 


Pro 


Gly 


Met 


Ser 


Val 


He 


11- 


Arg 


Lys 












355 










360 








365 









12} IHFORHATIOH FOR SEQ ID NO: 297 



U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 amino acids 
(Bl TVPE: amino acid 
(D) TOPOLOGV; linear 



(ii) I-IOLECULE TYPR: protein 
(iii ) IIVPOTIIETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGA^USH: Porphyromonas gingival is 

(Ix) FEATURE: 

(A) NAUR/KEY: niisc_feature 

(B) LOCATION 1. . .320 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 297 

Pro Gin Ser Ser Pro Asp Arg Arg Ser Phe Gin Asn Val Met Asn Lys 

15 10 15 

Tyr His Ser Gin Ser Val Leu Glu Val Gly Lys He Gly He Val He 

20 25 30 

He Phe Ala Pro He Val Arg Asn Val His Gin Gin Pro Pro Phe Leu 

35 40 45 

Ser His Lys Ser He Met Arg He Val Ser Asn Phe Leu Phe Val S-r 

50 55 60 

Phe Ser Val Leu Lou Phe Ala Ser Cys Arg Ser Gin Arg Glu Lys Val 
65 70 75 80 

Val Tyr Leu Gin Asp He Gin Thr Phe 7\sn Arg Glu He He Ala Lys 

85 90 95 

Pro Tyr Asp Val Lys He Glu T.ys Asp Asp Val Leu Asn He Leu Val 

100 105 110 

Ser Sec Arg Asp Pro Glu Leu Ser Thr Pro Tyr Asn Gin Val Leu Thr 

115 120 125 

Thr Arg Ala Leu Ala Arg Asn Gly Tyr Gly Thr Asn Ser Asn Glu Gly 
130 135 140 ^ 

Phe Leu Val Asp Ser Lys Gly Tyr He Asn Tyr Pro He Leu Gly Gin 

150 155 ^ 

He Tyr Val Glu Gly Leu Thr Arg Thr Glu Leu Glu Lys Glu He Gin 

165 170 175 

Lys Arg He He Ser Ser Gly Phe He Lys Asp Pro Thr Val Thr Val 

lOO 185 190 

Gin Leu Gin Asn Phe Lys Val Ser Val Leu Gly Glu Val Asn His Pro 

195 200 205 

Gly Ser Het Ser Val Lys Gly Glu Arg He Thr Leu Leu Glu Ala He 

210 215 220 

Gly Met Ala Gly Asp Leu Thr He tyr Gly Arg Arg Asp Arg Vol Phe 
230 235 240 

Val He Arg Glu Thr Asp Gly His Arg Glu Val Phe Gin Thr Asp Leu 

245 250 255 

Arg Lys Ala Asp Leu Leu Ala Ser Pro Val Tyr Tyr Leu His Gin Asn 

260 265 270 

Asp Val He Tyr Val Glu Pro Asn Asp Lys Lys Thr Gin Met Ser Glu 
275 280 
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lie Acn Gin Asn A^n Asn Val Asn Vai Trp Leu Sex Val Thr Ser Thr 

290 295 JOO 

Leu Val Ser He Ser Thr Leu Thr Tie Thr He He Asp I'/s Thr Lys 
305 310 315 320 

5 

12) IllFORMATIOU FOR SEQ TD HO: 2 93 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 502 amino acids.' 
10 (Bl TYPE: amino acid 

(D> TOPOLOGY: linear 

(in MOLECULE TYPE: protein 

15 (iii) HYPOTHETICAL: YF,S 

{vlj ORIGINAL SOURCE: 

(A| ORGANISM: Porphyromonas- gingival is 

20 (ix) FEATURE; 

(A) UAIIE/KEY: misc_feature 

(B) I.OCJ\TION 1. . . 502 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



(xi) SEQUENCE DESCRIPTIOr^: SEQ ID JJO:29P 



Ser 


Lys 


He 


VaJ 


J,ou 


Arg 




Fhe 


s 


Thr 


Leu 


Al a 


Arg 


Met Lyr? 


Ly.T 


1 








5 










10 












Thr 


Asn 


htu 


Phe 


Leu 


Ser 


Leu 


Leu 


Val 


H 


Phe 


He 


Thr 


Gly Ser 


Phe 








20 










25 










30 




Met 


Thr 


Ala 


cys 


Als 


Gin 


Lys 


Ser 


Lys 


Thr 


Asn 


Lys 


Leu 


Thr Glu 


Glu 






35 










40 










45 






Asp Axg 


Ser 


Arg Asn 


Glu 


Tyr 


Val 


Gin 


Ser 


Het 


Asp 


Val 


Leu Ser 


Asn 




50 










55 










60 








He 


He 


Gly TUin Val 


Arg 


Leu 


Tyr 


Phe 


Val 


Asp 


Thr 


He 


Scr He 


Lys 


65 










70 










75 








80 


His 


Met 


Thr 


Arg Arg 


Gly 


He 


Asp 


Ala 


Met 


Leu 


Gly 


Gly 


Leu Asp 


Pro 










85 










90 








95 




T^-r 


Thr 


Glu 


Tyr He 


Pro 


Tyr 


Glu 


Glu 


Het 


Asp 


Glu 


Leu 


Lys Leu 


Met 








100 










105 










110 




Thr 


Thr 


Gly 


Glu 


Tyr 


Ala 


Gly 


Val 


Gly 


Ala 


He 


He 


Ser 


Gin Arg 


Pro 






115 










120 










125 






Asp 


Ser 


Ala 


Val 


He 


lie 


Gin 


Arg 


Pro 


Met 


Glu 


Gly 


Met 


Pro Ala 


Asp 


13Q 










135 










140 








Giu 


Ala 


Gly 


Leu 


He 


Ala 


Gly 


Asp 


Arg 


He 


Leu 


Thr 


He 


Asp Gly Lys 


145 








150 










15S 








160 


A£p 


Phe 


Arg 


Lys 


Ser 


Thr 


Thr 


Pro 


Lys 


Val 


Scr 


Gin 


Ala 


I.ftvj Lys 


Gly 










165 










IVO 








175 




He 


Ala 


Gly 


Thr 


Val 


Ala 


Lys 


Val 


Thr 


Val 


liet 


Arg 


Tyr 


Gly Glu 


Thr 






100 










185 










190 




Lys 


Pro 


Arg 


Thr 


Phe 


Ser 


Vai 


Lys 


Arg 


Gin 


Lys 


Val 


He 


Het Asn 


Ser 






195 










200 










205 






Val 


Thr Tyr 


Ser 


Gly 


Met 


Leu 


Asp 


Gly 


Ser 


He 


Gly 


Tyr 


He Arg 


Leu 




210 










215 










220 








Asn 


Asn 


Phe 


Thr 


Asp 


Lys 


Ser 


Ala 


Glu 


Glu 


Val 


Arg 


Thr 


Ala Leu 


Leu 


225 










230 










235 








240 


Asp 


Leu Arg Asp 


Lys 


Gin 


Gly 


Ala 


Lys 


Gly 


Leu 


He 


Leu 


Asp Leu 


Arg 










245 










250 








255 




Gly Asn 


eiv 


Gly Gly 


Leu 


Met 


Gin 


Ala 


Alo 


II" 


Glu 


He 


Val Asn 


Leu 








260 










265 










270 




Phe 


Val 


Pro 


Lys 


Gly 


Lys 


€]u 


Val 


Val 


Thr 


Thr 


Lys 


Gly 


Arg lie 


Ala 






275 










280 










205 






Glu 


Ser 


Ala 


Ser 


Val 


Phe 


Arg 


Thr 


Leu 


Thr 


Glu 


Pro 


He 


Asp Thr 


Lys 




290 










295 










300 








Leu 


Pro 


He 


Val 


Val 


Leu 


He 


Asp 


Gly 


Gin 


Ser 


Ala 


Ser 


Ser Ssr 


Glu 


305 










310 










315 








320 


lift 


Val 


Ala 


Gly Ala 


Leu 


Gin 


Asp 


Met 


PiSp 


Arg 


Ala 


Val 


Leu Met 


Gly 










325 










330 








335 




Gin 


Ly£ 


Ser 


Tyr Gly 


Lys 


Gly 


Leu 


Val 


Gin 


Thr 


Thr 


Arg 


Gin Lfiu 


Pro 








340 










345 










350 




Tyr 


Asn 


Gly Val 


He 


Lys 


Leu 


Thr 


Thr 


Ala 


Lys 


Tyr 


Tyr 


He Pro 


Ser 






355 










360 










365 






Gly Arq Cys 


He 


Gin 


Arg 


Leu 


Asp 


Tyr 


Ser 


Arg 


Thr 


Asn 


Arg Thr 


Gly 




370 










375 










380 








Met 


Ala 


Thr 


Ala 


He 


Pro 


Asp 


Ser 


Leu 


His 


Lys 


He 


Phe 


Tyr Thr 


Ala 


385 










390 










395 








400 
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Ala Gly Ary Arg Val Glu Asp Ala Gly Gly He I,eu Pro Asp ll« Glu 

400 41,0 4X5 

Val r.y.-^ Gin Asp Thr Ala Ala Thr Leu Leu Tyr Tyr Met Ala 1 1*- Asn 
420 42S 
b Asn Asp Val Phe A4?p Phe Val Thr Gly Tyr Val Leu Lys His Lys Thr 
•135 440 445 

He Ala Lys Pro Glu Asp Phe Ser He Thr Asn Glu Asp Tyr Ala Ala 

•J50 45S 460 

Phe Cys Lys Met Met Glu Glu Lys Lys Phe Asp Tyr Asp Arg Gin Ser 
lU 465 470 475 480 

Gly Lys Met Leu Asp Lys Leu Glu Glu Leu Ala Lys He Glu Gly Tyr 

•iSS 490 495 

Leu Pro Glu Ala Asn Ser Glu Leu Lys Ala Leu Arg Glu Lys L'=-u Lys 
500 505 510 

ID Pro Asn Leu S-r Arg Asp Leu Leu Arg Phe Lye Lys Glu He Thr Asn 
SIS 520 525 

Tyr Leu Asn Aifi Glu He Val Thr Arg Tyr Tyr ryr Glu Arg Glv Ser 

530 535 S40 

He Arg Gin Ser Leu Pro Giu Asp Lys Val Val Lys Glu Ala H- Lys 
20 545 550 sSb slo 

Leu Leu Lys Asp His Pro Glu Gin He Arg Gin He Leu Ala Ala Pro 

565 570 575 

Lys Ala Glu Asn Lys Gly 
590 



25 



45 



50 



60 



70 



(2) IIIFORlIATIOi; FOR SEO ID MO: 2 99 



(i) SEQUF.IICE CHARACTERISTICS; 

(A) LENGTH: 9fl5 amino acids 
30 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

35 ilii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyronvonas gin^ivalis 

40 (ixj FEATURE: 

{A) HAHE/KEY: misc_f eature 
(Bl LOCATION 1, . . 985 



{yA) SEQUENCE DESCRIPTION: SEQ ID NO: 299 



Val Thr Asp Lys Met Gin Asn Lys Gly Phe Val He Val Il« Thr Ser 
^5 10 i5 

Ala Leu Ala He He Cys Ala Phe Tyr T.eu Ser Phe Ser Phe Val Thr 

20 25 30 

Asn Arg T^-r Glu Lys Lys Ala Lys Ala Met Gly Asp Val Ala Gly Met 

35 4 0 45 

Ala Tyr Leu Asp Ser Met Ser Asn Glu Lys Val Trp Phe Gly Tyr Thr 

i»0 55 60 

Leu Lys Glu Ala Gin Ala Gin Gin He Gly Leu Gly Leu Asp Leu Lye 
OO 65 70 75 80 

Gly Gly Met A^n Val He Leu Lys Leu Asn Ala Ser Asp Leu L-u Arg 

85 90 95 

Asn Leu Ser Asn Lys Ser Lsu Asp Pro Asn Phe Asn Lys Ala Leu Glu 

100 105 110 

Asn Ala Ala Lys Ser Thr Glu Gin Ser Asp Phe He Asp H- Phe Val 

115 120 lpc^ 

Lys Glu Tyr Arg l.ys Leu Asp Pro Asn Gly Aig Leu Ala Val He Phe 

130 135 140 

Gly Ser Gly Asp Leu Arg Asp Gin He Thr Ala Lys Ser Thr Asp Ala 

ISO 155 
Asp Val Val Arg Leu Leu Lys Glu Lys Tyr Asn Ser Ala Val Glu Ala 

165 1/0 175 

Ser S»he Asn Val Leu Arg Ala Arg He Asp Ala Phe Gly Val Val Ala 

180 185 _ 190 

Pro Asn Leu Gin Arg Leu Glu Gly Gin Gly Arg He Leu Val Glu Leu 

195 200 2C5 

Pro Gly Val Lys A^^p Pro Glu Arg Val Arg Thr Leu Leu Gin Arg Ser 
210 215 220 

7f; ^ic ^^'^ '^''^ ^"'^ "^V-r Lys Phe Glu Glu Val Ser Gly 

225 230 235 240 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



•iV^i -ii^ fi"=' -tf»*t> iiii ""-li tF=^h rf"^ -rtP "-'^ tP^i ri"*h 

iui -"^ii r3 iii^ a o i« oi o o lui u 



15 



25 



35 



55 



680 685 



^^"^ ^-'-^ ^''^ Ser Ph- Thr lie Giy Leu Asn 

60 7 0b Arg Asn Tyr Vnl Val Lys Phe A:.p 

Gin Pro Val Ser Ser Glu Ala Val Arg Ser lia Leu Ser S«r Pro Leu 

Gin Glu Lys Val Leu Val Thr Ser He Gly Thr Glu Cly Thr Glu Val 

^'^^ 745 750 

Arg lie Ser Thr Asn Tyr hys He Gin Glu Glu Ser Glu Glu Thr Glu 

^55 765 

Ala Glu He Thr Asp Lys Leu Tyr Gin Ser Leu Lys Giy Phe Tyr Thr 
775 780 

70 ?85 """^ "^ "^ s« SI" 

Lys vai s« Pro ser Met Ser Ser A..p il. rhr Arg Gly Ala He Tcp 

005 S2.0 815 

Ala Val Leu Leu Ser Met He Phe Mot Ala He Tyr He Leu He Arc 

75 Phe Arg Asp 11. Ser Phe Ser Ala oly Vol Fhe Val Ser Vai Ala Ala 
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Arp Leu Tie Ala Ala Asn Asp Arg Leu Ser Glu Leu Ala Met Asn Asn 

245 2S0 255 

Thr Asp Ala Thr Pro Glu Thr Glu Pro Ala Thr Thr A^p Ser Val Ala 
c 260 265 270 

t> Ala Thr Ala Asp Ser Ala Ala Val Gin Ala Val Ali Asp Ser Ala Thr 
^75 280 28S 

Val Ala Gin Lys Glu Ala Lys Asp Ala Thr Arg Lys Asp Ala Leu Ph- 
290 295 300 

in ^Jf ^^"^ '^'^"^ '^^^ A^*? Ala Val Val Gly Vai Ala 

310 315 320 

Arg Arg Ala Asn Met Ala Gin He Ser Giu Met Leu Gin Gin Ala Hie 

325 330 
Asp Leu Lys Val Thr Arg Glu Asp Val l.oii Fhe Leu Trp Gly Ala Lvs 

3-10 3.] 5 3S0 

Ala He Glu Asp Pro Glu Thr Lys Lys Glu Thr Asp Leu Tyr Glu L-u 

355 360 365 

Tyr Ala He Arg Thr Asn Arg Thr Gly Asp Pro A.-p Leu Gly Gly Asp 
3/0 375 

9n Xll ^""^ Asn Asp Phe Gly Arg Ser 

390 395 ^ 

Glu Pro He Val Ser Met Thr Het Asn Glu Glu Giy Ala Arg Lys Trp 

405 ^3^5 

Ala Arg He Thr Lys Asp Asn Val Giy Arg Ala lie Ala He Val Leu 

420 42S 430 

Asp Gly Val Val Tyr Ser Ala Pro Asn Val /urn Asp Glu II- Thr Glv 
135 445 

^-'^ ^^^^ '^^^ '^'-1 '^l" '^i" Ala Gly Asp 

450 45S 46Q 

-in ^Vi ^^"^ ^^"^ Ly^ Asp Ala Thr Val Ser II- 

''^^ 470 ^75 

Glu Gin Glu Asn Val He Gly Pro Thr Leu Gly Ala Glu Ser He Lys 

'IBS 490 
Ala Gly Phe Leu Ser Phe Leu Leu Ala Leu Val He Leu Met Cys Tvr 

5*^0 505 510 

Het cys Leu Ala T^^r Gly Phe Leu Pro Gly Leu He Ala Asn Gly Ala 

S15 520 525 

Leu He Val Asn Ser Phe Phe Thr Leu Gly Val Leu Ala Ser Phe His 
53fl 535 540 

40 5^5 "^^"^ ''^^ ""^^^ "^^^ ^''^ ""^^ 

Met Ala Val Asp Ala Asn Vai Leu He Phe Glu Arg He Lys Glu Glu 

565 570 

Leu Arg Ala Gly Lys Thr Pro He Arg Ala Val Thr Asp Gly Tyr Gly 

AC 595 590 

Asn Ala Phe Ser Ala He Phe Asp Ser Asn Val Thr Thr He II- Thr 

600 605 

Gly He He Leu Phe Leu Tyr Gly Thr Gly Pro He Arg Gly Phe Ala 

^ 615 620 

50 "^^^ ^'^ Thr Ala Val Phe 

*30 635 

Leu Thr Arg He Val Phe Glu Lys Leu Ala Lys Lys Gly Arg Leu Asp 

645 650 655 

Lys He Thr Phe Thr Thr Ser He Thr Arg Asn Leu Leu Val Asn Pro 

665 670 

Ser Tyr Asn He Leu Gly Lys Arg Lys Thr Gly Phe He He Pro Val 

675 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



•tP"it ii"^ ^F^i ^ ii=*Ji (T^is ™h tr=i» 

aji '^y '^ii ciii .Ji. icii 01 iUJ ir-ji c;* iui oi 



wo 99/29870 PCT/AU98/01023 

219/490 

845 

Thr Thr Ph« Cys He He Ala Leu Tyr Ala Lea Leu Trp Lya He Leu 
855 



20 



25 



30 



40 



50 



- — - O O ' J 

5 ""^^ ""^^ i-e^ Ala 

875 

Ue He Gly Tyr Ser Leu Asn Asp Thr Val Val Val Phe Asp Arg He 

Arg GXu Thr Met Lys Leu Tyr Pro A.^n Arg Asp Arg Tyr Gin Val He 

in 910 

lU Asn Asp Ala Leu Asn Ser Thr Leu Gly Arg Thr Leu Asn Thr Ser Leu 
Sio 920 
Thr Thr Phe He Val Het Leu Val He Phe He Phe Gly Gly Ala Thr 

935 

l'^ ^''^ ^""'^ ^^"^ Val He Gly Thr 

950 ^ 

Tyr ser Thr Leu Phe Val Ala Thr Pro Leu Ala Tyr Glu He Gin Lys 

965 970 97S ^ 

Arg Lys Leu Asn Lys Ala Ala Lys Lys 
980 



(2) IIIFORl-IATION FOR SEQ ID NO: 300 

U) SklQUEHCE CHARACTERISTICS: 

(A) LENGTH: 104 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY': linear 

(11) MOLECULE TYPE: protein 

liii) HYPOTHETICAL: YES 



{vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginglvalis 

35 (ix) FEATURE: 

(A) NAME/ KEY: mi^c feature 

(B) LOCATION 1. . .104 6 



(xi) SEQUENCE DESCRIPTlOhJ: SEQ ID NO: 300 

Asp Phe Pro Trp Val Arg Val Lys Pro Glu Lys Lys Arg Lys Gin His 

Asn ser Asn Asn Phe l.yr. Phe Asn Ala Lys Glu Lys Ser Het Lys Arg 

45 Met Thr Leu Phe Phe Leu Cys Leu Leu Thr Ser He Gly Trp Ala HeL 

Ala Gin Asn Arg Thr Val Lys Gly Thr Val He S^x Ser Glu Asp Asn 

Glu Pro Leu He Gly Ala Asn Val Val Val Val Gly Asn Thr Thr He 

Gly Ala Ala Thr Asp Leu Asp Gly Asn Phe Thr Leu Ser Val Pro Ala 

85 90 95 

Asn Ala Lys, Met Leu Arg Val Ser Tyr Ser Gly Met Thr Thr Lys Glu 

55 Val Ala He Ala Asn Val Met Lys He Val Leu Asp Pro ^^p Ser Lys 

120 j[25 
Val l*u Glu Gin val Val Val Lou Gly Tyr Gly Thr Gly Gin Lys Lou 

60 iVs "^"^ f|5 III Lys Leu Ala 

Glu Ly, Pro Val Ala A£n He Het Asp Ala Leu Gin Gly Gin Val "a 

Gly net Gin Val Her Thr Thr Ser Gly Pro Thr Ala Val 111 Ser 

K«; , , 190 
b5 /al Glu 11, His Gly Thr Gly Ser Leu Gly Ala Ser Ser Ala Pro Leu 

19b 200 205 

Tyr He Val Asp Gly Met Gin Thr Ser T.eu Asp Val Val Ala Thr Mot 

70 ^" ^" ^^"^ ^^"^ ^" ""^^ Ala 

Thr Ser He Tyr Gly Ala Arg Ala Ala Asn Gly Val Val Phe He Gin 

250 255 
Thr Lys Lys Gly Lys Met Ser Glu Ar| Gly Arg He Thr Phe Asn Ala 

75 ser Tyr Gly He Ser Gin He Leu Thr Lys Pro Leu Asn Met 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 







27 5 










280 


















list 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Asp 


Phe 


Gin 


Val 


Lys 


Ala 


Gly 


Phe 


Trp 




290 










295 










300 










Gly 


Aan 


Asn 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lys 


Asp 


Hert 


He 


Leu 


Ala 


Gly 


305 










310 










315 










320 


Ala 


Glu 


Asp 


Leu Tyr 


Gly Asn Tyr 


Asp 


Ser 


Leu 


Lys 


Asp 


Glu 


Tyr 


Gl y 










325 










330 










335 




Lys 


Thr 


Leu 


Phft 


Pro 


Val 


Asp 


Phe 


Asn 


Hlr 


Asp 


Ala 


Asp 


Trp 


Leu 


Lys 






340 










345 










360 






Ala 


Leu 


Phe 


Lys 


Thr 


Ala 


Pro 


Thr 


Ser 


Gin 


Gly 


Asp 


He 


Ser 


Phe 


Ser 






355 










360 










365 








Gly 


Gly 


Ser 


Gin 


Gly 


Thr 


Ser 


Tyr 


Tyr 


Ala 


Ser 


He 


Gly Tyr 


Phe 


Asp 


37 0 










375 










380 










Oln 


Glu 


Gly 


Met 


Ala 


Arg 


Glu 


Pro 


Ala 


Asn 


Phe 


Lys 


Arg Tyr 


Ssr 


Gly 


365 










390 










395 










4 00 


Axy 


Leu 


Asn 


Phe 


Glu 


Ser 


Arg 


He 


Asn 


Glu 


Trp 


T.cu 


Lys 


Val 


Gly 


Ala 








405 










410 










415 




Asn 


Leu 


Ser 


Gly 


Ala 


He 


Ala 


Asn 


Arg 


Arg 


Set 


Ala 


Asp 


Tyr 


Phe 


Gly 








420 










425 










430 






Lys 


Tyr 


Tyr 


Met 


Gly 


Ser 


Gly The 


Ph5 


Gly 


vai 


Leu 


Thr 


Het 


Pro Arg 






435 










440 










4 45 








Tyr 


Tyr 


Asn 


Pro 


Phe 


Asp 


Val 


Asn 


Gly 


Asp 


Leu 


Ala 


Asp 


Val 


Tyr 


Tyr 




450 










455 










460 










Met 


Tyr 


Gly 


Ala 


Thr 


Arg 


Pro 


Ser 


Met 


Thr 


Glu 


pro 


Tyr 


Phe 


Ala 


Lys 


465 










470 










'175 










480 


Met 


Arg 


Pro 


Phe 


Ser 


Ser 


Glu 


Ser 


His 


Gin 


Ala 


Asn 


Val 


Asn 


Gly 


Phfi 










485 










4 90 










495 




Ala 


Gin 


He 


Thr 


Pro 


He 


Lys 


Gly 


Leu 


Thr 


Lsu 


Lys 


Ala 


Gin 


Ala 


Gly 








500 










505 










510 






Val 


Asp 


He 


Thr 


Asn 


Thr 


Arg 


Thr 


Ser 


Ser 


Lys 


Arg 


Met 


Pro 


Asn 


Asn 






515 










520 










525 








Pro 


Tyr 


Asp 


Ser 


Thr 


Pro 


Leu Gly 


Glu 


Arg 


Arg 


Glu 


Arg 


Ala 


Tyr 


Arg 




530 










535 










540 










Asp 


Val 


Ser 


Lys 


Ser 


Ph« 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr Lys 


Phe 


Ser 


He 












550 










555 










560 


Asp 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Als 


Leu 


Met 


Gly 


His 


Glu 


Tyr 


He 


Glu 








565 










570 










57b 




Tyr 


Glu 


Gly Af p 


Val 


He 


Gly Ala 


Ser 


Ser 


Lys 


Gly 


Phe 


Glu 


Ser Asp 








580 










585 










590 






Lys 


I.Gtl 


H«t 


L«u 


Leu 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly 


Asn 


Ssr 


Leu 


Ser 


Leu 






595 










600 










605 








Pro 


Glu 


His 


Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arg 




610 










615 










620 










Phe 


Asn 


Tyr 


GlY 


Phe 


Asp 


Lys 


Trp 


Met 


Tyr 


He 


Asp 


Phe 


Ser 


Val 


Arg 


625 










630 










635 










640 


Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 


Ser 


Asn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 










645 










650 










655 




Tyr 


Ser 


Val 


Gly 


Gly Met 


Phe 


Asp 


He 


Tyr 


Asn 


Lys 


Phe 


He 


Gin 


Glu 








660 










665 










67 0 






Ser 


Affn 


Trp 


Leu 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


Met 


Ser 


Tyr Gly Thr 


Thr 






675 










680 










685 








Gly 


Aen 


Ser 


Glu 


He 


Gly Asn 


Tyr Asn 


His 


Gin 


Ala 


Leu 


Val 


Thr 


Vdl 


650 










695 










700 










Asn 


Asn 


Tyr 


Thx 


Glu 


Acp 


Ala 


Met 


Gly 


Leu 


Ser 


He 


S&r 


Thr 


Ala 


Gly 


705 










710 










715 










720 


Asn 


Pco 


AFp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 


Phe 


Gly 


Leu 










725 








730 










735 




Ala 


Ala 


Gly 


Ala 


Phe 


Asn 


Asn 


Arg 


Leu 


Scr 


Ala 


Glu 


Val 


Asp 


Phe 


Tyr 








740 










74 5 










750 






Val 


Arq 


Thr 


Thr 


Aijn 


Asp 


Het 


Leu 


He 


Asp 


Val 


Pro 


Wet 


Pro 


Tyr 


He 






755 










760 










765 








Ser 


Gly 


Phe 


Phe 


Ser 


Gin 


Tyr 


Gin 


Asn 


Vnl 


Gly 


Ser 


Het 


Lys 


Asn 


Thr 




77 0 










775 










7B0 










Gly 


Val 


Asp 


Leu 


Ser 


Leu 


Lys 


Gly Thr 


He 


Tyr 


Gin 


Asn 


Lys 


Asp 


Trp 


7 85 










790 










7 95 










800 


At'n 


Val 


Tyr 


Ala 


Ser 


Ala 


Asn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 










ao5 










810 










815 




Lys 


Leu 


Phe 


Phe Gly 


L*5U 


A.?n 


Lys 


Tyr 


M«L 


Leu 


Pro 


Asn 


Thr 


Gly Thr 








820 










825 










630 






He 


TCP 


Glu 


H- 


Gly 


Tyr 


Pro 


Asn 


Ser 


Phe 


Tyr 


Het 


Ala 


Glu 


Tyr 


Al« 






835 










040 










845 








G3y 


He 


Asp 


Lys 


Lye 


Thr 


Gly Lys 


Gin 


Leu 


Trp 


Tyr 


Val 


Pro Gly Gin 




850 










855 










860 










Val 


A£D 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


Thr 


Thr 


Ser 


Gin 


Tyr 


Ser 


Ala 


Asp 


665 










fl70 










875 










8B0 
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25 



30 



45 



50 



55 



60 



65 



70 



75 



Leu 


Glu 


Thr 


Arq 


He 
885 


Asp 


Lys 


Ser 


Val 


Thr 
890 


Pro 


Pro 


He Thr 


Gly 
895 


Gl y 


Phe 


Ssr 


Leu Gly Ala 


Ser 


Trp 


Lys 


Gly 


Leu 


Ser 


Leu 


Asp Ala 


Asp 


Phe 








900 










905 








910 






Ala 


Tyr 


He 


Val 


Gly Lys 


Trp 


Het 


He 


Asn 


Asn 


Asp 


Arg Tyr 


Phe 


Thr 






915 










920 










925 






Giu 


Asn 


Ala 


Gly 


Gly Leu 


Her 


Gin 


Leu 


Asn 


Lys 


Asp 


Lys Met 


Leu 


Leu 




930 










935 










940 








Asn 


Ala 


Trp 


Thr 


Glu 


Asp 


Asn 


Lys 


Glu 


Thr 


Aap 


Val 


Pro Lys 


Leu 


Gly 


945 










950 










955 








960 


Gin 


Ser 


Pro 


Gin 


?he 
965 


Asp 


Thr 


HI 5 


Leu 


Leu 
970 


Glu 


Asn 


Ala Ser 


Pho 
975 


Leu 


Aig 


Leu 


Lys 


A-cn 


Leu 


Lys 


Leu 


Thr 


Tyr 


Val 


Leu 


Pro 


Asn Ser 


Leu 


Phe 






980 










986 








990 






Al;> 


Gly 


Gin 


Asn 


Val 


He 


Gly Gly Ala Arq Val 


Tyr 


Leu HeL 


Ala 


Arg 
















1000 








1005 






Asn 


Leu 


Lsu 


Thr 


Val 


Thr 


Lys 


Tyr 


Lys 


Gly 


Phe 


Asp 


Pro Glu 


Ala 


Giy 




lOlC 








1015 








1020 






Gly 


Asn 


VbI 


Gly 


Lys 


Aen 


Gl n 


Tyr 


Pro 


Asn 


Ser 


Lys 


Gin Tyr 


Val 


Ala 


1025 








1030 








1035 






] 


Gly 


lie 


Gin 


T.ftil 


Ser 


Phe 





















10 



15 



20 

Gly He Gin T.ftii 

104 5 

(2) TMFOmiATIOtV FOR SEQ ID JIO:301 

(i) SRQUEMCE CHARACTERISTICS: 

(A) LEHGTH: 860 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

tii) IIOLECULE TYPE: protein 

(111 J HYPOTHETICAL: YES 

35 (vi) ORIGIIIAL SOURCE: 

(A) ORGAIJISH: Porphyromonas ginglvalis 

( i>: ) FEATURE : 

(A) HAME/KEY: misc feature 
40 (B) LOCATION 1...869 

(Xi) SEQUF.WCE DESCRIPTION: SEQ ID NO: 301 



Trp 


His 


Arg 


Asn 


lie 


Phe 


He 


Phe 


Ala 


Ser 


Thr 


Phe 


Ser Pro 


Lys 


Asn 


1 








5 










.10 








15 




Met 


Leu 


Pro 


Leu 


Pro 


Tyr 


Arg 


Tyr 


Ala 


Lys 


Thr 


Glu 


His Leu 


Phe 


Leu 








20 










25 








30 






Ala 


Lys 


Gly 


Tyr 


Cys 


Lys 


Asn 


Pro 


He 


Thr 


Asn 


He 


He He 


Leu 


Phe 






35 










40 










45 






Met 


Lys 


Lys 


I.ys 


TKsn 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Phe 


Val Ala 


Leu 


Leu 




50 










55 










60 








Thr 


Phe 


He 


Gly 


Ser 


Met 


Gin 


AlA 


Gin 


Gin 


Ala 


Lys 


Asp Tyr 


Phe 


Asn 


65 










70 










75 








80 


Phe Asp 


Glu 


Arg 


Gly 


Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lys 


Val Pro 


Asp 


Arg 










85 










90 








95 




Ala 


Val 


Leu 


Gin 


Glu 


Leu 


Ala 


Leu 


He 


Met 


Ser 


He 


Asp Glu 


Phe 


Asp 








100 










105 








110 






Pro 


Val 


Thr 


Asn 


Glu 


Ala 


He 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu Glu 


Ptie 


Glu 






115 










120 










125 






Ala 


Phe 


Leu 


Arg 


Tyr 


Gly 


Leu 


Lys 


Pro 


The 


Phe 


Leu 


Thr Pro 


Pro 


Ser 




130 










135 










140 








Met 


Gin 


Arg 


Ala 


Val 


Glu 


Met 


Phe 


Asp 


Tyr 


Arg 


Ser 


Gly Glu 


Lys 


Tyr 


145 










150 










155 








160 


Glu 


Trp 


Arn 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


He 


Ser Met 


Met 


Glu 










165 










170 








175 




Glu 


Ph- 


Gin 


Thr 


Lys 


Tyr 


Pro 


Ser 


Leu 


cys 


Thr 


Ttir 


Ser Val 


He 


Gly 








180 










18,S 








190 






Lys 


Set 


Val 


Lys 


Asp 


Arg 


Lys 


Leu 


Met 


He 


cys 


Lys 


Leu Thr 


Ser 


Ser 






19b 










200 










205 






Ala 


A3n 


Thr 


Gly 


Lys 


Lys 


Pro 


Arg 


Val 


Leu 


Tyr 


Thr 


Ser Thr 


Met 


His 




210 










215 










220 








Gly Asp 


Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu He Asp 


His 


225 










2 30 










235 








240 


Leu 


Leu 


Ser 


Asr 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


He 


Lys 


Asn He 


Leu 


Asp 










245 










2QQ 








255 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Lys Thr Giu Val T.p He Cys Pro Leu Thr A.n Pro Asp Gly Ala Tyr 

Arg Ala Gly Asn H.s Thr Val Gin Gl'y Al. Thr Arg Tyr Iln Ala A.n 

-^'^ 280 28'^ 

Asn V.l A^p Leu Asn Arg A^n Phe Lys Asp Asp Val Al^ Gly Arp His 

2 95 300 
Pro A.P Glv Lys Pro Trp Gin Pro Glu Ala Thr Ala Phe Met A.p V.u 

Glu Glv A=n Thr Ser Ph, Val Leu Gly Ala Asn lie His Gly Gl,- Thr 

Glu Val Val Asn Tyr Pro Trp Asp Asn Lys Lys Glu Arg His III Asp 

ASP Glu Trp Tyr Lys Leu He Ser Ar^ Asn Tyr Ala Ala Ala Cys Gin 

S.r lie ser Ala Se. Tyr Met Thr Sor Glu Thr Asn Jer Cly He He 

385 '"'^ Asp Asn Ala 

Asn Tyr Phe His Ar| Leu Ary Glu He Thr Lou Glu 11= Ser Asn ^h^ 
Lys Leu Val Pro Al. S.r Cln Leu Pro Lys Tyr Trp Asn Leu I'n Lys 
Glu Se. Leu Leu Ala Leu He Glu Glu Scr Leu Tyr Gly n° „i, Gly 
Thr val Thr Ser Ala Ala Asn Gly Gin Pro Leu Lys Cys Gin lie Leu 

4 j5 4 60 

lie Glu Asn His Asp Lyr Arg Asn Ser Asp Val Tyr Ser Asp Ala Thr 

Thr Gly Tyr Tyr Val Arc, Pro Ixe Lys Ala Gl'y Thr Tyr Thr Val Lys 

Tyr Lys Ala Glu Gly Tyr Pro Glu Ala ^hr Arg Thr He Thr 111 Lys 

A.P Lys rxu ?hr Val He Met Asp lie Ala Leu Gly As,. Ser Val P.o 

520 

Leu Pro Val Pro Asp Phe Thr Ala Ser Pro Met Thr He Ser Val Gly 
Glu ser vai ain Phe Gin Asp Gin Th. Thr Asn ^° P.o Thr A«n Trp 
Glu Trp Thr Phe Glu Gly Gly Gin Pro Ala Mot Ser Thr Glu Gin ^n 
Pro r.eu val Ser Tyr Ser His Pro Gly eln Tyr Asp Val Thr flu Lys 

Val Tip Asn Ala Ser Gly Ser Asn Thr He Thr Lys Glu "s Ph- H» 

600 

Thr val Asn Ala Val Het Pro Val Ala Glu Phe Val Gl^y Thr Pro Thr 

olJ> 65»0 

Glu He Glu Glu Gly Gin Thr Val Scr Phe Gin Asn Cln Ser Thr Asn 

A. a Thr Asn Tyr Val Tr^ He Phe Asp Gly "y Thr Pro Ala Thr ler 

01. ASP Glu Asn Pro Thr Val Leu Tyr ler Lys Ala Gly Gin Tyr Asp 

va, Thr Leu Lys Ala He s.r Ala ler Gly Glu The Val Lys Thr Lys 

Glu Lys Tyr He Thr Val Lys Lys Ala Pro Val Pro Alf Pro Val Ala 

Asp Phe Glu Gly Thr Pro Axg Lys Val Lys Lys Gly Glu Thr V.l Thr 

Phe Lys ASP ,..u Snr Thr Asn Asn Pro Thr Ser Trp Leu Trp Val Phe 

Glu Gly Gly ser Pro Ala Thr Ser Thr G^u Gin Asn Pro Val 111 Thr 

Tyr Asn Glu Thr Gly Lys Tyr Asp Val' Gin Leu Thr Ala Thr Asn Glu 

760 Tf,c, 
Gly Gly Ser Asn Val Lys Ly. Ala Glu Asp Tyr He Glu Val He Leu 

Asp ASP ser Val Glu Asp He vai Ala Gin Thr Gl'y He Val He Arg 

t-ro Gin Asn Gly Thr Lys Gin He Leu He Gl'u A.a.Asn Ala Ala lie 

Lys Ala val Leu Tyr Asp He Asn l{y Arg Val Val Leu Lys Th. 

Thr Pro Asn Gin Leu Arg Ser Thr Va" A.,p Leu Ser He Leu Pro Glu 

Gly He Tyr Thr He Asn He "s Thr Glu Lys Ser 111 Arg Thr Glu 
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850 855 860 

Lys lie His lie Gly 
86S 

(2) IIJFORMATIOM FOR SEQ ID MO: 302 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 106 amino acid5 
(Bl TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(iii) IIYPOTJIETICAL; Yfc:S 

(vi) ORIGIKAL SOURCE: 

(A) ORGANISM: Porphyronjonas gingivalxE 

(ix] FEATURE: 

(A) HAIIE/KEY: misc feature 
(Bl LOCATIOM 1. . .106 

(Xl) SEQUENCE DESCRIPTION: SEQ ID WO: 302 



Leu Sex 
1 


Tyr 


Ser 


Gly 
5 


Glu 


Ser 


Asp 


Ala 


Arg Lys 


Cy.9 


Thr 


Phe 


lie 


Gly 


Phe 


Glu 






20 








25 


Leu lie 


Lys 


Ala 


Phe 


Leu 


Val 


Gin 


Leu 




35 










40 




Tyr l.ys Arg 


Phe 


lie 


Ser 


Pro 


Leu 


Thr 


50 










55 






Pro Ser 


Cys 


Ser 


Ser 


Tyr 


Ala 


He 


Glu 


65 








70 








Gly Lys 


Gly 


Leu 


Leu 


Leu 


Ser 


He 


Lys 








85 








Trp Gly 


Gly 


Ser 


Gly 


Tyr 


Asp 


Pro 


Val 



Lys Glu Ser Asp Gin Asn Cys 
10 15 
Lys Arg Val Asn Thr Met Arg 
30 

Leu Leu Leu Pro He Phe Phe 
45 

Pro Pro Ser Cys Arg Phe Thr 
60 

Ala Leu Arg Lys Tyr Gly Pro 
75 80 



20 
Pro 

100 105 
12) IHFORJIATION FOR SEQ ID IIO:303 

(i) SEQUENCt; CHARACTERISTICS: 

(A) LENGTH: 861 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroraonas gingiva lis 

(ix» FEATURE: 

(A) NA14E/KEY: misc_f ea turc 

(B) LOCATION 1, . . 861 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303 



Leu 


Glu 


Met 


Ala 


Tyr 


Asp 


Phe 


Thr 


Gin 


1 








5 










Tyr 


Ser 


Tyr 


Gin 


Glu 


Ala 


Thr 


Arg 


Leu 








20 








25 


A;'p net 


Leu 


Val 


Leu 


Gly 


He 


He 


Arg 






35 










40 


Asp 


He 


Met 


Axg 


His 


Tyr 


Gly 


He 


Asn 




50 










55 






He 


Glu 


r.eu 


Glu 


Ala 


He 


Ala 


Glu 


Ser 


65 
















Ser 


Pro 


He 


Phe 


Tlir 


Pro 


Scr 


A] a 


Arg 


Asp 


He 


Cys 


Ala 


85 
Asp 


Met 


Glu 


Asp 


Glu 








100 








105 


Leu 


Leu 


Ser 


Tie 


Leu 


Asn 


Ser 


Thr 


Gin 






115 










120 



15 
Val Thr 

30 



Ala He 



Pro Glu Gly 
80 

Asp Ala Thr 
95 

Vol His Leu 
110 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Phe 


net 


Lys 


Gin 


Gly 


He 


Lys 


Tyr Asp 


Thr 


He 




Ser 


Asp 


Tyr 


Phe 




130 










135 








140 










Gly 


Gl A 


Acy 


Asn 


Pro 


Ser 


Glu 


Gly Lys 


Ser 


Pro 


Ser 


Glu 


Het 


Glu 


He 


145 










150 








155 










160 


I.eu 


Arp 


Gly Tyr 


Gin 


Asp 


Asn 


Asp Phe 


Asp 


Asp 


Glu 


Glu 


Asp 


Glu 


Ser 










165 








170 










175 




Ser 


Pro 


Pro 


Ser 


Gly 


Asn 


Ser Gly Thr Gly 


Gly 


Gly 


Ser 


Ciy 


Asp 


Ala 








180 








185 










190 






Pro 


Glu 


Gin 


Asn 


Thr Gly 


Gly Gly Asp 


Thr 


Thr 


Thr 


Thr 


Thr 


Arg 


Sec 






195 










200 








205 








Gly 


Gly 


Asp 


Tnr 


Pro 


Ala 


Leu Asp Thr 


Phs 


Gly 


Thr 


Asp 


He 


Thr 


Ala 




210 










215 








220 










Met 


Ala 


Ala 


Ala 


Gly 




Leu 


Asp Pro 


Val 


Val 


Gly 


Arg 


Glu 


Gin 


Glu 


225 










23 Q 








235 










240 


lie 


Glu 


Ara 


Val 


He 


Gin 


He 


Leu Scr 


Arg 


Arg 


Lys 


Lys 


Asn 


Asn 


Pro 








245 








250 










255 




Vnl 


IjCM 


11** 


Gly 


Glu 


Pro 


Gly 


Val Gly 


Lys 


Ser 


Ala 


He 


Val 


Glu 


Gl y 








260 








265 










2 fo 






Leu 


Ala 




Arg 


He 


Val 


A5n 


Arg Lys 


Val 


Ser 


Arq 


He 


Leu 


Phe 


Asp 






^7 5 










2B0 








2 85 










Arg 


lie 


Ilr 


Ser 


Leu 


Asp 


Leu Ala 




Met 


Val 


Ala 


Gly 


Thr 


Lys 




290 










295 








300 












Arg 


Valy 


Gin 


Phe 


Glu 


Glu 


Arg Leu 


*'J^ 


Ala 


Val 


Leu 


Asp 


Glu 


Leu 


305 










310 








315 










320 




!>"'£■ 


Arn 
Tn 


Pro 


Gin 


He 


He 


Leu Phe 


He 


Asp 


Glu 


He 


His 


Thr 


He 










325 








330 










335 




Vai 


Gi V 


Ala 


Gly Ser 


Ala 


Ala 


Gly Ser 


Met 


Asp 


Thr 


Ala 


Asn 


Met 


Leu 








3-10 








345 










35 Q 






iij ^- 


Pro 


Ala 


Leu 


Ala 


Arg 


Gly Gin Val 


Gin 


Cys 


He 


Gly 


Ala 


Thr 


Thr 






355 










360 








365 








Leu 




Glu 


Tyr 


Arg 


Lys 


Asn 


He Glu 


Lys 


Asp 


Gly 


Ala 


Leu 


Glu 


Arg 




370 










375 








380 










Arg 


Phe 


Gin 


Lys 


Val 


Pro 


He 


Ala Pro 


Ser 


The 


Ala 


Glu 


Glu 


Thr 


Leu 


385 










390 








395 










400 


Thr 


lie 


Lsu 


Gin 


Acn 


He 


Lys 


Glu Lys 


J yr 


Glu 


Asp 


Tyr 


His 


Gly Val 










405 








410 










415 




Arn 


T\*r 


Thr 


Asp 


Glu 


Ala 


He 


Lys Ala 


Ala 


Val 


Glu 


Leu 


Thr 


Asp 


Arg 








420 








425 










430 






Tyr 


Val 


Ser 


Asp 


Arg 


Phe 


Phe 


Pro Asp 


Lvs 


Ala 


He 


Asp 


Ala 


Het 


Asp 






435 










440 








445 








Glu 


Al a 


Gly 


Ala 


Ser 


Val 


His 


He Thx 


Asn 


Val 


Val 


Ala 


Pro 


Lys 


Glu 




4 50 










455 








460 










lie 


Glu 


lie 


Leu 


Glu 


Ala 


Glu 


Leu Ala 


Ser 


Val 


Arg 


Glu 


Asn 


Lys 


Leu 


4 65 










470 








475 










480 


Ser 


Ala 


Val 


Lys 


Ala 


Gin Asn Tyr Glu 


Leu 


Ala 


Aia 


Ser 


Phe 


Arg 


Asp 










485 








4 90 










495 




Gin 


Glu 


Arg 


Arg 


Thr 


Gin 


Gin 


Gin Hs 


Ala 


Glu 


Glu 


Lys 


Lys 


Lys 


Trp 








500 








505 










510 






Glu 


Glu 


Gin 


Met 


Ser 


Lys 


His 


Arg Glu 


Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 






515 










520 








525 








Ala 


His 


Val 


Val 


Ala 


Leu 


Met 


Thr Gly 


Val 


Pro 


Aia 


Glu 


Arg 


Leu 


Set 




530 










535 








540 










Thr 


Gly 


Glu Gly Glu Arg 


Leu 


Arg Thr 


Met 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


545 










550 








555 










560 


Lys 


Val 


Val 


Gly Gin Asp 


Thr 


Ala He 


Glu 


Lys 


Met 


Val 


His 


Ala 


n«i 








555 








57 0 










575 




Gin 


Arg 


Asn 


Arg 


Leu 


Gly 


Leu 


Arg Asn 


Glu 


Lys 


Lys 


Pro 


He 


Gly Ser 








S80 








585 










590 






Phe 


Leu 


Phe 


Leu 


Gly 


Pro 


Thr 


Gly Val 


Gly 


Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 






595 










600 








605 








Lys 


Leu 


Ala 


Glu 


Tyr 


Leu 


Phe 


Glu Asp 


Glu 


Asn 


Ala 


Met 


He 


Arg 


Val 




610 










615 








620 










Aa'P 


IJfft 


Ser 


Glu 


Tyr 


Met 


Glu 


Lys Phe 


Ser 


Val 


Ser 


Arg 


Leu 


Val 


Ciy 












€30 








635 










64Q 


Ala 


Pro 


Pro 


Gly T/r 


Val 


Gly Tyr Glu 


Glu 


Gly 


Gly 


Gin 


Leu 


Thr 


Glu 










645 








650 










655 




Azg 


Vol 


Arg 


Arg 


Lyff 


Pro 


Tyr 


Ser Val 


Val 


Leu 


Leu 


Asp 


Glu 


He 


Glu 








660 








665 










670 






Lys 


Ala 


His 


Ala 


Asp 


Val 


Phe 


Asn Leu 


Leu 


Lsu 


Gin 


-Val 


Met 


Asp 


Glu 






67S 










680 








685 








Gly 


Gin 


Leu 


Thr 


A.'Tp 


Snr 


Leu 


Gly Arg 


Arg 


Val 


Asn 


Phe 


Lys 


Asn 


Thr 














695 








7 00 










Val 


Il« 


lie 


lie 


Thr 


Ser 


Asn 


Val Gly 


Thr 


Arg 


Gin 


Leu 


Lyr 


Asp 


Phe 


705 










710 








715 












Gly 


Gin 


Gly 


11- 


Gly 


Ph« 


Arg 


Ser Glu 


Lys 


Arp 


Glu 


Glu 


Al a 


Asn 


Lys 
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p. . 




















725 










Al a vliifU 








7 35 






Hi? 


s-r 


Axg 


Ser 


Val 


He 


Gin 


Lys 




I.ys 


Thr 


Phe 


Ser 








740 
















LeV 


750 






Pro 


Giu 


Pile 
755 


Leu 


Asn 


Arg 


Leu 


Asp 
760 


Asp 


He 


Ha 


Phe 
765 


Asp 


Gin 


Leu 


Gly 


Lys 
770 


Thr 


Glu 


Tie 


Axg 


Arg 
775 


Met 


Val 


Affp 


He 


Glu 
780 


Leu 


Lys 


Ala 


Val 


Leu 


Ala 


Arg 


He 


Hit- 


Axg 


Ala 


Gly 


Tyr 


Asp 


Leu 


Val 


Leu 


Thr 


Asp 


Giu 


7R.S 










7 90 










7 95 










800 


Ala 


Lys 


A:?p 


Val 


Tie 


Ala 


Thr 


Lys 


Gly 


Tyr 


Asp 


Leu 


Gin 


Tyr 


Gly Ala 










805 










BIO 










815 




Arg 


Pre 


Leu 


820 


Arg 


Thr 


Leu 


Gin 


Asn 
825 


Glu 


Val 


Glu 


Asp 


Arg 

830 


Leu 


Thr 


A^p 


Leu 


Ilfi 
835 


Leu 


Ser 


Gly 


Gin 


He 
840 


Glu 


Lys 


Gly 


Gin 


Tlir 
845 


Leu 


Thr 


Leu 


Ser 


Ala 

S50 


Arg 


Asp 


Gly 


Glu 


He 
855 


He 


Val 


Gin 


Glu 


Gin 
860 


Ala 









(21 lUFORIHATIOn FOR SEQ ID HO; 304 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEIJGTIJ: 456 amino acids 
(R) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<iii) HiPOTHETICAI.: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGATilSM: Porphyromonas gingivalis 

{ixi FEATURE: 

tA) NAilE/KEY; misc_feature 
(BJ LOCATIOM 1. . .456 



(xi) SEQUENCE DESCRIPTIOH: SEQ ID JJO:304 



Ary 


Gly 


Gly 


Gin 


He 


Arg 


Arg 


His 


His 


Thr 


Asp 


Ser 


Ser 


Arg 


Gly Ser 


1 








5 










10 










15 




Asp 


Ser 


Lys 


Ala 


Cys 


Arg 


Phe 


Ser 


Phe 


His 


Ala 


Glu 


Thr 


He 


Gly 


Phe 








20 










25 










30 






Ser 


Asti 


Hi£: 


Gin 


Arg 


Als 


Lys 


Met 


Asn 


Tyr 


L^u 


Tyr 


He 


Leu 


He 


Thr 






35 










40 










45 








Leu 


Leu 


Leu 


Ser 


Gly 


Phe 


Phe 


Ser 


Gly 


Ala 


Glu 


He 


Ala 


Phe 


Leu 


Ser 




50 










55 










60 










s«r 


Aep Lys 


T.eil 


Arg 


Leu 


Glu 


Leu 


As-p 


Arg 


Asn 


Arg 


Gly Asp 


Leu 


Thr 


65 










70 










75 










80 


Gly 


Arc? Ala 


Leu 


Asn 


Leu 


Leu 


Tyr 


Arg 


His 


Pro 


Asp 


Gin 


Leu 


Val 


Thr 










85 










90 










95 




Thr 


Leu 


Leu 


Val 


Gly 


Asn 


Asn 


He 


Val 


Leu 


Val 


Val 


Tyr 


Gly 


Leu 


Leu 








100 










105 










110 






Met 


Ala 


Gly 


Leu 


Leu 


Alct 


Ala 


Pro 


Leu 


Ala 


Gin 


Trp 


He 


Asp 


Asn Asp 






115 










120 










125 








Ala 


Met 


He 


Val 


Val 


Leu 


Gin 


Ser 


Val 


Leu 


Sor 


Thr 


He 


He 


He 


Leu 




130 










135 










140 










Phe 


Thr 


Gly 


Glu 


Phe 


Leu 


Pro 


Lys 


Ala 


He 


Phe 


Lys 


Thr 


Asn 


Ala 


Acn 


145 










150 










155 










160 


Met 


Hot 


Met 


Arg 


Val 


Phe 


Ala 


Leu 


Pro 


He 


Val 


Ala 


He 


Tyr 


Tyr 


Leu 










165 










170 










175 




Leu 


Tyr 


Pro 


Leu 


ser 


Lys 


Leu 


Phe 


Thr 


Gly 


Leu 


Ser 


Arg 


Ser 


Phe 


He 








lOO 










185 










190 






Arg 


r.eti 


Val 


Asp 


Lys 


Asn 


Tyr 


Val 


Pro 


Thr 


Thr 


Val 


Gly 


Leu 


Gly Arg 






195 










200 










205 








Val 


Asp 


Leu 


Arp 


Hlj? 


Tyr 


Leu 


Ala 


Glu 


Arn 


Met 


Ser 


Gly 


Glu 


Asn 


Glu 




210 










215 










220 










Gin 


Aan 


A*p 


Leu 


Thr 


Thr 


Glu 


Val 


Lya 


He 


He 


Gin 


Asn 


Ala 


Leu 


Asp 


225 










230 








235 










240 


Phe 


Ser 


Giy 


He 


Gin 


Val 


Arg 


Asp 


Cys 


Met 


He 


Pro 


Arg 


Asn 


Glu 


Met 










245 










250 










255 




He 


Ala 


Cys 


Glu 


L-u 


Gin 


Thr 


Asp 


He 


Glu 


Val 


Leu 


Lys 


Thr 


Thr 


Phe 








260 










2G5 










270 






He 


^^p Thr 


Gly 


Leu 


Ser 


Lys 


He 


He 


He 


Tyr 


Arg 


Gin 


Asn 


He 


Asp 






275 










280 










285 








Asp 


Val 


Val 


Gly 


Tyr 


He 


His 


Ser 


S=r 


Glu 


Het 


Phe 


Arg 


Giy 


Gin 


Asp 
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40 



45 



50 



55 



60 



65 



70 





290 










295 










300 










Trp 


Gin 


Lys 


Pirg 


He 


Asn 


Thr 


Thr 




Phe 


Val 


Pro 


Glu 


Ser 


Met 


Tyr 


305 










310 










315 










320 


Ala 


Asn 


Lys 


Leu 


Met 


Arq 


Leu 


Leu 


Met 


Gin 


Arg 


Lys 


Lys 


St*r 


He 


Ala 










325 










330 










335 




Ila 


Val 


lie 


Asp 


Glu 


Leu 


Gly 


Gly 


Thr 


Ala 


Gly 


Met 


Val 


Thr 


Leu 


Glu 








34 0 










315 










350 








Leu 


Val 


Gl\i 


Glu 


He 


Phe 


Gly 


Asp 


He 


Glu 


Asp 


Glu 


His 


Asp Thr 






355 










360 










365 








Arg 


Lys 


He 


He 


Ala 


Lys 


Gin 


Lsu 


Gly 


Pro 


HI.-? 


Thr 


Tyr 


Leu 


Val 


Ser 




370 










37b 










380 








Gly Arg 


Met 


Giu 


lie 


Asp JKsp 


Vai 


Asn 


Glu 


Arq 


Phe 


Gly 


Leu 


Ser 


Lfeu 


395 










390 










395 










4 00 


Pro 


Glu 


Her 


Asp 


Asp 


Tyr 


L-u 


Thr 


Val 


Ala 


Gly 


Phe 


li- 


T.su 


Aen 


Ser 










405 










410 










415 




His 


Gin 


Asn 


rie 


Pro 


Gin 


Ala 


Asn 


Giu 


Val 


Val 


Glu 


ne 


Ala 


Pro 


Tyr 








'120 










42S 










430 






Thr 


Phe 


Thr 


lie 


Leu 


Arq 


Ser 


Ser 


Ser 


Thr 


I.ys 


He 


Glu 


Leu 


Val 


Lys 






436 










4'10 










445 








net 


Ssr 


Tift 


Asp 


Asp Gin 


Ssr 


Asn 




















.J SO 










455 





















10 



15 



20 



iz) irirowtATioii for seq id ho:30S 

25 iJ SEQUEHCF. CHARACTERISTICS: 

IA) LENGTH: 299 amino acids* 

IB) TVPE: amino acid 
{DJ TOPOLOGY: linear 

30 Ui) fiOLECULE TYPE: protein 

(liil HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 
35 {A) OKGANISM: Porphyromonas glnglvalls 

{ixl FEATURE: 

(Al HAf-fE/KEY: misc_foature 
iB) LOCATION 1...2 99 



[xij SEQUENCE DESCRIPTIOTJ : SEQ ID NO: 305 



T.fiU 


Lys 


Lys 


Glu 


He 


Thr 


Met 


Lys 


Gin 


Asn 


Tyc 


Phs 


Lys 


Arg 


Val 


Cyf 


1 








5 










10 










15 




Ser 


Leu 


Leu 


Trp 


Leu 


Val 


Leu 


Pro 


Met 


Leu 


He 


Met 


Pro 


Leu 


Glu 


Vsl 








20 










25 










30 






Ala 


Ala 


Gin 


Glu 


He 


He 


Pry 


Asn 


Glu 


Glu 


Val 


Leu 


Glu 


Ser 


Leu 


Thr 






35 










40 










43 








Phe 


Val 


Ala 


Pro 


Val 


Glu 


Glu 


Thr 


Asp 


Ala 


He 


Glu 


Ala 


Glu 


Val 


Giu 




50 










55 










€0 










Ala 


Leu 


Gin 


Glu 


He 


Val 


Ala 


Thr 


Glu 


Glu 


He 


Ala 


Glu 


Gin 


Ala 


Val 


65 










10 










75 










80 


Arg 


Ser 


Tyr 


Thr 


Tyr 


Thr 


Val 


Tyr 


Arg 


Asp 


Gly 


Val 


Lys 


He 


Ala 


Ser 










85 










90 










95 




Gly 


Leu 


Thr 


Glu 


Pro 


Thr 


Phe 


Leu 


Aj?p 


Glu 


Asp 


Val 


Pro 


Ala 


Gly Giu 








100 










105 










110 






His 


Thr 


Tyr 


Cys 


Val 


Glu 


Val 


Gin 


Tyr 


Gin 


Gly 


Gly Val 


Ser 


Asp 








115 










120 










125 






Val 


Cys 


Val 


Asp 


Val 


Glu 


Val 


Lys 


A^p 


Phe 


Lys 


Pro 


Val 


Thr 


Asn 


Leu 




130 










135 










140 










Thr 


Gly 


Thr 


Ala 


Ser 


Asn 


Asp 


Giu 


Val 


Ser 


Leu 


Asp 


Trp 


Asp 


Gly 


Val 


145 










150 










155 




160 


Glu 


Glu 


Lys 


Ala 


Glu 


Glu 


Pro 


Ala 


Ser 


Asp 


Lys 


Ala 


Val 


Ser 


Tyr Asn 










165 










170 










175 




Val 


Tyr 




Asn 


Gly 


Thr 


Leu 


He 


Gly 


Asn 


Thr 


Ala 


Glu 


Thr 


His 


Tyr 








180 










105 










190 






Val 


Glu 


Thr 


Gly 


Val 


Ala 


Asn 


Gly 


Thr 


Tyr 


He 


Tyr 


Glu 


Val 


Giu 


Val 






195 










200 








205 








Lys 


Tyr 


Pro 


Asp 


Gly 


Val 


Ser 


Pro 


Lys 


Val 


Ala 


Val 


-Thr 


Val 


Thr 


Val 




210 










215 










220 










Thr 


Asn 


Ser 


Ser 


Leu 


Ser 


A.^-n 


Val 


Asp 


Gly 


Gin 


Ala 


Pro 


Tyr 


Thr 


Le=j 


225 










230 










235 








240 


Arg 


Val 


Glu 


Gly 


Lys 


Lys 


He 


He 


Ala 


Glu 


Ala 


His 


Gly Met 


He 


Thr 










245 










250 










255 




L«u 


Tyr 


Asp 


He 


Acn 


Gly 


Arg 


Thr 


Val 


Aia 


Val 


Ala 


Pro 


Asn 


Arg 
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10 



Glu Tyr Met Ala Gin Th. Gly P^e Tyr Ala V.l Arg Phe Val Gly 

Asn Ly^ His His Val Ser Lys Tie Gin Val Arq 
290 295 

(2) IHFORIIATIW FOR SEQ ID NO;306 

(i) SEQUENCE CHARACTERISTICS' 

(A) LEIIGTH: 377 amino acids 

(B) TYPE: amino acid 
iO) TOPOLOGY: linear 

^_ (11) MOLECULE TYPE: protein 

15 

(xii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 
20 ORGANISM: Porphyromonas gingivalir 

iix) FEATURE: 

(A) HAIIE/KEY: misc feature 
(R) LOCATION 1 . . .377 

SEQUENCE DESCRIPTION: SEQ ID N0:30G 
Leu ?hc A.'.n His Lys Lys Ser Trp Tyr Glu Thr Phe Gin Phe Tyr Leu 
3^ Val val Phe Arg Ser Gly Phe He Leu IL Ci*. Glu Arn Thr I^u Ala 
Gin Gin T.y.s Thr Glu Glu Phe Ala Pro Val Ser A.-p Leu Arg Ala Glu 
Ala Tyr Gly Ser Thr Val Phe i^u Mi. Trp Thr Pro Pro Tyr A.p Aan 
35 Pro Met He Pro I..u Ser Glu Ser Phe Glu Ser G^y He p.o Ala He 
Trp Ly. Thr He A.p Ala A.p Gly A.p Gly ^y. A.n Trp Met HI. Leu 
Thr Asn Phe Thr Gly Gin Ser Gly Leu Cys Val Ser Ser Ala ler Tyr 



lie Gly Gly Val Gly Ala Leu Thr Pro Asp A.n Tyr Leu III Thr Pro 

Ala 

45 cv, oin A-.p Leu Th. ifa Pro s« 31u HI. rtl Al. Val Tyr Ser 



Glu L.U Ly. L=u Pro Thr A=p Ala Leu Val Glu lie l" Tyr Trp Val 



S«r Scr Thr Gly A=n Z'r. Ala Ala A=p Phe Vaf A.n Leu Leu Tyr Glu 
Glu Thr Leu Thr Ala Lys Aro He Gin Sex Prv Glu Leu tie ^rg Gly 
Asn Ar, Thr Gin Gly Val Trp Tyr G^n Ar, Ly, Val Val II' Pro A.n 



?n*^ 

A.P Thr Li-s Tyr Val Ala Phe Arg Hi. Phe A.„ Ser Thr A.p ;^.„ pne 

55 Trp Leu A.„ Leu A,p Glu Val Ser He Leu Tyr Thr Pro Leu Pro Arg 

Arg Ala Pro Cys Pro His Pro Gly Gly Tyr ?hr Tyr Scr Val Phe ^g 

ASP Gly Gin Lys He Ala Ser Gly Leu III Ala Leu Ala Tyr ll" A.p 

Thr A.P Val Pro Tyr Gly Thr Gin Asp Tyr Cys Val Gin vl? Asn Ty. 



Leu Gin Oly A.p Ser Tyr Lys Val Cy. Ly. Asn He III Val Ala A.n 
65 ser Ala Asn He Tyr Gly Ala Asp Lys Pro Phe A^a Leu Thr Val Val 
Gly Ly. Thr He Val Ala Ser Ala Phe Lys Gl'v Glu He Th. Leu Tyr 
Asp He Arg Gly Arg Leu He Ala Ser Gly Cys A.^.Thr Leu Arg Tyr 
Ly. Ala Glu Asn Gly Phe Tyr Leu He Lys He Gin Val Asn Gly Thr 



75 



Val Tyr Thr Glu J,ye He Gin He Gin 
370 



365 



PCT/AU98/01023 
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12) IHFORimTION FOR SEQ ID HO: 301 

(i) ytQUtNCE CHARACTERISTICS: 

<A> LENGTH: 84 9 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY; linear 

(11) tlOLECULE l-ypE: protein 

10 (iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

15 (ix) FEATURE: 

{A) HAME/KE'i: misc_feature 
(R> LOCATION 1 . , .04 9 

fxil * 

20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 





txi) 


SEQUEHCE DESCRIPTION: SEQ ID NO: 307 










Phe 


Gly 


Tift 


Ser 


Pro 


Ser 


Met 


Lys 


Lys 


Ser 


Phe 


Leu 


Leu 


Ala 


He 


Val 


1 








5 










10 










15 




Met 


Leu 


Phe 


Gly 


He 


Ala 


Het 


Gin 


Gly 


Mis 


Ser 


Ala 


Pro 


Val 


Thr 


Lys 








20 










25 










30 






Glu 


Arg 


Ala 


I<GU 


s^r 


Leu 


Ala 


Arg 


Leu 


Ala 


Leu 


Arg 


Gin 


Vai 


Ser 


Leu 






35 










40 










45 








Arg 


Met 


Gly 


air. 


Thr 


Ala 


Val 


Scr 


A^p 


T.ys 


He 


S*r 


He 


Asp 


Tyr 


Val 




50 










55 










60 










Tyr 


Arg 


Gin 


Gly 




Ala 


Glu 


Arg 


Gly 


He 


Thr 


Ser 


Gin 


Glu 


Glu 


Gly 


65 










70 










75 










80 


ii«r 


Pro 


Ala 


Tyr 


Phe 


Tyr 


Val 


Ala 


Asn 


Arg 


Gly 


Asn 


Asn 


Glu Gly Tyr 










flS 










90 










95 




Ala 


Leu 


Val 


Al a 


Ala 


Asp 


Asp 


Arg 


He 


Pro 


Thr 


He 


Leu 


Ala Tyr 


Ser 








100 










105 










110 






Pro 


He 


Gly 


Arg 


Phe 


Asp 


Met 


Asp 


Ser 


Met 


Pro 


Asp 


Asn 


Leu Arg 


Met 






115 










120 










125 








Trp 


Leu 


G]n 


He 


Tyr 


A5P 


Gin 


Glu 


He 


Gly 


Leu 


He 


Leu 


Ser 


Gly Lys 




130 










135 










140 










Ala 


Gin 


Leu 


Asn 


Glu 


Glu 


He 


Leu 


Arg 


Thr 


Glu 


Gly 


Val 


Pro 


Ala 


Glu 


14 5 










150 










155 










160 


Val 


His 


Ala 


L«u 


Met 


Asp 


Asn 


Gly 


His 


Phe 


Ala 


Asn 


Asp 


Pro 


Met 


Arg 










165 










170 










175 




Trp 


Asn 


Gin 


Gly 


Tyr 


Pro 


Trp 


Asn 


Asn 


Lys 


Glu 


Pro 


Leu 


Leu 


Pro 


Asn 








180 










185 










190 






Gly 


Asn 


His 


Ala 


Tyr 


Thr 


Gly 


Cys 


Val 


Ala 


Thr 


Ala 


Ala 


Ala 


Gin 


lis 






195 










200 










205 








Met 


Arg 


Tyr 


His 


Ser 


Trp 


Pro 


Leu 


Gin 


Gly 


Glu 


Glv 


Ser 


Phe 


Asp 


Tyr 




210 










215 










220 










His 


Ala 


Gly 


Sec 


Leu 


Val 


Gly 


Asn 


Trp 


Ser 


Gly 


Thr 


Phe 


Gly 


Glu 


Met 


22b 










230 










23S 










240 


Tyr 


Asp Trp 


He 


Asn 


Met 


Pro 


Gly 


Asn 


Pro 


Asp 


Lsu 


Asp 


Asn 


Leu 


Thr 










245 










250 










255 




Gin 


Ser 


Gin 


Val 


Asp 


Ala 


Tyr 


Ala 


Thr 


Leu 


Met 


Arg 


Asp 


Val 


Scr 


Ala 








260 










265 










270 






Ser 


Val 


Str 


Met 


Ser 


Phe 


Tyr 


Glu 


Asn 


Gly 


Ser 


Gly 


Thr 


Tyr 


Ser 


Val 






27G 










280 










285 








Tyr 


Val 


Val 


Gly 


Ala 


Leu 


Arg 


Asn 


Asn 


Phe 


Arg 


Tyr 


Lys 


Arg 


Ser 


Leu 




290 










295 










300 










Gin 


Leu 


Hi£? 


V^il 


Arg 


Ala 


Leu 


Tyr 


Thr 


Ser 


Gin 


Glu 


Trp 


His Asp Mot 


305 










310 










315 










320 


I] e 


Arg 


Gly 


Glu 


Leu 


Ala 


Ser 


Gly 


Arg 


Pro 


Val 


Tyr 


Tyr 


Ala 


Gly Asn 










325 










330 










335 




Asn 


Gin 


Ser 


He 


Gly 


His 


Ala 


Phe 


Val 


cys 


Asp 


Gly 


Tyr 


Ala 


Ser 


Asp 








340 










345 










350 






Gly 


Thr 


Phe 


His 


Phe 


Affn 


Trp 


Gly 


Trp 


Gly 


Gly 


Val 


Ser 


Asn 


Gly 


Phe 






355 










360 










365 








Tyr 


Lys 


L«;U 


Thr 


Leu 


Leu 


Ser 


Pro 


Thr 


Ser 


Leu 


Gly 


He 


Gly Gly 


Glu 




310 










375 










380 










Gly 


He 


Gly 


Phe 


Thr 


lie 


Tyr 


Gin 


Glu 


He 


He 


Thr 


>Gly 


Tie 


Glu 


Pro 


385 










390 










395 










400 


Ala 


L/s 


Thr 


Pro 


Ala 


Glu 


Ala 


Gly 


Thr 


Asp 


Ala 


Leu 


Pro 


He 


Leu 


Ala 










4 05 










410 










415 




T.eu 


Lys 


Asp 


He 


Giu 


Ala 


Glu 


Tyr 


Lys 


Ser 


Glu 


Ser 


Gly 


Leu 


Asn 


Yal 








420 










425 










430 






Gly 


Tyr 


Sot 


He 


Tyr 


Asn 


Thr 


Gly 


Glu 


Glu 


G1 n 


Ser 


Asn 


Leu 


Asp 


Leu 
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65 







435 










440 










445 








Giy 


Tyr 


Arg 


Leu 


Asn 


Lys 


Ai a 


A«p 


Gly 


Glu 


Va 1 


Tie 


Glu 


Val 


Lys 


'i'hr 


450 










455 










460 










Ser 


Ser 


lie 


As n 


ILe 


Ser 


Trp 


Tyr 


Gly Tyc 


Gly 


Glu 


His 


Pro 


Glu 


Ser 


4 6S 










4/0 










475 










480 


Phe 


Se r 


LS:U 


Ala 


Pro 


Asn 


Gin 


Leu 


Ser 


Gin 


Gl y 


Tls 


Asn 


Thr 


11- 


Thr 










485 










490 










4 95 




Leu 


Leu 


Tyt 


Arg 


Arg 


Thr 


Gly 


Thr 


Glu 


Gin 


Trp 


Glu 


Pro 


Val 


Arg 


His 








500 










505 










510 






Ala 


Gift 


Gly 


Gly Tyr 


Val 


A-n 


Ser 


lie 


Lys 


\ a 1 


Asn 


Thr 


Thr 


Asp 


Pro 






515 










520 










525 








Asn 


Asn 


Val 


Val 


Val 


Thr 


Val 


Asp 


Asn 


Asn 


Glu 


Gly 


Lys 


Leu 


Ser 


He 




530 










535 










540 










Val 


Pro 


Asn 


Ser 


Phe 


Val 


Ala 


Asp 


Lsu 


Asn 


Ser 


Tyr 


Glu 


His 


Ser 


Thr 


545 










550 










555 










560 


lie 


Thr 


Val 


Gin 


Phe 


Asn 


Ser 


Asp 


Ser 


Pro 


Asp 


Glu 


He 


Arg 


Thr 


Pro 










565 










570 










575 




Val 


Al a 


Phe 


Ala 


Leu 


Ser 


Thr 


Gly Ala Thr 


A3 a 


Asp Asp Val 


He 


Ser 








SflO 










585 










590 






Leu 


Giy 


Trp 


Val 


Met 


Aln 


Glu 


Val 


Pro Gly 


Gly 


Ser 


Ser 


Asn 


Tyr 


Pro 






595 










600 










605 








Val 


Val 


Trp 


Ser 


Lys 


Asp Val 


Leu 


Thr 


Leu 


Ser 


Glu 


Gly Asp Tyr 


Thr 




61 0 










61b 










620 










Leu 


Tip 


Tyr 


Arg 


Phe 


Szt 


lie 


Am 


Asn 


Gin 


Lys 


Asp 


Glu 


Trp 


Lys 


Lys 


625 










630 










635 










640 


Tl o 


Gly 


Ser 


Val 


y^r 


val 


Lys 


Thr 


Pro 


Thr 


Glu 


Tyr Thr 


His 


Pro 


Leu 










645 










650 










655 




Phe 


Glu 


Val 


Gly 


His 


Am 


Gin 


Thr 


Sec 


Thr 


Tyr 


Thr 


Leu 


Asp 


Met 


Ala 








660 










665 










67 0 






HI 


Artr. 


Arg 


Val 


Leu 


Pro 


Asp 


Fhfe 


Thr 


Lou 


Lys 


Asn 


Leu 


Gly 


Leu 


Pro 






675 










600 










685 








Phe 


Asn 


Gly 


Glu 


Leu 


Val 


Val 


Val 


Phe Arg 


Gin 


Thr 


Gin 


Ser 


ser 


Ser 




690 










695 










700 










Gly 


Set; 


Leu 


Trp 


Ala 


Ala 


Gin 


Glu 


Thr 


Val 


His 


He 


Lys 


Gin 


Gly 


Glu 


705 










710 










715 










720 


Thr 


Phe 


Val 


Tyr 


Lys 


Pro 


Val 


Val 


Glu 


Gly 


Pro 


He 


Pro Asp 


Gly 


Ser 










725 










730 










735 




Tyr 


Arg 


Ala 


Thr 


Leu 


His 


Ala 


Phe 


Val 


Asn 


Gly 


Gin 


Gin 


Gin 


Leu 


Tyr 






740 










745 










750 






Leu 


Lys 


Gly Lys Arg 


Asn 


Tyr 


Thr 


Val 


Lys 


He 


Val 


Asn 


Gly Thr 


Ala 






755 










760 










765 








Val 


Glu 


Ala 


He 


Glu 


Ser 


Ser 


Glu 


Glu 


lis 


Ara Val 


Phe 


Pro 


Asn 


Pro 




770 










775 










780 










Ala 


Arg 


Asp 


Tyr 


Val 


Glu 


lie 


Ser 


Ala 


Pro 


Cys 


He 


Pro 


Gin 


Glu 


Thr 


785 










790 










7 95 










80C 


Ser 


lie 


lie 


Leu 


Phe 


Asp Leu 


Scr 


Gly Lys 


He 


Val 


/iet 


Lys 


Asn 


Ser 










805 










810 










015 




Leu 


Ser 


Ala 


Gly 


His 


Gly 


Arg Het 


Asp Val 


Ser 


Arg 


Leu 


Pro 


Asn 


Gly 








820 










825 










830 






Ala 


Tyr 


lie 


Leu 


Lys 


Val 


Asp 


Gly Tyr 


Thr 


Thr 


Lys 


He 


Asn 


He 


Val 






835 










840 










845 








His 

































10 



15 



20 



25 



30 



35 



40 



45 



50 



55 (2) TUFOraiATlOU FOR StQ ID 140:308 

(i) SEQUENCE CHARACTERISTICS: 

IA) LENGTH: 295 amino acids 

IB) TYPE: amino acid 
60 (D) TOPOLOGY: linear 

Ui) HOLEOJLS TYPE: protein 

(ill) HVPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

<A1 ORGAtllSH: Porphyromonaa gtngivalis 



(iJt) FEATURE: 
70 (A) IIAIIE/'KEY: mlsc_feature 

(B) LOCATIOH 1. . .295 

Ui) SEQUEUCE DESCRI PTIOtI : SEQ ID IIO:308 

75 Thr Lys Lys Thr Leu Uet Lys Lys Leu PhG I.eiu Ser Leu The Ser Leu 
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5 



10 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 





Met 


Val 


Phe 


Ala 


Val 


Ala 


Ser 




Asp 


T.l¥r 


He 


Asp 


Lys 


Asp 


Gin 








20 










25 










30 






Thr 


Leu 


Leu 


Pro 


Ala 


Pro 


Thr 


Asn 


Val 


Thr 


Pro 


Asp Asn 


Pro 


Asp Asp 






35 










40 










45 








Asn 


Pro 


Ser 


Olu 


He 


Asp 


He 


Thr 


Gin 


Thr 


HI s 


Thr 


Glu 


Lys 


Tyr 


Val 




5C 










55 










60 










Leu 


Aia 


Glu 


Glu 


Phe 


Thr 


Gly Oln 


Lys 


Cya 


Leu 


Asn 


Cys 


Pro 


Lys 


Gly 


65 










70 










15 










80 


His 


Axg 


Lys 


Leu 


Ala 


Aia 


Leu 


Lys 


Glu 


Gin 


Tyr 


Gly 


Ly5 


Arg 


Leu 


Thr 










85 










90 










95 




Val 


Val 


Gly 


Tie 


His 


Ala 


Gly 


Pro 


Gly 


Ser 


Leu 


Viii 


Pro 


Pro 


Leu 


Phe 








100 










105 










110 






Rrq 


'I'hr 


Glu 


Aia 


Gly 


Asp 


Ala 


Tyr 


Tyr 


Sfir 


Ly^ 


Phe 


Ai& 


Asn 


Asn 


Thr 






lis 










120 










125 








Pco 


Leu 


Pro 


Ala 


Lsu 


Met. 


Val 


ser 


Ac<g 


Lys 


Lys 


Phe 


Gly 


Ser 


Ser 


Tyr 




130 










135 










14C 










Val 


Tyr 


Asp 


Lys 


Ser 


Tyr 


Lys 


Thr 


Trp 


Asp 


Val 


Pro 


He 


Ala 


Glu 


Gin 


145 










150 










155 










160 


lleL 


Glu 


Gin 


Lys 


Ala 


Lys 


He 


Asn 


lie 


Phe 


Ala 


Val 


Ala 


Glu 


Tyc 


Thr 










165 










170 










175 




As-p 


Thr 


Gin 


Lys 


lie 


Ly« 


Val 


Thr 


Val 


Ly.^ 


Gly 


Lya- 


11- 


Leu 


Glu 


Gly 








180 










185 










190 






Asii 


Tlir 


Leu 


Pro 


Lyff 


Ser 


flet 


Val 


Gin 


Vai 


Tyr 


Leu 


Leu 


Glu 


Asp 


Ly5 






19S 










200 










20i: 








Leu 


lie 


Al^ 


Pro 


Gin 


Val 




Gly f^n 


Thr 


Thr 


Val 


Glu 


Asn 


Tyr 


Glu 




210 










21b 










220 










His 


Asn 


Hi^ 


Vol 


Leu 


Arg 


Gly Ala 


Val 


Asn 


Gly 


He 


Tip 


Gly 


Glu 


Glu 


225 










230 










235 










24 0 


Phe 


Val 


Asn 


Leu 


Lyf 


Asp 


Tyr 


Leu 


Tyr 


Thr 


Tyr 


Ala 


Val 


Glu 


Pro 


Leu 










2^5 










250 










255 




Ser 


Gly Met 


Ser 


Phe 


Val 


Ala 


Glu 


Asn 


Tyr 


Ser 


He 


Val 


Ala 


Pha 


Val 








260 










2 65 










2/0 






Tyr 


Asp 


Val 


Gin 


Thr 


Phe 


Glu 


Val 


Tyr 


Asp 


Val 


Val 


His 


Val 


Lys 


He 






215 










2S0 










285 








Asn 


Pro 


Gin 


Scr 


A^p 


Gly 


Lys 






















290 










2 95 





















(2) INFORILATIOM FOR SEQ ID NO: 309 

U) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 230 amino acids 
(BJ TYPL: amino acid 
fD) TOPOLOGY: linear 

{ii) MOLECULE r/PE: protein 

(ill) HVPOTMDTICAL: YES 

(vi) ORIGIIJAL SOURCE: 

(A) ORGAJllSH: Porphyromonas gingivaiij? 

iix> FEATURE; 

tA) IIAIIE/KEY: mi£c_f eatur*- 
tB) LOCATIOrt 1. . .230 



<xi) SEQUEllCE DESCRIPTION: SEQ ID tJO: 309 



Thr 


Asn 


Lys 


Lys 


Glu 


Giu 


Thr 


Met 


Lys 


Lys 


Ser 


Ser 


Val 


Val 


Ala 


Ser 


1 








5 








10 










IS 




Val 


Leu 


Mo 


Val 


Ala 


Leu 


Val 


Phe 


Ala 


Gly 


Cys 


Gly 


Leu 


Ai?n 


Asn 


Mer 








20 










25 










30 






Ala 


Lys 


Gly 


Gly 


Leu 


He 


Gly 


Ala 


Gly 


Val 


Gly 


Gly 


Aia 


He 


Gly Ala 






35 










40 










45 








Gly 


Val 


Gly 


Asn 


Val 


Ala 


Gly 


Asn 


Thr 


Aia 


Val 


Gly 


Ala 


lU 


Val 


Gly 




50 










55 










60 










Thr 


Ala 


Val 


Gly 


Gly 


Ala 


Ala 


Gly 


Ala 


Leu 


lie 


Gly 


Lys 


Lys 


Met 


A£p 


65 










70 










7 5 










80 


Lys 


Gin 


Ly^ 


hys 


Glu 


Leu 


Glu 


Ala 


Ala 


Val 


Pro 


Tvsp- 


.Ala 


Thr 


He 


Gin 










95 










90 










95 




Thr 


Vai 


Asn 


Any 


Gly 


Glu 


Ala 


He 


Leu 


Val 


Thr 


Phe 


Asp 


Ser 


Gly 


He 








100 










105 










HQ 






Leu 


Phe 


Ala 


Thr 


Asn 


Sec 


Ser 


Thr 


Leu 


Ser 


Pro 


Asn 


Ser 


Arg 


Thr 


Ala 






H5 










120 










126 








Leu 


Thr 


Lyf 


Phe 


Ala 


Ala 


A5n 


Ket 


Asn 


Ly= 


Aen 


Pro 


A5-p 


Thr 


Asp 


He 
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130 131> \A0 

Arg He Val Gly His Thr Asp Asn Thr Gly Ser Asp Lys He Asn Asp 
145 150 155 160 

Pro Leu Ssr Giu Arg Arg Ala Ala Ser Val Tyr Ser Phe Leu Asn Ser 
5 1G5 no 175 

Gin Gly Vui Ser Mst Ser Arg Met Ala Ala Glu Gly Arg Gly Ser Kis 

180 IBS 190 

Glu Pro Val Ala Asp Asn Ser Thr Val Ala Gly Arg Ser Ala Asn Arg 
19S 200 205 

10 Arg Val Glu Val Tyr lie Leu Pro Asn Ala Lys Met He Glu Gin Ala 
210 215 220 

Gin Gin Gly Thr Leu Lys 
225 230 

15 (2) INFORMATION FOR SEQ ID HO: 310 

(1) SEQUEKCE CHARACTERISTICS: 

(A J LENGTH: 342 amino acids 
(S) TYPE: amino acid 
20 (U) TOPOLOGY: linear 

(iil MOLECULE TYPE: protein 

(lilt HYPOTHETICAL: YES 



25 



(vH ORIGINAL SOURCE: 

(A) ORGAIJTSH: Porphyromonas gingivalis 



(1x1 FEATURt;: 
30 (A) NAl-IE/KEY: misc_f eaturt? 

(B) LOCATION 1. . .342 

(Xi) SEQUENCE DESCRIPTIOfJ: SEQ ID NO: 310 

35 Asn Arg Aan Arg Asn Met Ser Lys Lys Ser lie Leu Leu Leu Cys Cys 
15 10 IS 

Scr Leu Cys Phe lie Ser Ala Thr Lys Ala Val Thr Pro Val Ara Asn 

20 25 30 

Val Arg Asn Ser Gin Val Asn Ser Lys Ala Lys Thr Glu Arg Thr Lys 
40 35 4 0 4 5 

Pro Ser Asp Ser Vai Arg Tyr He Ser Asn Met lie Ala Asp Arg Leu 

50 53 60 

Glu Phe Arg Asn Lys He Ser Ser Giu Lys Glu VqI Arg Lys Ala Glu 
65 70 75 80 

45 Tyr Glu Asn Arg Leu Ala Met Glu Ala Leu A:sn Tyr Pro Ala He Asp 
85 90 95 

Leu Tyr Gly Glu Asp Ser Trp Ser Glu Tyr Val Asn Pro Phe Val Gly 

100 105 110 

Ala Gly Thr Asp Val Glu He Pro Asn Ser Tyr Asp He Asp Cys Ser 
50 115 120 125 

Ser Phe Val Met Pro Val Glu Asp hys Gin Val Thr Ser Gin Phe Gly 

130 135 140 

Tyr Arg Arg Arg Phe Gly Arg Met His Tyr Gly He Asp Leu Ser Val 
145 150 155 160 

55 Asn Arg Gly Asp Thr He Arg Ala Ala Phe Asp Gly Lys Val Arg Val 
165 170 173 

TVrg Ser Tyr Glu Ala Arg Gly Tyr Gly Tyr Tyr He Val Leu Arg Hiff 

IQO 185 190 

Pro Asn Gly Leu Glu Thr Val Tyr Gly His Hot Ser Ara Gin Leu \.'ai 
60 195 200 205 

Aep Glu Asn Gin He Val Arg Ala Gly Gin Pro He Gly Leu Gly Gly 

210 215 220 

Ser Thr Gly Arg Ser Thr Gly Pro His Leu His Phe Glu Thr Arg Phe 
225 230 235 240 

65 Met Gly He Pro Il« Asn Pro Ser Thr He He Asp Phe Asp Asn Gly 
245 250 255 

Val Pro Leu Arg Asp He Tyr Thr Phe Lys Arg Gly Ser Asn Ser Arg 

260 265 270 

Tyr Ala Lys Ala Set I.ys Thr Ser Ser Arg Tyr Ala Lys Lys Gly Lys 
70 275 280 '285 

Lys Gly Arg Gin Ala Ser Ser Pro Met Thr Tyr Arg lie Lys Lyc Gly 

290 295 300 

Asp Thr Le\i Glu Thr He Ala Lys Arg His Gly Thr Ser Val Gin h'y^ - 
305 310 315 320 

75 l.eu Cys Ala Thr Atn Gly He Gly Lys Ser Lys He Leu Thr Pro Gly 



SXJBSnTUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 



325 

LyJT A J a L-u Arg He Lys 
340 



232/490 



5 (2) IMtORllATION FOR SEQ ID NO: 311 

fl» oF.QUFNCE CHARACTERISTICS- 

(A) LEHGTH: 159 amino acids 
-IP, (S> TYPE: amino acid 

(DJ TOPOLOGY: linear 

(lij riOLF.CULE TYPE: protein 

HYPOTHETICAL: YES 

tvll ORIGINAL SOURCE: 

(A) ORGAfUSM: Porphyromonas gingivali^ 
liy.) FEATURF.: 



20 



(AJ NAME/KEY: misc feature 
(BJ LOCATION 1. . .159 



25 



SEQURMCE DESCRIPTIOH : SEQ ID HO; 311 

Pro Ser Lys Thr He II- Lys Th- H«r Ai^ r.,- n ^ 
^ ^ x.>5 in. M„r. Ala Lys He A-n Phe Tyr Ala 

aiu Giv vo, ... Leu A., it, la. v.l oiy ^f. Tr. 

lie .lu V.l cy. Se. A., Ty. ^I-, A.. Val CJy n, Ser 

Tyr Leu Phe Cys A/?p Asp Glu Tyr II- t zii 

Le. A.P K,. A.P ry. ry. rUr A.p XI. ... |L A.p S.. c.. <3Xu 

35 Ala A.P r.. V,. Asn OX, A.p L.u X.e. XX= sf. .e. A.p T.. VaX 

ser Asn AX. Ax, AXa X.eu A.p Ac, T^r «Xu A.p OXu Leu „'L A., 

^ Va: x^e XX= HXs CXy ZXe Xeu HXs .eu Cys Gly .eu ^.^p" .y, se. 

Lys LV. A.P Glu AXa Gin Met Ar, AXa AXa «Xu GXu ^ys AXa L.u Vai 
He. xeu Ar, GXu Th. tXe GXy S.r GXu Leu Se. Leu Thr 

45 1S5 

(2) IHFORflATIOt4 FOR SEQ ID HO: 312 
(i) SEQUEIJCE CHARACTERISTICS - 

50 !^! .5:5^^"= --in- 3cid3 

tej rypL: amAoo acid 
(O) TOPOLOGY: linear 

Ui) MOLECULE TYPE: protein 

liil) inrPOTHETlCAL: YES 



60 



(vi) ORIGIiJAL SOURCE: 

(A) ORGA^^ISM: Porphyromonas ginqivalis 
(ix) FEATURE: 

(A) MAME/KEY: mi.-rc featur- 

(B) LOCATION 1. . .395 



tXi} SEQUENCE DESCRIPTION: SEQ ID MO:312 
Lys ser Lys T.r Met Lys Val Lys Tyr Leu r.et Leu Thr Leu Val Gly 
Ala no Ala Leu A.n Ala Ser Ala Gin ^l^u Asn T.r Val Pro ..^ 
Cly cm L.U Pre Ala Ly. Asn Val Al, P.e Ala Arv A.n Ala Gly 

ser A3n Trp P.e Val Thr Leu Gin Gly Gly Val Ala IL oin Phe Leu 
Ajn ATP A.n A.n A.n Lys L«u Hot A^p Arg tl .i, 



75 



PCT/AU98/01023 
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10 



15 



20 



25 



30 



35 



40 



Ser 


Leu 


yer 


Val 


Gly 


Lys 


Tyr 


His 


Ssr 


Pro 


Phe 


Phe 


Ala 


Thr 


Arg 


Leu 










85 










90 










95 




Gin 


He 


Arn 


Gly Gly Gin Ala 


His 


Thr 


Phe 


Leu 


Gly 


Lys 


Asn 


Gly Glu 








100 










105 










110 






Gin 


Glu 


He 


Asn 


The 


Asn 


Phe 


Gly 


Ala 


Ala 


His 


Phe 


Asp 


Phe 


Met 


Phe 






115 










120 










125 








Asp 


Val 


Vai 


Asn 


Tyr 


Phe 


Ala 




Tyr Arg Glu 


Asn 


Arg 


Phe 


Phe 


His 




130 










135 










140 










Leu 


He 


Pro 


Trp 


Val 


Gly Val 


Gly Tyr 


Gin 


His 


Lys 


Phe 


He 


Gly Ser 


145 










150 










155 










160 


Glu 


Trp 


Ser 


lys 


Asp 


Asn 


val 


Glu 


Ser 


Leu 


Thr 


Ala 


Asn 


Val 


Gly 


Val 








165 










170 










175 




Met 


Het 


Ala 


Ph% 


Acy 


Leu Gly 


T^ys 




Val 


Asp 


Phe 


Val 


He 


Glu 


Ala 








100 










185 










190 






Gin 


Ala 


Ala 


His 


Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 


Tyr 


Asn 


Ala 


Lys 






195 










200 










205 








Lys 


Thr 


Pro 


Val 


Phe 


Glu 


Arp 


Pro 


Ala 


Gly Arg 


Tyr 


Tyr 


Asn Gly 


Phe 




210 










215 










220 










Gin 


Gli' 


liQt 


Ala 


Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


Gly 


Ala 


Val 


Gly 


225 










230 










235 










240 


Phe 


A.?n 


Ala 


lit 


UnK 


Pro 


fHz 


Asp 


Tyr 


Ala 


L^M 


Tie 




Asp 


Leu 


Asn 










245 










Z50 










255 




Gly 


Gin 


He 


Asn 


Arg 


Leu 


Arq 


Ser 


Glu 


Vai 


Glu 


Glu 


Leu 


Ser 


Lys 


Arg 








260 










265 










27 0 






Pro 


Val 


Ser 


Cys 


Pro 


Glu 


C/s 


Pro 


Glu 


Val 


Thr 


Pro 


Val 


Thr 


Lys 


Thr 






?75 










280 










285 








Glu 


Ail) 


He 


Leu 


Thr 


Gl ij 


hys 


Ala 


Val 


Leu 


Phe 


Arg 


Phe 


Asp 


Ser 


His 




2?0 










295 










300 










Val 


Val 


Asp 


Lys 


Asp 


Gin 


L^u 


lis 


Asn 


I.eu 


Tyr 


Asp 


Val 


Ala 


Gin 


Phe 


305 










310 










315 










320 


Val 


Lys 


Gill 


Thr 


Asn 


Glu 


Pro 


lie 


Thr 


V<.1 


Val 


Gly 


Tyr 


Ala 


Asp 


Pro 










325 










330 










335 




Thr 


Gly 


Asn 


Thr 


Gin 


Tyr Asn 


Glu 


Lys 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


Lys 








340 










345 










350 






Ala 


Val 


Val 


Asp Val 


Leu 


Thr 


Gly 


Lys 


Tyr 


Gly 


Val 


Pro 




Glu 


Leu 






355 










360 










365 








He 


Sec 


Val 


Glu 


Trp 


Lys 


Gly Asp Ser The 


Gin 


Pro 


Phe 


Ser 


Lys 


Lys 




370 










375 










380 










Ala 


Trp 


Asn 


Arg 


Val 


Val 


He 


Val 


Arg 


Ser 


LyiJ 












385 










390 










395 












(2) 


IN f FORMATION 


FOR 


SEQ 


ID WO: 31 3 

















45 



(i) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 397 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 



50 



55 



60 



65 



70 



75 



Hi) MOLECULE TYPE: protein 

<ili) HYPOTHETICAL: YES 

tvij ORIGINAL SOURCE: 

(A! ORGANISM: Porphyromonas glngivalis 

tlx) FEATURE: 

(A) name; KEY: mlffcfeature 

(B) LOCATION 1. . .387 





(-xi) SEQL'EMCE DESCRIPTIOfJ : SEQ ID NO: 313 








Tyr 


Lys Met 


Thr 


Tyr 


Arg 


Il^> 


Met 


Lys 


Ala 


Lys 


Ser 


Leu 


Leu Leu 


Ala 


1 






S 










10 








15 




Leu 


Ala Gly 


Leu 


Ala 


Cys 


Thr 


Phe 


Ser 


Als 


Thr 


Ala 


Gin 


Glu Ala 


Thr 






20 










25 










30 




Thr 


Gin Asn 


Lys 


Ala 


Gly 


Met 


His 


Thr 


Ala 


Phe 


Gin 


Arg 


Asp Lys 


Ala 




35 










40 










45 






Ser 


Asp His 


Trp 


Phe 


He 


Asp 


He 


Ala 


Gly 


Gly 


Ala 


Gly 


Met Ala 


Leu 




SO 








55 










60-- 








Ser 


Gly Trp 


Asn 


Asn 


Asp 


Val 


Asp 


Phe 


Val 


Asp 


Ary 


Leu 


Ser He 


Val 


65 








70 










75 








80 


Pro 


Thr fhe 


Gly 


He 


Gly 


Lys 


Trp 


His 


Glu 


Pro 


T^-r 


Phe 


Gly Thr Arg 








85 










90 








95 




Leu 


Gin Phe 


Thr 


Gl y 


Phe 


Asp 


He 


Tyr 


Gly 


Ph^ 


Pro 


Gin 


Gly Ser 


Lys 






100 










105 










110 
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55 



60 



65 



70 



75 



GlU 


Arg 


Asn 


His 


Asn 


Tyr 


Phe 


Gly 


Asn 


Ala 


His 


Lou 


Asp 


Phe 


Met 


Phe 






1 15 










120 










125 








Asp 


Leu 


Thr 


Asn 


Tyr 


Phe 


Gly 


Val 


Tyr 


Arg 


Pro Asn Arg 


Val 


Phe 


His 




130 










135 










IJO 










lie 


lie 


t>ra 


Tcp 


Ala 


Giy 


He 


Gly 


Phe 


Gly Tyr 


Lys 


Phe 


tils 


Ser 


Glu 












ISO 










155 










ISO 


Asn 


Ala 


Asn 


Gly 


Glu 


Lys 


Val Gly Sec 


Lys 


Asp Asp Met 


Thr 


Giy 


Thr 










165 










170 










175 




Val 


Pisn 


Val 


Giy 


Leu 


Mel 


Leu 


I.y.'; 


Phe 


Arg 


Leu 


Ser 


Arg 


Val 


Val Asp 








IQO 










195 










190 






Phe 


Pisn 


lie 


Glu 


Gly Gin 


Ala 


Phe 


Ala 


Gly 


Lys 


Met 


Asn 


Phe 


He 


Gly 






195 










200 










205 








Thr 


hys 


Arg 


Gly 


Lys 


Ala 


Asp 


Phe 


Pro 


Val 


Met 


Ala 


Thr 


Ala 


Gly 


Leu 




210 










215 










220 










Thr 


Phe 


Asn 


Leu 


Gly 


Lys 


Thr 


Glu 


Trp Thr 


Glu 


He 


Val 


Pro 


Met 


Acp 


225 










230 










235 










240 


Tyr 


Ala 


Leu 


Val 


Asn 


Asp 


Leu 


Asn 


Asn 


Gin 


Tie 


Arsn 


Ser 


Leu 


Ara 


Gly 










2^5 










250 










255 




Gin 


Val 


Glu 


Glu 


Leu 


Ser 


Aro 


Arq 


Pro 


Val 


Ser 


Cys 


Pro 


Glu 


Cys 


Pro 








260 










265 










270 






Glu 


Pro 


Thr 


Gin 


Pro 


Thr 


Val 


The 


Arg 


Val 


Val 


Val 


Asp 


Asn 


Val 


Val 
















2ii0 










285 








Tyr 


Phe 


Arg 


He 


Aen 


Ser 


Ala 


Lys 


He 


A^p 


Arg Asn 


Gin 


Glu 


He 


Asn 




290 










295 










30C 










Val 


Tyr 


Asn 


Thr 


Ala 


Glu 


Tyr Ala 


Lys 


Thr 


Asn 


Asn 


Ala 


Pro 


He 


Lys 


305 










310 










315 










320 


Vdl 


Val 


Gly Tyr 


Ala 


Asp Glu 


Lys 


Thr Gly Thr 


Ala 


Ala 


Tyr Asn t'.o~ 










326 










330 










335 




Lys 


Leu 


aer 


filu 


Arg 


Arg 


Ala 


Lys Ala 


Val 


Alfl 


Lys 


Met 


Leu 


tilu 


Ly£ 








3*10 










3^15 










350 






Tyr Gly 


Val 


Ser 


Ala 


Asp Arg 


He 


Thr 


He 


Glu 


Trp 


Lvs 


Gly 


Ser 


Ser 






355 










360 










365 








Glu 


Gin 


He 


Tyr 


Glu 


Glu 


Asn 


Ala 


Trp 


Asn 


Arg 


He 


Val 


Val 


Met- 


Thr 




370 










37 5 








380 










Ala 


Ala 


Glu 




























385 
































(2) 


I M FORMAT I ON 


FOR 


SEQ 


ID HO: 31 4 

















10 



15 



20 



25 



30 



35 



40 (iJ SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 195 amino acids 
(D) TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (ii) MOLECULE TYPE: protein 

(til) HYPOTHETICAL: YES 

tvi) ORIGI/IAL SOURCE: 
50 <A) ORGANISl^l: Porpliytomona^ gingival is 

tixl FEATURE: 

(A) MAI-IE/KEV: misc_feature 
(S) LOCATIOI/ 1 . . .195 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 314 



Val 


He 


Gly 


He 


He 


Met 


Glu 


Phs 


Phe 


Met 


Lsu 


Phe 


He 


Ala 


Ala 


Val 


1 








5 










10 










15 




Phe 


Val 


Asn 


Aen 


Val 


Val 


Leu 


Ser 


Gin 


Phe 


Leu 


Gly 


He 


Cys 


Pro 


Phe 








20 










25 








30 






Leu Gly Val 


Ser 


Lys 


Lys Val 


Asp 


Thr 


Ser 


He 


Gly 


Met 


Gly Ala 


Ala 






35 










40 










45 








Val 


Thr 
50 


Phc 


Val 


Lev J 


Ala 


Leu 
55 


Ala 


Thr 


Leu 


Val 


Thr 
60 


Phe 


Leu 


He 


Gin 


Lys 


Phe 


Val 


Leu 


Asp 


Arg 


PhG 


Gly 


Leu 


Gly 


Phe 


Met 


Gin 


Thr 


He 


Ala 


65 










70 










75 










80 


Phc 


He 


Lftu 


Val 


He 
85 


Ala 


Ala 


Leu 


Val 


Gin 
90 


Het 


Val 


Glu 


He 


He 

95 


Leu 


Lys 


Lys 


Val 


Ser 


Pro 


Pro 


Lsu 


Tyr 


Gin 


Ala 


Leu 


Gly 


Val 


Phe 


Leu 


Pro 








100 










105 








110 






Leu 


He 


Thr 


Thr 


Asn 


Cys 


Cys 


Val 


Leu 


Gly 


Val 


Ala 


He 


Leu 


Val 


He 






115 










120 








125 








Gin 


Lys 
130 


Asp 


Tyr 


Thr 


Leu 


Leu 
135 


Gin 


Ser 


Phe 


Val 


Tyr 
140 


Ala 


He 


Ser 


Thr 



SUBOTTUTE SHEET (Rule 26) (RO/AU) 



^ri, ^ r<«™ ti-M. «t! tpNi ^ ^ "t^t if'i? iPii 

Oi 9 -o «aLic:: o ctm i« Oi o nj^ 



wo 99/29870 PCT/AU98A>ia23 

235/490 

Ala lie Gly Phe Tlir Leu Ala Met Val Thr Phe Ala Gly lie Arg Glu 
145 150 1S5 160 

Gin I.su A.-^p Met Thr Asn Leu Pro Lys Ala Met Ly£ Gly Tie Pro Ser 
165 170 175 

5 Ala Leu Leu Ala Ala Gly He Leu Ala Het Ala Phe Met Gly Phe Ser 
IBO 185 190 

G}y Tl- Ais 

155 

10 (2) IHFORJIATION E^OR SEO ID NOi 315 

(i) SEQUEMCE CHARACTERISTICS: 

I A) LENGTH: 87 6 amino acids 
(E) TYPE: amino acid 
15 (D) TOPOLOGY: linear 

Uil nOLECULE TYPE: protein 

(lii) H-i'PaTHF.TICAL: YES 



20 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



tvi) ORIGIHAL SOURCE; 

(A) ORGANISM: Porphyromonaff gingival is 



fix) FEATURE: 
25 (AJ HAJrE/KEY; raise feature 

IB) LOCATIOH 1. . .B7€ 



(Xit SEQUEUCS DESCRIPTIOH: SEQ ID IIO:31S 



Tyr 


Arq 


Ser 


Tyr 


Arg 


Gly 


He 


Gly 


Ser 


Gly 


Thr 


His 


Ser 


Pro 


Asn 


Leu 


1 








5 










10 










15 




Lys 


Asn 


Arg 


Leu 


Lys 


Arg 


He 


Gly 


He 


Arg 


He 


Pro 


Asn 


Arg 


His 


Tyr 








20 










25 










30 






lie 


His 


He 


Lys 


Pro 


He 


Lys 


Pro 


Lys 


Asn 


Lys 


Met 


Lys 


Gin 


Leu 


Asn 






35 




















45 








lie 


He 


Ser 


Phe 


He 


He 


Ala 


Phe 


Leu 


Phe 


Lsu 


Gly 


Thr 


Ser 


Ala 


Ser 




50 










55 










60 










Ala 


Gin 


Gin 


Ser 


Gly 


Gly 


Ser 


Val 


Thr 


Gly 


Thr 


Val 


Val 


As'p 


Lys 


Ser 


65 










70 










75 










00 


Ser 


Lys 


Glu 


Pro 


He 


Ala 


Tyr 


Val 


Gin 


Val 


Phe 


Val 


Lys 


Gly 


Thr 


Thr 








85 










90 










95 




Leu 


Gly 


Thr 


Ser 


Thr 


Asp 


Ala 


Aan 


Gly 


Asn 


Tyr 


Ser 


He 


Lys 


Gly 


He 








100 










105 










110 






Pro 


Ser 


Gly 


Asn 


Gin 


Thr 


He 


Val 


Ala 


Arq 


Leu 


HeL 


Gly 


Tyr 


Ser 


Thr 






llli 










120 










12b 








cys 


Glu 


Glu 


Lys 


VaX 


His 


He 


Glu 


Lys 


Gly 


Gly 


Ser 


Arg 


His 


Val 


Asp 




130 










135 










140 










Leu Tyr 


L^u 


Thr 


Glu 


G3u 


He 


Leu 


Ser 


Leu 


Asp 


Gly 


Val 


Val 


Val 


Sec 


145 










150 










155 










160 


Ala 


Asn 


Arg 


Asn 


Glu 


Thr 


Phe 


Arg 


Arg 


Gin 


Al a 


Pro 


Ser 


Leu 


Val 


Thr 










165 










170 










175 




Val 


Leu 


Ser 


Pro 


Glu 


Lsu 


Phe 


Leu 


Lys 


Thr 


Asn 


Ser 


Thr 


Asn 


Leu 


Ser 








180 










185 










190 






Gin 


Gly 


Leu 


Lys 


Phe 


Gin 


Pro 


Gly 


Leu 


Arg 


Val 


Glu 


Asp 


Asn 


Cys 


Gin 






196 










200 










205 








Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 


Arg 


He 


Asn 


Gly 


Leu 


Glu 


Gly 


Ala 


Tyr 




210 










215 










220 










Ser 


Gin 


He 


Leu 


He 


Asp 


Ser 


His 


Pro 


He 


Phe 


Ser 


Sec 


Leu 


Ala 


Gly 


225 










230 










235 










240 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


Met 


Pro 


Ala 


Asn 


Met 


He 


Glu 


Arg 


Val 


Glu 










245 










250 










255 




Val 


He 


Arg 


Gly 


Gly 


Gly 


Ser 


Ala 


Leu 


Phe 


Gly 


Ser 


Asn 


Ala 


Val 


Gly 








260 










265 










270 






Oly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Glu 


Pro 


T,C;U 


Arg 


Asn 


Ser 


Ala 


Glu 




275 










280 










285 








He 


Ser 


His 


Ser 


Thr 


Met 


Thr 


Phe 


Asp 


His 


Ala 


Lys 


Gly 


Trp 


Gly 


Ser 




290 










295 










300 










Phc 


Gin 


Asn 


Thr 


Thr 


Gin 


Phe 


Asn 


Gly 


Ser 


Met 


Leu 


Thr 


Glu 


Asp 


Arg 


305 










310 










315 










320 


Lys 


Ala 


Gly 


Val 


Met 


Val 


Phe 


Gly 


Gin 


His 


Asn 


Tyr 


Arg 


Pro 


Gly 


Gin 










325 










330 










335 




A£p 


He 


Asp 


Gly 


Asp 


Asn 


Phe 


Thr 


Glu 


Leu 


Pro 


Asn 


Leu 


Arg 


Asn 


Arg 








340 










345 










350 








Leu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Tyr 


Lys 


Thr 


Gly 


Leu 


Tyr 


Ser 


Lys 


Ala 






355 










360 










365 
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Til r 




ol u 


T" ■ r 


His 


Ser net 


Gin 


Glu 


Tyr 


Arg 


Arg 


Gl v 


Gly 


Arg 




370 








375 
















L*u 


r\£p 


n 


t*ro 


Pco 


Ph- Giu 


Ala 


Gin 


1 1 * 


Ala 


Glu 


Tyr 


l.eu 


-1 


385 










300 








3 r5 








4 O'' 


T\'r 


lie 


Ajrn 




Gly 


Ser Phe 


Lys 


Phe 


A-p 


Gin 


Lily 


Phe 


Ser 


Gly Gi^- 










4 05 








410 










415 


**> - 


Aip 


Phe 


Phe 


Ser 


Lsru Tyr 


Ala 


Ser 


Ala 


uin 


A^p 


Val 


Gin 


Arg Arg 
















425 










430 


Ser 




T'T 


> 


Gly Gly Asp Tyr Thr 


Glu 


Asn 




Leu 


Asn 


Gl y Ai a 






4 35 








440 










445 






Val 


Gin 


See 


Gl y 


Ser 


Thr Glu 


Ser 


Asp 


Glu 


Tyr 


A^n 


Asp Ala 


Phe Thr 




^ SO 








455 










4 60 








Ala 


I.cu 


Th r 


er 


Tyr 


Gly Thr 


Thr 


Lys 


Gly 


Phe 


Asp 


Leu 


Gin 


Gly Gly 


«l 65 










470 








475 








480 








rg 


Hie 


Thr Phe 


Gly 


Glu 


Asn 


Trp 




Phe 


Thr 


Gly Gly 
















490 










4 95 


Leu 


Glu 




He 


Tyr 


Gly Gin 


Leu Asp 


Asp 


Arg 


Ser 


Gly 


Tyr 


Arg Pro 








500 








505 










510 




r 


Lys 


lie 


Arp 


Gin 


Asn Thr 


Ser 


Thr 


Pho 


Scr 


oln 


Tyr Arp 


Gin IjSu 






Si 5 








520 










525 










LSfS 


Thr 


GHi 


Ly5= Leu 


Ser 


Ala 


Leu 


He 


Gly 


Ala 


Arg 


lie Asp 












535 










540 








T"i-r 


Va 1 


Leu 


Leu 


A^n 


Gin A^p 


Gly 


Lys 


Arg 


Tyr 


lie 


Asp 


Pro 


Leu Phe 












550 








555 








560 


lie 


Phe 


Ssr 


Pro 


Arg 


Ala A^n 


Val 


Arg Tyr 


Asn 


Pro 


Asn 


Lys 


Asn Leu 










565 








S70 










575 






Arg 


Leu 


Ser 


Tyr Sfer 


Glu 


Gly 


Phc- 


Arg 


Ala 


Pro 


Gin 


Tyr Phe 








500 








585 










590 




Af p 


CI u 


Affp 


Leu 


nis 


Val Glu 


Leu 


Ala 


Giy 


Gly 


Thr 


Pro 


He 


Ser Arg 






595 








600 










605 






Val 


Leu 


Ser 


Pro 


Asn 


Leu Lys 


Glu 


Glu 


Arg 


Ser 


Arg 


Ser 


He 


Ser Ala 




610 








615 










620 












Asp 


Tyr 


Tyr 


His Arg Ala Asp Glu Trp 


Git) 


Phe 


Asn 


He Met 


625 










630 








635 








640 


Gl y 


Gl u 


Ala 


P)ie 


Ser 


Thr Phe 


Tie 


Ser 


Asn 


Gin 


Phe 


Lys 


Pro 


Ser Asp 










645 








650 










655 


I.ys 


Val 


Glu 


Thr 


Thr 


Ser Asp Gly Lys 


Glu 


Trp 


lie 


He 


Arg 


Thr He 








660 








665 










670 




Tyr 




Asp 


Lys 


Asp 


Gly Val 


Ser 


Lys 


Val 


Tyr 


Gly 


Val 


Asn 


Leu Glu 






67 5 








680 






685 






Gl" 


Axg 


11$ 


Al a 


Tyr 


Asn Lys 


Ser 


Phe Asp 


Leu 


Gin 


Leu 


Gly 


Giy Thr 




6 90 








695 










700 






Trp 


Gin 


Arg 


Ser 


Arg Tyr Gly Ser 


He 


Tyr 


Thr 


Ala 


Val 


Glu 


Ala Asp 


705 










710 








715 








720 


Lyc 


Thr 


Thr Gly 


Gin 


Ala Glu 


lie 


Ser 


Vai 


Lys 


Asp Tyr 


Val 


Arg Thr 










725 








7 30 












Pro 


Asn 


Leu 


Tyr 


Gly Tyr Phe 


Val 


Ala 


Thr 


Val 


Arg 


Pro 


Thr 


Glu His 








740 








745 










750 




Phc 


Ala 


Tie 


fKsn 


Leu 


Ser Gly Thr 


Phe 


Thr 


Gly 


Lys 


Met 


Asp 


Val Val 






755 








760 










765 




His 


Glu 


Ala 


Tyr 


Glu Gly Asp 


lie 


Pro 


Ala 


Glu 


His 


He 


Ala 


Pro Asp 




770 








775 










780 






Gly 


Ser 


Phe 


Asp 


Phe 


Glu Met Asn Gly Gin 


Gin 


Phe 


Lys 


Gly 


Leu Ala 


785 










790 








7 95 




900 


Glu 


Gly 


His 


Ala 


Lys 


Leu Val 


Lys 


Thr 


Pro 


Ala 


Phe 


Ala 


Asp 


He Asp 










605 








eio 










815 


Leu 


Lys 


Leu 


Ser 


His 


Asp Phe 


His 


Leu 


Ala 


Ser 


Thr 


Met 


Thr 


Leu Glu 








820 








825 










830 




L«u 


Asn 


Ala 


Gly 


lie 


Gin Acn 


He 


Phe 


Asn 


Ser 


Tyr 


Gin 


Lys 


Asp Thr 






835 








840 










845 






Arp 


Lys 


Gly 


Pro 


Gly Arg Ala 


Ser Thr Tyr 


Val 


Tyr 


Gly Pro 


Met Gin 




850 








855 










860 








Pro 


Arg 


Arg 


Tie 


Ph« 


Val Gly Thr 


Lys 


He 


Asn 


Phe 








065 










870 






87 5 










(21 


IttFORHATIOH 


FOR 


SEQ ID irO:316 















10 



15 



20 
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30 



35 
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45 



50 



55 



60 



65 



(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH; 899 amino acidr 
70 (B) TYPE: amino acid 

<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

75 (11 H HYPOTHETICAL: YES 
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(vi) ORlr;TllAL -"-Ol'RCE: 

lA) OR-SAlilStI: Porphytcmon^s glnglvall., 
5 ( i I FEATURE : 

(A) IIAIIK/KEY: misc feacur- 
IBl LOCATIOJJ 1...899 

10 SEQUEIICE DESCKIPTIOII: SEQ ID 110:316 

11. Leu A.n Hi. L.u Arg Ly. Thr M,t Tyr Lys Ly, He lie Ala V.l 
Ala Ala L,u Phe cy, Ala Se. ciy j.,, ^ly oi.. ^e'. Se. 

15 Asp .cu T.,. P.o Oln .-V^p Th. II. iyr ser Pro Olu lie Ser Tyr Ala 



Ly, Pro 11, Hi. ,y, rhr lie Ala S« He Glu He Glu Gly Met Arg 



20 '^'^ '"'^ 1^2^ "^^^ ^" Ser «y Leu Ala Val Gly 



A.P Glu val Leu 11, p„ BO^ 
lie Met A.g Gl„ oly Ty. P.e Ser A,„ Val Arg ,le He Ala Asp Ly. 
25 Tyr va, Gly A,.- Lys Val Tyr Leu Ly^ He He Val T..r ^L" Arg Pro 
Arg He Ser Ly. Val Thr Phe Ser Gly Val Ly, Ly, III Glu Arg Glu 
30 ^41 "^-^ Leu Arg Glu Gly H^e" Gin Her Thr Arg 

Asn As„ Glu Asp Ly. Val Arg Gin He Val Gl'n Ly, Tyr Phe Ser Gl'u 



Ly= Gly Tyr Arg Ala Ser He Arg III Thr Gin Glu Pro '^4 Leu 

35 ser Ly. A.p Gly Phe Val A,n Val LeS He Ser He Glu HI Ly. Se 



200 



Ly, Thr Ly. Val A,n Glu He Tyr Phe Ser Gly A.„ "s Ala Leu Ser 
40 «S '^'^ Lys A„ Thr Ala Ly, Phe Ser 

Leu Arg Ly3 Hi. He Arg Ser Ser Phe Leu Lyf Leu Phe Ser Thr nit 
Lys Phe val Glu G.u Ser Tyr Arg «lu L'" Leu Val Arg Leu III Glu 
45 Ly. Tyr Gin Glu Tyr Gly Tyr Arg A.p Ala Glu He Leu Thr Asp Ser 
val Val Ly. Ala Pro A.p Gly Ly, Arg Val A.p He ^y^ Leu A,n He 
50 %t ""'^ ''•'■^ III "-y^ ^" "he val Gly A.n 



ser Gin Tyr Pro Ser Glu Tyr Leu Glu Arg Val Leu Gly He Ly, Ser 
Oly A,p val Tyr A.„ Gl„ Arg Arg Leu lit Lys Arg Leu Asn lit A.p 
55 Glu ASP Ala V,. Gly A.„ Leu Tyr Tyr A.„ Asn Gly Tyr III Phe Ala 
rrp val A.P Pro Val Glu Thr A.„ Val Val Gly A..p III Val Ser Leu 
60 ^8? '^'^ I-y- Gin Ala A^n He Asn Lys Val H» 

lie Ly. Gly A.n Thr Val Val Tyr Glu A.p vfl Val Arg Arg Glu ieu 
Tyr Thr Ly. Pro Gly Gin Leu Phe Ser Arg Glu A.p He He tit Ser 
65 H, Arg Leu He A.n Gl.. Leu Gly ill p., 3,, 

Pro Arg Pro He Pro A.„ Pro Glu Thr Gly Thr Val til He Glu Tyr 



70 Tsl !" ^« '^P «n =l".Le*u ser Val Gly Trp 



ser Gin Ser G,y Leu £eu Phe Arg Gly Ala JI^Lys Phe Thr A«n Z 



ser Val Gly A.„ Leu „1» Pro ser Tyr Lys Lys Gly ill h, 

75 Pro Gin Gly A.p Gly Gin Thr Leu Ser Leu Ser Ala Gin T^r Asn Gl 



y 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



70 







515 




















52S 










Tyr 


Tyr 


G-1 n 


Gin 


Tyr 


S?^r 


Val 


Thr 


t'l-.e 


list 


Asp 


Pro 


Trp 


Phe 


Gly 




530 










535 










5-JO 












Lys 


Arg 


fro 


Asp 


Met 


Fhe 


Ser 


Fhc 


Scr 


Ala 


Phe 


Tyc 


Ser 


Lys 


Thr 


54 S 










550 










555 










560 


Thr 


Alu 


lie 


Asp 


S^r 


Ly.-? 


Ph- 


Tyr 


Asn 


Jer 


Asn 


Ala 


Gly Asn Tyr Tyr 










565 










57 0 










575 




Arn 


Al a 


Tyr 




A^n 


Ser 


Tyr 


Tyr 


Asn 


Asn 


Tyr 


Atn 


Ser 


Tyr 


Tyr 


Asn 








580 










5Bb 










590 






Glr 


Met 


Str 


Af n 


Tyr 


Thr 


CI y 


Asp 


Leu 


Tyr 


Thr 


Gin 


Ala 


Ser 


Asp 


Pro 






595 










600 










€05 








A?p 


Arg 


Ser 


Leu 


Gin 


Met 


Leu 


Gly 


The 


Ser 


He 


Gly 


Tyr Gly Lys 


Arg 




610 










615 










620 










L*iu 


The 


Trp 


Pro 


Asp 


Asn 


Trp 


Phe 


Gin 


He 


Tyr 


Thr 


Set 


Leu 


Asn 


Tyr 


625 










€30 










635 










640 


Thr 


Tyr 


Tyr 


Arg 


Leu 


Arg 


Asn 


Trp 


Ser 


Tyr 


Asn 


Thr 


Phe 


Gin 


Asn 


Phe 










64 5 










650 










655 




His 


Hit- 


Gly 


Ser 


Ala 


Asn 


Asp 


Leu 


Asn 


Leu 


Glu 


Leu Arg 


Leu 


Ser 


Arg 








660 










665 










670 




Thr 


Ser 


lie 


Asp 


Asn 


Pro 


He 


Tyr 


Thr 


Arg 


Ser 


Gly 


Ser 


Asp 


Phs 


Het 






675 










680 










685 








Val 


Ser 


Val 


Ala 


Ala 


Thr 


Leu 


Pro 


Tyr 


Ser 


Leu 


Trp Asp Asn 


His 


Asp 




6?0 










695 










7 00 










Tyr 


Ala 


Ser 


Gin 


Asn 


Leu 


Ser 


Val 


ser 


Asp 


Arg 


Tyr Arg 


Phe 


He 


Glu 












710 










715 










720 


Tyr 


His 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


Arg 


Val 


Phe 


Thr 


Pro 


Leu 


Lt^u 


Asn 










725 










730 










735 




Pro 


Ala 


Thr 


His 


Lys 


Tyr 


The 


Pro 


Vai 


Leu 


Met 


Ser 


Arg 


Val 


Glu 


Gly 








740 










745 








750 




Aid 


Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 


Asn 


Lys 


Lys 


Ser 


Pro 


Phe 


Gly 


Thr 






755 










760 










765 






Phe 


Tyr 


Het 


Gly 


Gly 


Asp 


Gly 


Met 


Ser 


Ser 


Tyr 


Tyr Gly Gly Tyr Met 




770 










775 










700 










Asn 


Glu 


Thr 


lie 


Gly 


Leu 


Arg 


Gly 


Tyr 


Lys 


Asn 


Gly Ser 


He 


Ala 


Gly 


785 










790 










7 95 










800 


Arn 


Asn 


Tyr 


Asp 


Tyr 


Ala 


Tyr 


Ala 


Tyr 


M»t 


Arg 


Leu 


Thr 


Met 


Glu 


Leu 










805 










810 










815 




Arg 


Phe 


Pro 


lie 


Leo 


Phe 


Glu 


Asn 


Ser 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 








820 










825 










830 






Phe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp 


Arg 


Ser 


He 


Asp Asn 


Tyr 


Asn 


Pro 






835 










840 










845 








Phe 


Aen 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly 


Val 


Gly 


Leu 


Arg 


Val 


Thr 


Leu 


Pro 




850 










855 










860 










M«t 


Val 


Gly 


Met 


Leu 


Gly 


He 


Asp 


Trp 


G3y 


Tyr 


G)y 


Phe 


Asp 


Arg 


Pro 


865 










870 










875 










880 


Asp 


Aen 


Ser 


Leu 


Gin 


Arg 


Gly 


Gly 


Ser 


Asn 


Val 


His 


Phe 


Val 


Leu 


Gly 










885 










890 










6 95 


Gin 


Glu 


Phe 





























(2) IHFORHATIOW FOR SEQ ID NO: 317 



U) SEQUENCE CHARACTERISTICS: 

(Al LENGTH: 177 amino acids 
55 (B) TYPE: amino acid 

ID) TOPOLOGY: linear 

{11 J MOLECULE TYPE: protein 

60 (ill) HYPOTHETICAL: YES 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas glngivalts 

65 tlx) FEATURE: 

(AJ HAHE/KET: miscfeatuce 
(BJ LOCATION 1- . .177 



(xi) Sh^QUEHCZ DESCRIPTIOIl: SEQ ID NO: 31 7 



Gin life Asn Met Asn Gly Asp Met Lys Arg Phe Leu He Leu He Gly 
IS 10 15 

Phe Ala Leu Ala Val Ala Phe Ser Gly Phe Ser Gin Lys Phe Ala Leu 
20 25 30 

75 Val Asp Met Glu Tyr He Leu Arg Asn He Pro Asp Tyr Glu Met Met 
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Asn Glu Gin Leu Giu Gin Val Ser Ly.^ Ly.- Trp Gin Glu He Glu 

^ Ala L.u Glu Asn Glu Ala Gin Ser Met Ty. Ly.- Tyr Gin Ser Asp 

Leu Val Ph^ Leu Sor Al:. Ala Gin Lys Ly.^ Thr Gin Glu Glu Ala 11^ 

Val Ly.^ Lys Glu Gin Gin Ala Ser Giu Leu Ly.^ Arg Lys Tyr Phe Gi"- 

10 Pro Glu Giy gIu Lsu Tyr Lys Ly. ^rg Ser Asp Leu Met Lys Pro He 

120 125 

Gin Asp Glu He Trp Asn Ala He Lys Glu 11. Ala Lys Arg Asn Asn 
135 



15 



20 



25 



60 



65 



70 



Tyr Gin M«t Val Leu Asp Arg Gly Thr Ser Gly He He Phe Ala S-r 
150 

Pro Ser Tie Asp He Ser Asp Leu Val Leu Ser Lys Met Gly Phe Ser 
165 170 175 



Lys 



(2) TNFORflATIOIJ FOR SEQ ID HO: 318 

(i) SEQt'EMCE CHARACTERISTICS • 

{A) LENGTH: 170 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
30 (lii) HYPOTHETICAL: YES 

(Vi) ORIGIKAL SOURCE: 

(A) ORGAIJISMr Porphyroroonas gingival is 
35 I XX) FEATURE: 

<A) IWIE/KEY: misc feature 

(B) IX5CATION 1,..170 

^ ixij SEQUEHCE DESCRIPTION: SEQ ID NO: 318 

Arg He Asn Lys Gin His Glu Met Lys Lys Phe Phe Leu Met Leu Leu 
Met Ala Leu Pro Leu Ser Leu Leu Ala G^n Lys Val Ala Val wll Asn 

45 Thr Glu Glu lie He Ser Lys Met Pro Glu Gin Val Ala IL Thr Lys 
Gin Leu A^n Glu Leu Ala Glu Lys Tyr Arg Leu Asp Leu Lys Ser Met 

50 «^ ""^^ ""^^ ''^^ 7^^ "-^'^ Val Lys Glu Lys Asp 

Ser Leu Leu Glu Asn He Arg Asn Arg Arg Gin Gin Glu Leu Gin l^p 

oo 90 ot 

He Gin Thr Arg Tyr Gin Gin Ser Tyr Gin Thr Met Gin Glu Asp Leu 
Cc AU*. 105 110 

55 Gin l.ys Arg Gin Gin Gin Leu Phe Ala Pro He Gin Gin Lys Val Ala 

120 125 

Asp Ala lie Lys Lys Val Gly Asp Glu Glu Asn Cys Ala Tyr He Met 



Glu Ala Gly Met .,=t Leu ^^r Thr Gly Ala Thr Ma He Asp Leu Thr 

loo ISS 1 

Ala Lys Val Lys Ala Lys Leu Gly He Ly- 
165 170 

(2) INFORMATION FOR SEQ ID NO: 31 9 

il) SEQUEHCE CHARACTERISTICS - 

(A) LENGTH: 828 amino acids 

(B) T'^'PE: amino acid 
(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 
75 (vl) ORIGINAL SOURCE: 



PCT/AU98A)I023 



SUBSTrrUTE SHEET (Rule 26) (RO/AU) 




2 S a, lui oi iir, o u H. J 



wo 99/29870 



PCT/AU98/01023 



240/ 



490 



5 



(A) OkGAllISIl; Pocphyromoi^aE- gin-3lVoii2 

iix) FEAT V RE: 

(A) MWtE/KEY: misc feature 

(B) LOCATION i . . .a'20 



SEQUENCE DESCRI PTIOtI : SEQ ID IJO:31? 



He 


Met 


Lys 


Glu 


Ala 


He 


Pro 


Arg 


LyE 




Ly£r 


Tyr 


He 


Lys 


Leu 


Asn 


1 








5 










10 










15 




Gly 


Il« 


Tyr 


Arg 

20 


Leu 


Ser 


Phe 


He 


Leu 
25 


Leu 


C's 


Cys 


Leu 


Leu 
30 


Cys 


Ser 


Cln 


Ala 


Ala 

35 


Met 


Ala 


Gin 


Gly 


Val 
40 


Arg 


Val 


Scr 


Gly 


Tyr 

45 


Val 


Leu 


Asp 


Arg 


Gly 


Glu 


Lye 


Pro 


He 


Pro 


Phe 


Ala 


Gly 


Val 


Lys 


Val 


Arg Gly Thr 




50 










55 










60 










Gly 


Thr 


Gly 


Ala 


Thr 


Thr 


Asn 


Leu 


Lys 


Gly 


Tyr 


Tyr 


Glu 


Phe 


Arg 


Met 


65 










70 






75 






80 


hys 


Ala 


Thr 


Thr 


A5P 
85 


Ser 


He 


Thr 


He 


Glu 
90 


Phe 


Ser 


Ser 


Met 


Gly 
95 


Tyr 


Gin 


Gly 


Val 


Ser 


Arg 


Ser 


Phe 


Pro 


Ser 


Leu 


Thr 


Lys 


A5p 


Thr Arg 


Leu 








100 










105 










HO 






Af-n 


Vai 


Arg 
115 


Leu 


Ala 


Glu 


Ala 


Glu 
120 


Met 


Glu 


Leu 


Ser 


Ser 
125 


Val 


Thr 


Val 


Gin 


Ala 


Thr 


Lys 


Arg 


Acg 


Leu 


Aisn 


Thr 


Met 


Glu 


Arg 


Val 


Asn 


Thr 


Arg 




1.^0 










135 










140 








A^-p 


Leu 


Arg 


Va] 


Affn 


Ala 


Gly 


Pro 


Thr 


Gly 


Gly 


Val 


Glu 


Ser 


Leu 


11 e 












150 










155 










160 


Ser 


Thr 


Tyr 


Ala 


Gly 


Val 


Thr 


Gin 


Asn 


Asn 


Glu 


Leu 


Ser 


Ser 


Gin 


Tyr 










165 










170 










175 


Ser 


Val 


Arg 


Gly 


Gly 


Ser 


Tyr 


Asp 


Glu 


Asn 


Met 


Val 


Tyr 


Val 


Asn Gly 








180 










185 










190 






Val 


Glu 


Val 


Tyr 


Arg 


Pro 


Leu 


Leu 


Val 


Arg 


Ser 


Ala 


Gin 


Gin 


Glu 


Gly 






195 










200 










205 






Leu 


Ssr 
210 


Phe 


Val 


Af n 


Pro 


Aep 
215 


Leu 


Thr 


Gin 


Ser 


Val 
220 


Gin 


Phe 


Ser 


Ala 


Gly 


Gly 


Phe 


Thr 


Ala 


Asp 


Tyr 


Gly 


Asp 


Lys 


Met 


Ser 


Ser 


Val 


Leu Asp 


225 










230 










235 










240 


He 


Arg 


Tyr 


Lys 


Gin 


Pro 


Gin 


Glu 


Lys 


Glu 


Gly Ala 


Val 


Leu 


Leu 


Gly 










245 










250 










255 




Met 


Leu 


Gin 


Ser 


Ser 


Ala 


Tyr 


Tyr 


Gly 


Sex 


Ser 


Ala 


Gly 


Ala 


Phe 


Ser 








260 










265 








27 0 






Gl n 


Il« 


Thr 


Gly 


Val 


Arg 


Tyr 


Lys 


Ser 


Ala 


Lys 


Ser 


Leu 


Leu 


Gly Thr 






275 










280 










285 








Thr 


A5p 


Thr 


Lys 


Ala 


Glu 


Tyr 


Asp 


Pro 


Tic 


Tyr Ala 


Asp 


Gly Gin 


Thr 














295 










300 








Phe 


Met 


Thr 


Tvr 
I /r 


Arg 


Phe 


Ser 


Pro 


Lys 


Leu 


Sec 


Val 


Ser 


Phe 


Leu 


Gly 


305 










310 










315 










320 


JKsn 


He 


Ser 


Gin 


Thr 


Arg 


Tyr 


Lys 


Phe 


Val 


Pro 


Gin 


The 


Arg 


Glu 


Thr 










325 








330 








335 




Sec 


Phe 


Gly 


Thr 
340 


Leu 


Ser 


Asp 


Ala 


Lys 
345 


Lys 


Leu 


Lys 


He 


Phe 
350 


Phe 


Asp 


Gly 


Gin 


Glu 


Gin 


Asp 


Arg 


Phe 


Leu 


Thr 


Tyr 


Phe 


Gly 


Ala 


Phe 


Ser 


Met 






355 










360 








365 








Asn 


Phe 


Val 


Pro 


Asp 


Asp 


Lys 


Gin 


Arg 


Hie 


Thr 


Val 


Thr 


Leu 


Ser 


Ala 




370 










375 








360 










Phe 


Asn 


Ser 


Asn 


Glu 


Arg 


Glu 


Thr 


Tyr 


Asp 


He 


Gin 


Gly 


Glu 


Tyr 


Phe 


3es 










390 










395 








400 


Leu 


Asn 


Asp 


V^l 


Gin 
405 


Leu 


Gly 


Ala 


Asp 


Gly 
410 


Thr 


Ala 


Ser 


Met 


Ala 
415 


Ser 


Gly 


Sej 


Glu 


Asn 


Ser 


Asn 


Gly 


Leu 


Gly 


He 


Gly 


Arg 


Asn 


His 


Glu 


His 








420 










425 








430 






Aid 


Arg 


Asn 


Arg 


Leu 


Ser 


Tyr 


Arg 


Val 


Leu 


Asn 


Met 


Gly 


Tyr 


Arg 


Gly 






435 










440 










445 


Glu 


Met 


Lys 


Leu 


Asn 


Glu 


Lys 


His 


Arg 


Leu 


Gin 


Ala 


Gly 


Val 


Sec 


Ala 




450 










455 










460 








Gin 


Met 


Glu 


Lys 


lie 


Ala 


Asp 


His 


He 


Ser 


Glu Trp 


Glu 


Arg 


Arg 


Asp 


465 










470 










475 










480 


Ser 


Val 


Gly 


Tyr 


Asn 


Leu 


Pro 


His 


Ser 


Glu 


Thr 


Val 


Leu 


Leu 


Met 


Tyr 










485 










4 90 










495 


Afffi 


Asn 


Leu 


Tyr 


Ala 


Asp 


Thr 


Gin 


Met 


Arg 


Gly 


Thr 


Arg 


Leu 


Ser 


Ala 








500 










505 






510 






Ph« 


Val 


Gin 


Asp 


Arg 


Phe 


A^n 


Phe 


Ser 


Met 


Gly Gly 


Gly 


Thr 


Phe 


Ser 






515 










520 










5Z5 








l.eu 


He 


Pro 


Gly 


He 


Acg 


Ala 


Ser 


Trp 


Trp 


Ser 


Phe 


Asn 


Lys 


Glu 


Leu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



%ji "Tj^i r^i iCii ic; in> « ^ljj n.«ji 



,pii, -ifisi, fl^j, 
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530 53b S40 

Leu Vai Ser Fro Arg Tls S*r Val -Jiy Tyr Ser Pro GJlu Ser Asn Pre 
54 5 550 555 560 

Ala Leu Val Leu Arg Ala Ala Ala Gly Leu Tyr Tyr Gin Ala Pro phe 
5 565 b"/0 57 5 

Tyr L'j's Glu Leu Arg Gin Thr His L/s Asp Ala Glu Gly Asn Arn Val 

bQO 585 590 

Vol VsL Leu A-n Giu Lys lis Arg Ser Cln Gly Ala Phe His He Leu 
595 €00 €05 

10 Al^i Gly Ala Asp Tyr The eh« Glu Met Giy Gly Ar^j Lys Tyr Lys Phc- 
610 615 620 

Thr Ala Glu Ala Tyr Tyr Lys Scr Leu Ph» Aar lie f\sn Pro Tyr He 
625 630 635 6^0 

He Glu Ae-n Vol hys Tie Arg Tyr Leu Gly Glu Asn He Gly Ser Gly 
15 6^5 650 655 

Tyr Ala Ala Gly He Asp Leu Lys Leu Phe Gly Glu Leu Val Pro Gly 

660 665 670 

Val Arp Ser Trp Leu Thr Ala Ser He He Lys Ala Arg Gin Lys Leu 
675 680 685 

20 Asp Gly Tyr Gly Ser Leu Pro Leu Met Asn Ala Pro Thr Tyr Asn Phe 
6?0 €95 700 

Ser Phe Phe Leu Gin Glu Tyr Val Pro Gly Asn Lys Arg He Thr Ala 
705 710 715 770 

Thr Leu Arg Ala Ala Leu Sec Gly Gly Leu t»ro Gin Leu Asn Pro Ser 
25 725 730 735 

Lyf Gly Leu Ser Ser Pro Ala Phe Thr Ala Pro Ala Tyr Lys Arg Val 

740 745 750 

Asp Leu (ily Val Met Tyr Lys Trp Leu A.«^p Pro Asp Asp Ser Phe Ala 
755 760 765 

30 Gly Arg Ser Lyc Trp Leu Met Gly Val Lys Gly Ala Tyr He Gly Aln 
770 775 780 

Asp Leu Phe Asn Leu Phe Asp Het Thr Asn Val Aan Ser Tyr Tyr Trp 
7 85 7 90 7 95 800 

Val Ser Asp Ala Tyr Gin Gin Gin Tyr Ala Val Pro Asn Tyr Leu Thr 
35 805 810 815 

Arg Arg Gin Phe Asn Leu Arg Leu Leu Val Glu Phe 
820 825 



40 



45 



65 



70 



75 



(2 J IMETORJ'IATIOn FOR SEQ ID WO: 320 

(1) SEQOEHCE CHARACTERISTICS: 

(A) LENGTH: 67 9 amino acids 

(B) TYPE: amino acid 
(Dl TOPOLOGY: linear 

lii) MOLECltLE TYPE: protein 

(iii) HYPOTHETICAL: YES 



50 (Vl) ORIGIHAL SOURCE: 

(Al ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: mlsc feature 
55 (B) LOCATlOfJ 1.,.679 



60 



(xi) 


SEQUENCE DESCRIPTION: SEQ 


ZD NO: 320 






Pro He 


Pro 


Phe 


Leu 


Leu 


Asn 


Met 


Tyr 


Ser 


Gly 


His 


His Lys He 


His 


1 






5 










10 






15 




Tyr Pro 


Phe 


Leu 


lie 


Leu 


Leu 


Val 


Cys 


Leu 


Ala 


Phe 


Ala Ala Cys 


Lys 






20 










25 








30 




Sec Val 


Lys 


Leu 


Lys 


Asp 


Ala 


Glu 


Lys 


Ala 


His 


Asp 


Arg Gin Glu 


Tyr 




35 










40 










45 




Thr Lys 


Ala 


Ala 


Asp 


Met 


Tyr 


Asn 


Thr 


Leu 


Tyr 


Arg 


Arg Thr Arg 


Arg 


50 










55 










60 






Lys Gin 


Val 


Glu 


Met 


Lys 


Ala 


Tyr 


Thr 


Ala 


Phe 


Arg 


Ser Gly Glu 


Asn 


65 








70 










75 






80 


Tyr Arg 


Ala 


Ala 


Gly 


Arg 


Gin 


Ala 


Lys 


Ala 


Leu- 


-Arg 


Gly Tyr Leu 


Asn 








85 










90 






95 




Ala Arg 


Arg 


Tyr 


Gly 


Tyr 


Pro 


Asp 


Ser 


Val 


Val 


Leu 


Leu Arg Leu 


Ala 






100 










105 








HO 




Gin Thr 


Tyr 


Gin 


Gin 


Gly 


Gly 


Asn 


Tyr 


Lys 


Glu 


Ala 


Glu Val Leu 


Phe 




115 










120 










125 




Arg Gly Tyr 


Leu 


Glu 


Ala 


Tyr 


Pro 


Lys 


Ser 


Tyr 


Phe 


Ala Ala lie 


Gly 



SUBSTITUTE SHEET (Rale 26) (RCVAU) 
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1 j5 










l-JO 














Gly 


Cyr 




Phe Ai a 


Arg 


Gin 


Gin 


Lys 


Glu 


Tyr 


Pro 


Thr 


Arg 












1 50 








15S 










1 6Q 


T' T 




11* 


Arg 


Arg 


Ala Aia 


Glu 


Trp 




Scr 


Ala 


Arg G l y 


A?p 


Phe 










165 








1 ' 0 










175 




Gly 


Pro 


Ala 


Tyr 


Ala 


Pro Arp 


Ala 


3* r 


Al a 


Leu 


Tyr 


Phe 


Thr 


Ser 


Ser 
















} 85 










190 






Arg 


Ser 


Lys 


Asp A5p 


Gly Leu 


Aa'p 


Ajrn 


Ser 


Sor 


Hc- 


Thr 


Gly Leu 


Lys 






1 95 








200 










205 










Asn 


Aip 


lie 


Tyr 


He He 


Lys 


Arg 


Asp 


Ala 


Gln 


Gly Arg 


Trp 


Gly 




210 








215 










220 










Ar^ 


Pro 


Afp 


Ser 


Val 


Ser Gly 


Gly 


He 


7\sn 


Thr 


Pro 


Trp 


A5p 


Glu 


Gly 


225 










230 








235 










240 


Val 


Pro 


Thr 


He 


Thx 


Pro Asp 


Gly 


Ser 


Thr 


lie Tyr 


Tyr 


Thr 


Leu 


Ala 










24 5 








250 










255 




Gin 


Gin 


Gly Ala 


Asp 


Tyr Asp 


Arg 


Thr 


Val 


Gin 


He 


Tyr 


Ser 


Ala 


Ala 








250 








265 










270 






Arg 


Ser 


Gly 


Glu 


Gly 


Gly Trp 


Ser 


Asn 


Gly 


Ser 


Leu 


Vul 


Asp 


He 


Met 






275 








280 










265 






Arg 


Arp 


Sec 


Leu 


Arg 


Met Ma 


Ala 


His 


Pro 


Ser 


Ket 


Ser 


Ala 


Ser 


Gly 




290 








2?5 










300 








Arp 


Tyr 


Leu 


Tyr 


Phe 


Val Ser 


Asn 


lie 


Gly 


Gly 


Ser 


Tyr 


Gly 


Gly 


Lys 


305 










310 








315 








320 


Afp 


lie 


Tyr 


Arg 


Val 


hys Val 


Ser 


A5p 


Arg 


Ser 


Tyr 


Gly 


Ser 


Pro 


Glu 










325 








330 










335 




ASTi 


Leu 


Gly 


Pro 


Asp 


He A^n 


Thr 


Pro 


Gly 


Asp Glu Het 


Phe 


Pro 


Phe 








340 








345 










350 






11* 


A5p 


Uly Asp 


Ser 


Thr L&u 


Phe 


Phe 


Ala 


Ser 


Asp 


Gly His 


Ala 


Gly 






355 








360 










365 








Leu 


Gly 


Gly Leu 


Asp 


He Phe 


Lys 


Ala 


Thr 


Leu Asp 


Ser 


Thr 


Gly 


Gin 




3*7 0 








375 










360 








Trp 


His 


Val 


Val 


Asn 


Met Gly 


Gin 


Pro 


Val 


Asn 


Ser 


Ser 


Ala 


Asp 


Asp 


385 










390 








395 










400 


Phe 


Gly 


Leu 


Ala 


Val 


Glu Pro 


Lys 


Gly 


Lys 


Asn 


Lys 


Glu 


Glu 


Ala 


Leu 










405 








410 










415 




Pro 


Ajp 


Asn 


Gly Val 


Lys Gly 


Val 


Phs 


Cys 


Ser 


Asn Arg 


Gly Asp 


Ala 








420 








425 










430 






Arg 


Gly 


Trp 


Pro 


His 


Leu Phe 


His 


Phe 


Glu 


Leu 


Pro 


Ala 


He 


Tyr 


Thr 






435 








440 










445 








Glu 


lie 


Gin 


Gly Tyr 


Val Hc-t 


Asp 


Arg 


Glu 


Glu 


Asn 


Pro 


He 


Ala 


Gly 




450 








455 










460 








Ala 


Thr 


Val 


Arg 


He 


Val Gly 


Glu 


Arg 


Gly 


Pro 


Val 


Gly Gin Gly 


Phe 


465 










470 






475 










480 


Val 


Thr 


Thr 


Arg 


Asp 


Asp Gly 


Ser 


Tyr 


Lys 


Met 


Ser 


Val 


Gin 


Gly 


Asp 










465 








490 










495 




Thr 


Arg 


Tyr 


Val 


Met 


Leu Ala 


Gly 


Ala 


Ser 


Gly 


Tyr 


Leu 


Asn 


Gin 


Tyr 








500 








505 










510 




Val 


Glu 


Lfeu 


Lys Thr 


Asp Thr 


Ala 


Lys 


Gin 


Ser 


Glu 


Thr 


Tyr 


Tyr 


Val 






515 








520 










525 






A£?p 


Phe 


Phe 


Leu 


Ala 


Ser Arg 


Glu 


Lys 


Ala 


Glu 


Gly Leu Gin Asn 


He 




530 








535 










540 










Phe 


Tyr 


Afp 


Phe Asp 


Lys Ala 


Thr 


Leu 


Arg 


Pro 


Glu 


Ser 


Met 


Lys 


Ser 


S4 5 










550 








555 








560 


Leu 


Asp 


Glu 


Leu 


He 


Arg He 


Leu 


Thr 


Asp 


Asn 


Pro 


T^p 


He Arg 


He 










565 








570 










575 




Glu 


Leu 


Gly 


Ser 


His 


Al 3 Asp 


Arg 


Lys 


Gly 


Pro 


Asp 


Ala 


Tyr 


Aen 


Leu 








560 








565 










590 






Gly 


Leu 


S*c 


Asp 


Arg 


Arg Ala 


Lys 


Ser 


Val 


Val 


Asp 


Tyr 


Leu 


Thr 


Ser 






595 








600 








605 








Arg 


Gly 


lie 


Ala 


Ala 


Asp Arg 


Leu 


Thr 


Trp 


Lys 


Gly Tyr Gly 


Lys 


Ser 




610 








615 










620 








Val 


Pro 


Lys Thr 


Val 


Thr Ala 


Lys 


He 


Ala 


Glu 


Arg 


His 


Asp 


Phe 


Leu 


625 










630 








635 










640 


Lys 


Glu 


Gly Asp Val 


Leu Thr 


Glu 


Glu 


Phe 


Val 


Ala 


Pro 


Leu 


Thr 


Glu 










645 








650 










655 




Glu 


Gin 


Gin 


Sec 


Val 


Cys Asp 


Gin 


Leu 


Asn 


Arg Arg 


Thr 


Glu 


Phe 


Arg 








660 








665 










670 






Val 


Il« 


Glu 


Glu 


Glu 


Leu Arg 
























67 5 
























12) 


IMFORIIATIOH 


FOR 


SEQ ID NO:321 


















til 


i SEQUENCE CHARACTERISTICS: 

















(Al LEHGTH: 77 2 amino acids 
IB) TYPE: amino acid 



SUBSmrUTE sheet (RuIc 26) (RO/AU) 




wo 99/29870 



243/490 



(D) TOPCLOGV: linear 
(li) MOLECULE TYPE; protein 
5 (iii) HVPOTHETICAL: YES 

(vi) ORIGIIJAL SOVRCt:: 

(A) ORGAiasil: Porphyromonas gingivalis 
10 (ix) FEATURE: 

(A) NA/IS/KEV: mite featur- 
(R) LOCATION 1. . ,7 72 

SEOUEMCE DESCRIPTIOII: SEQ ID NO: 321 

Pro Al. Gin Pro M'e Thr Tyr Cys He Arg Tyr Phe Arg Arg GIu Vai 

Ser Pro Arg Arg Gin Thr Glu Arg Thr IL Tyr Ala A^^p Ly^ htet Arg 

20 Arg HI. II. Val Lys C.^. Cy3 Pro Ser Ala Ser Xle L«u Leu Gin Giu 

Tyr Ser Thr Thr Glu Leu Phe His Hi. rhe A*n iL Ser He He 

55 go 

25 yr '^'^ 

Al. Phe Ala Lys Thr Asp A.„ Val Pro Thr I^p s,r Leu Arg Val Hi. 

90 OR 
A^n l.eu Gin Thr Val Thr Val Tj-r Ser Thr Ar, Thx Ala Val Pro Leu 

30 Ly. ly.. p„ Ala Lys Met Glu tea He Ser Ser Ara He Ly- 

Gin Ser Gly Phe A.n A^n Met Thr A.p He Leu Ly. Thr Gl„ Ser Ser 

35 lis ri'S S« He Gly He Arg 

Gly Phe Ly. Pro Ser Gly Lys Tyr Val Thr "! Leu Val Asn Gly He 
Pro Ala Gly Thr Asp Asn He Ser Thr 11° Thr Ser Asn He Glu 

40 Gin He Glu He Leu Ly. Gly Pro Phe Ser Ser He Tyr Gl'y Tl.r Asn 

200 2ns 
Ala Het Gly Gly Val Val Asn He He Thr His Lys Ser Ly, Asp Lys 

45 2« ""'^ "'^ 2V0 ^''^ f^*" "° ^"^"^ '^i^ 

Gly ser Phe Asn Leu Gly Gly Arg Phe Glu I^p He Phe Ser Phe ^p 
Leu ser Leu Gly Leu Asp Lys Gin Asn Lys Asp Tyr Lys Thr "y Ser 
50 Asn Asn Phe Leu Ser Leu Ser Lys "u Glu Glu Ala He III Asp Val 



Asn Ala Thr Ly. Asn Lys Ly. Met Lys Gly Ser Asp Tyr Thr Val Ala 

55 lol "'^ I'll 'hr Pro Glu Trp ser Leu 

Asn Leu Tyr Gin Asn Val Phe Leu Gly Asp Wa He Pro Val Gly 

330 33S 



ser He Trp Gly Val Tyr Gly Glu Ser Ly= Lys Asn Leu Asn Ar| Ser 
BO Ser Thr ser Phe Glu Leu Leu Gly fys His Gly Cys His Thr Lou Gin 

360 

Phe Ser Pro Tyr Phe Asn He Glu Lys Ser Glu Asn Tyr Asn Asn Ala 

65 lit, ^^-^ Thr Tyr Gly 

Ala Leu Leu Gin Asp Lys He Ser Phe Gly Gl" Gin Asn He Val Leu 
Gly Val Asp ser Arg Asn Met Thr Met gIu Ser Glu Arg Phe gJu Gin 

70 Ala Gly val Asn Thr Ly. Pro Tyr ifn Pro Gly ^Tyr Ala ihr Asn Asn 

11. Gly Leu Phe Gly Gin Ala Asn Phe Tyr Leu Leu i^n A^p Ala Leu 

460 

75 Tel ASP Leu Ly.. 



PCT/AU98A)1Q23 



475 



480 



SUBSTITUTE SHEET (Rote 26) (RO/AU) 



n^i =n™^ tT^j «}J «F=-ti ..i-'* -^^Hi ,«=^ ^F=-^1 -rf™?, 

'"^Sii C3 iCii U™ iOi 113 m ^^.-Ji O ii:^: C3 ILS li^ 



WO 99/29870 

244/ 490 

Ala A.^n Glu Tyr Leu A.n A.n Glu Ala Ly^ Glu Glu Thr Hi^ Acn Val 
n. A.n Pro A.n Val Gly Ly. Tyr gIu Phe Vai Ly. Gly 2!" Thr 

5 Ala Hi, Gly Thr Phe Gly Ssr Ala Phe Ser Ala Pro fcp Ala Phe Gin 
r.y-. Ala Gly Gin Tyr Val Gly Pro Ph« Gly Thr Thr l" Gly Asn Pro 
10 s!? "^"^ '■'"^ Trp Asp piJe Gly He Gly Tyr 

Ser Asn Ala Arg Cy^ Gly n- em Ala Arp Vaf Thr Leu Thr Tyr Phe 

Hi. Thr A.p Hi. Ly. A.p L.u He Leu llr Ser Pro A.p Tyr All A.n 
1tr S85 son 

15 A.^n He Thr Thr Tyr He Asn Ala A^p Ly.^ Ala Arg MbL sir Gly Il« 

Glu Ala Leu Leu Ser Tyr A:.p Phe Gly .Ser Leu Phe Ala A^n Ly^ Ph, 
^15 §20 
20 fsl '''^ ''"■^ Leu A.^n Ser Glu Met 

Ly^ Ly^ Ser Gin Thr A^p Ala Pro Trp s»r Glu Met Tyr Tyr Val A^g 
t.y-' Gin A.n lie Thr Phe Gly 11. Glu Tyr Ar, Gly Lys Glu "y Leu 

rye- DO J 665 G70 

Glu val Met Leu A^n Gly Arg Phe Hct Gly Arg Arg He C,u Gin A.,n 

Trp Tyr Ala Tyr Tyr Pro Glu Val" Arg Pro Glu Leu Gin Gin L*u Leu 

S5S 7Q0 

30 7^5 ^" vio ""^ 

Gin Ala Met Val Phe A=n Ala Ser Ala Tyr ^yr His Met A^n Lys ^yr 

730 -7^^ 
Leu Thr Phe Gly Val Asn Leu Asn Asn He Leu A..p Glu Leu Tyr Thr 

35 Glu Lys A^p Gly Tyr His Met Pro G^y Arg Asn He Met Gly Lys Val 

755 



PCT/AU98/01023 



40 



45 



55 



60 



65 



70 



75 



Met Vai Asn Phe ' ''^^ 

770 

(21 INFORJ IATIOIJ FOR SEQ ID NO: 322 

ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: A&4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(AJ ORGAJaSM: Porphyromonas gingivalis 
(ix) FEATURE: 

(A) HAIIE/KEY: misc feature 



(B) LOCATION 1. . .4 84 
(xi) SEOUEWCE DESCRIPTlor^: SEQ ID NO: 322 



50 
Val Leu 
65 



Ser 


Ser 


Pro 


Tyr 


Arg 


Gin 


Met 


Asp 


Gin 


5 










10 


Aia 


Ser 


Met 


Asn 


Arg 


Phe 


Ser 


20 










25 






Gly 


Phe 


Val 


Leu 


Trp 
40 


Phe 


Val 


Ser 


Ala 


Ser 


Glu 


Thr 


Thr 


Val 


Ser 


Tyr 








55 








Glu 


Ala 


Asp 


Vai 


Leu 


Arg 


He 


Ala 


Val 




70 








75 


Ala 
85 


Asp 


Met 


Asp 


Val 


Arg 
90 


Lys 


Arg 


Ala 


Asp 


Leu 


Phe 


Pro 


Lys 


Val 


100 










105 




Thr 


Leu 


Lys 




Gin 


Val 


Leu 


Tyr 










120 







15 

His Trp Pro 
30 

Ser Arg Thr 
45 



95 
Arn Gly 
110 



US i,n 
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The S^r Ser Ser Glu Giy He, gIu Met Gly Arg Thr His A.-n Thr Gin 

^^'-^ 13S 
Gly Gly V^l A^n VaJ Ser Ilez Pro Leu Val S%r Ala Gin Leu Trp Lys 

£- •'■''^ is*^ 1S5 

5 s*r II* Ala Met Thr Giy Glu Gin Leu Asp t.eu Ala Leu Glu Ly- Aia 
165 

Arg Ser Ser Arg He fKsp Leu Val Ala Glu Val Lys Lys Ala Tyr L-u 
19i> 185 190 

in ^''^ T*Vr Gly Val Phe Lys Arg S-r T-r 

•lU 195 200 20S 

A.<^p Arn Ala Leu Ala Asn TyL Lys Asn He Ser Asp Lys Phe Asp Arg 

- 215 220 

Gly Leu V;.! Ala Glu Tyr A.-p Lys He Arg Ala Asn Val Gin Val Arg 

15 Asm Tie CUi Pro Asn Leu Leu Gin Ala Gin A^n Ser Val Al;, L-u Ala 
245 25C 255 

Leu Trp Gin Leu Lys Val Leu H-t Ser HeC Glu Val Glu Thr Pro II- 

265 270 
Arg Leu Ser Gly Ser Leu Ser Asp Tyr Lys Glu Gin Val Tyr Thr Gly 

280 28b 
Tyr Phe Ala Ala Asp Thr Leu He S(jr A^n Asn Ser Ser Leu Arg Gin 

295 300 
Leu Asp He Gin Arg Arg Leu Ala Val Ser Ala Asp Lys Leu Asn Lys 
oe -^^^ 315 320 

Tyr Ser Phe Leu Pro Thr Leu Asn L-u Gly Gly Gin Tyr Thr T-r S»r 

330 

Leu Asn Ser Asn Asp He Lys Phe Trp Gly Glu Gly Gin Arg Trp Thr 
340 

30 ilr '^^''^ ^^"^ ^^'^ '^Vr He Pro He Phe Asn Giy 

'^^ 355 360 

Gly Lys Arg Leu Tyr Asn Val Lys Gin Ser Ala Leu Ser He Arg Gin 

^'^ 3*75 
He Asp Leu Gin Arg Arg His He Glu Gin Ser He Arg Met Gly He 
390 ' 395 4 00 

35 Lys Asn Gin Asn Asp Arg Leu Arg Thr Cys Met Gin Arg Phe Val ila 
405 410 415 

Ser Glu Glu Ala Val Arg Ser Ala Glu Ly=t Gly Tyr Gin He Ala Glu 

425 430 
Lys Arg Tyr Gin Thr Gly Glu Gly Thr Leu Val Glu Leu Asn Asp Ala 
™ 435 440 

Asp Val Ala Leu Leu Gin Ala Arg Leu Asn Tyr Asn Gin Ala He Phe 
iSS 460 



20 



45 



60 



65 



Asp Phe Met Thr Ala Lys Ala Glu Leu .^sp hyc Met Asn Gly Met Gly 
^''O 415 4flo 

lie Pro Glu Gin 



iZ) INFORMATION FOR SBQ ID HO: 323 



50 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 54 0 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

55 (11) MOLECULE TYPE: protein 

tlii) HYPOTHETICAL: YES 



(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

( ix ) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATIOTJ 1...S4 0 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 323 

Phe His Asn Phe Asp Phe Leu Asn Gly He Lys Leu Phe Ser Met Ala 

70 Asn Asn Thr Leu Leu Ala Lys Thr Arg ^rg Tyr Val Cys Leu vll Giy 
20 25 30 

Phe cys Trp Leu Met Ala Met Met His Leu Ser Gly Gin Glu Val Thr 

75 ^""^ Gin vfl Arg Arg Thr 

55 60 



SUBSTmjTE SHEET (Rule 26) (RCVAU) 
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iL J3 "^t» tOi JL Q KTM m ILJB ilZ^. O U 



WO 99/29870 

246/490 

Leu V.l Ly. Val Ala Leu Ser Glu Ser ,..u _Pro fro Gly Ala Lys Gin 

'5 0.J 



10 



20 



30 



40 



50 



65 



70 



lie Arc He Gly The S.r Leu P.o Ly. Glu Thr Glu Glu Ly. V.l Thr 

Ala Leu Tyr Leu L^u V.l Ser Asp S.c Leu AI. Val Arg A.p L'u Pro 
B T -i-^ - 2 05 110 

ill '^"^ ^-'^ P"^'^ II- Ser Lys Glu 

A.P Arg Thr Th. Al. Leu £cr Ai° A.p s„ Val Ala Gly Ac, A.g Phe 

Phe Tyr Le... Ala Ala A,p ile Gly Pro Val Al3 Ser Ph= Ser Arg Ser 

ASP 'VUr L,u Thr Ala Ar, Val g„. Glu Val il'a Val A.p Gly Arg Pro 

170 



l^^ 155 ISO 

15 L.U Pro L=u Ly5 gIu l,eu s„ p„ Ala Ser Arg Arg Leu Tyr ^ Gly 



Tyr Glu Ala Leu Phe Val Pro Gly "p Gly Gly Ser Arg A-^^ Tyr Aro 

200 20^ 
He Pro Ala II, Leu Ly5 Thr Ala Asn Gly Thr Leu lU Ala Met Ala 

215 220 
Arp Arg Arg Lys Tyr Arn Gin Thr Asp Leu Pro Glu Asp He Asp 11« 

Val Met Arg Arg Ser Thr Asp Gly Gly Lys Ser Trp Ser A^p Pro Arg 



-r —J -j = 

25 lie Ile Val Gl„ Gly Glu Gly Arg A^n His Gly Phe Gly Asp v" Ala 

Leu Val Gin Thr Gin Ala Gly Lys "u Leu HeL lie Phe vl? Gly Gly 

Z/w 280 



val Gly L.U Trp Gin Ser Thr Pro A.p Arc, Pro Gin Arg Thr Tyr lie 



285 

Pro Asp Arq Pro Gin 

295 

Ser Glu S=r Arg Aep Glu Gly Leu Thr Trp Ser Pro Pro Arg A..p He 
Thr His Phe He Phe Gly Lys A^p Cy. Ala ^p Pro Gly Arg Ser ^^g 



35 Tro Leu Aln s^r- pk! r^,.- _. 335 

Leu 

Gly Axg II, Thr Phe Val Ala Ala Vll Arg Glu Ser Gly Ql^ Glu Tyr 



Trp Leu Ala Ser Phe Cys Ala Ser Gly Gin Gly Leu Val Leu Pro Ser 



3^0 34B 
Ala 

3S0 36S 
Tyc Ser Asp Asp Glu 
375 380 



Vil Leu Asn Asn Tyr Val Leu Tyc Ser Asp Asp Glu Gly Asp Thr Trp 



Gin Leu Ser Asp Cys Ala Tyr Arg Arg Gly Asp Glu Ala Lys Leu Ser 
r.ou Met Pro Asp Gly Arg Val Leu Met Ser Arg Asn Gin Gly il^g 

45 Gl,> Glu s.r Arg Gin Arg Phe Phe Ala 
«0 42S 



- 395 400 

Gly 
415 

. Gly 

Thr Trp Glu Arg Ala Lys Gin Phe olZ Gly 11. His Asp "o Gly Cys 



Gin Glu Ser Arg Gin Arg Phe Phe Ala Leu Ser Ser Asp Asp sly Leu 



Asn Gly Ala Met Leu Gin Val lyl Arg Asn «ly Arg tit Gin Val L.u 



His ser Leu Pro Leu Gly P^; Asp Gly Arg Arg ^p Gly Ala Val Tyr 



Leu Phe Asp His Val s.l Gly Arg Trp Ser til Pro Val Val Val "n 

ser Gly Ser Ser Ala Tyr Ser A.p Met Thr Leu Leu Ala Asp Gl'y Thr 

lie Gly Tyr Phe Val Glu Glu Gly Asp Glu He Ser Leu Va^ Phe He 

Arg Phe Val Leu A^p Asp Leu Phe Asp Val Arg Gin 
"0 S3£ ^ 540 

iZ) INF0R1>1ATIW FOR SZQ ID NO: 324 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 293 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

til) MOLECULE TYPE: protein 

Hil) HYPd'HETICAL: YES 



<vl) ORIGINAL SOURCE: 
75 ORGAIilSH: Porphyromonas gingivalls 
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(i;:) FEATURE: 

(A) MAIIE/KEV: miffc_f eaLure 
(P) I.OCATIO!! 1. . .293 

5 (-1) StQUEIICE DESCRIPTION: SZQ IDHO:3Z4 



10 



15 



20 



25 



30 



35 



40 



45 



65 



70 





Ser 


Pro 


Ser 


Arp 


Ser 


Ala 


Leu Pro 


Ser 


Val 


Trp 


Ser 


Ala 


Asn 


Thr 


1 








5 








10 










15 




lis 


Ser 


•jly 


Leu 
20 


Val 


Gly 


Gly 


Lys Arg 
2b 


He 


Thr 


Leu 


Leu 


He 
30 


Leu 


Tyr 


S-r 


Met 


Al a 


He 


Arg 


Ser 


Ser 


Asn He 


Glu 


Ser 


He 


Gin 


Cys 


Phe 


Val 






35 










40 








45 






Met 


Lys 
SO 


Ly< 


Glu 


Lys 


Leu 


Trp 

55 


He Ala 


He 


Val 


Ala 
60 


Gly 


Leu 


Ala 


Phe 


Val 


Leu 


Gly 


Leu 


Tyr 


Ala 


Leu 


Gly Arg 


Ser 


Vai 


Ala 


Gin 


Leu 


Arg 


Arg 












70 








75 








80 


Ser 


Gin 


Pre 


Ser 


Val 


Thr 


Val 


Thr Gly 


Met 


Ala 


Glu 


Arg 


Asn 


Phe 


Lys 










05 








90 








95 


See 


Asp 


Leu 


He 


Val 


Trp 


Thr 


Ala Ser 


Tyr 


Gin 


Leu 


Gin 


Net 


Met 


Asp 








ICQ 








105 










110 




Lfiu 


Glu 


Ser 


Ala 


Tyr 


Lys 


Ala 


Leu Lys 


Glu 


Lys 


Gin 


He 


Leu 


Val 


Ala 






115 










120 






125 










Tyr 


Leu 


hys 


Asn 


Lys 


Gin 


Leu Pro 


Asp 


Ser 


Ser 


Tyr 


He 


Phe 


Ser 




130 










135 






140 








s-r 


Val 


Ala 


He 


Ser 


Lys 


Glu 


T^T Asn 


Tyr 


Tyr Tyr 


Asp 


Pro 


Arg 


Gin 


1 4S 


Gin 








150 








155 






] 60 


GIu 


Asn 


Val 


Arg 
165 


Thr 


Phe 


Ala Gly 


Tyr 
170 


Leu 


Leu 


Ser 


Gin 


Thr 
175 


Val 


Thr 


Val 


Thr 


Ser 


Gin 


Asp 


He 


Glu His 


Val 


Glu 


Lys 


Ile 


Scr 


Arg 


Asp 








lao 








185 










190 


He 


Thr 


Glu 


Leu 


IlG 


Asn 


Gin 


Gly Val 


Glu 


He 


Thr 


Ser 


Asp Arg 


Pro 






195 










200 








205 








Ala 


Tyr 


Tyr 


Tyr 


Thr 


Lys 


Leu 


Asn Asp 


Leu 


Lys 


Val 


Glu 


Met 


Leu 


Arg 




210 










215 








220 








Pisn 


Ala 


Ser 


Glu 


Asp 


Ala 


Phe 


Asn Arg 


Ala 


Ser 


Val 


He 


Ala 


Glu 


Gly 


225 










230 








235 










240 


Ser 


Gly 


Ser 


Ser 


Val 


Gly 


Lys 


Met Leu 


Ser 


Ser 


Ser 


Met 


Gly 


Val 


Phe 










245 








250 








255 




Gin 


lie 


Val 


Gly 


Leu 


Asn 


Ser 


Asn Glu 


Asp 


Tyr 


Ser 


Trp 


Gly Gly 


Ser 








260 








265 








27 0 






Phe 


Am 


Thr 


Ser 


Ser 


Lys 


Met 


Lys Thr 


Ala 


Ser 


He 


Thr 


Val 


Lys 


Ala 
















280 








285 






Ser 


Phe 
290 


Ala 


Leu 


Lys 























(2) IHFORIiATION FOR SEQ ID HO: 325 



(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 290 amino acids 
50 tB) TYPE: amino acid 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

55 (ilil HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(Al ORGANISI^: Porphyromonas glngivalls 

60 (ix| FEATURE: 

(A) NAME/ KEY: inlsc_feature 
(B» LOCATION 1 . . .280 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325 

Gly Lys Leu Gin Met Lys Lys Thr He Ala He He Ala Ser Ala Leu 
15 10 15 

Leu Ala Leu Gly Ala Val Gly C^^ Lys Lys Asn Ala Asp Thr Thr Ala 

20 25 30 

Val Ser Glu Lys Asp Ser He Ala Leu Ser Mer Gly He Leu Tvr Glv 

35 40 45 

Gin Asp Phe Ala Asn Gin Phe Glu Met Ser Arg Leu Gin Gly Gin Pro 

50 SS 60 

He Asp Ser Val Ala Phe Leu Asp Gly Phe Lys Tyr Gly He Asp Thr 
65 70 75 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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45 



50 



55 



60 



65 



70 



75 



Thr 


Arg 


Phe 


Sei 


Tyr 


Afrn 


Leu 


Gly 


Ala 


He 


Tyr 


Ala 


Ser 


Ai-n 


He 


Ala 


































Ara 


Gin 


Leu 


ALa 


llis 


Asp 


Ser 


He 


f^p 


:ie 


Asp 


Lys 


Phe 


Tyr 


Ala 


Ala 








IQO 










105 








110 






Met 


Arg 


Ala 


Ala 


Leu 


Leu 


Ky^ 


Asp 


Thr 


Val 


Ser 


He 


Ala 


Met 


Lys 


Pec 






115 










120 










125 








Ala 


p 


Ala 


Gin 


Ala 


Fhe 


Met 


Gin 


Ar^ 


He 


Gin 


Ala 


Lys 


Lys 


Gin 


Arq 




130 










13S 










140 








Glu 


Asn 


Am 


Met 


L^'ij 


Gin 


Phe 


Gly 


Gin 


Asr. 


lift 


Glu 


Lys 


Gly 


Asn 


Glu 


145 










150 










J 55 








160 


Tyr 


lie 


Asp 


Thr 


Phe 


Lys 


Lys 


Glu 


Asp 


Gly 


Val 


Thr 


Val 


Thr 


Thr 


Thr 










165 










170 










175 




01 y 


Leu 


Ala 


Tyr 


Lys 


Thr 


Leu 


Gin 


Glu 


Gly 


Thr 


Gly 


Ala 


Thr 


Pro 


Ser 








180 










185 










190 






L«u 


Ala 


Asp 


Thr 


Vril 


Arg 


Val 


Lys 


Tyr 


Val 


Gly 


Thr 


Leu 


Val 


Asp 


Gly 
















200 


















LVS 


Glu 


fhe 


Asp 


Lys 


Asa 


Glu 


Glu 


Gly 


He 


Glu 


Phe 


Ala 


Val 


ThE 


Gly 




210 










215 










220 










Val 


lit; 


Lys 


Gly 


Trp 


Thr 


Glu 


Met 


Leu 


Gin 


Leu 


Met 


Lys 


Val 


Gly 


Gin 


225 










230 










235 








240 


Lys 


Val 


Arg 


Val 


Val 


He 


Pro 


G] n 


Glu 


Leu 


Ala 


Tyr Gly Glu 


Thr 


Gly 










245 










250 










255 




Asn 


Tyr 


Thr 


lie 


Glu 


Pro 


Phe 


Ser 


Thr 


Leu 


Thr 


Phe 


Glu 


Met 


Glu 


Leu 








260 










265 










270 






He 


Gly 


He 


Lys 


.Pro 


Gly Lys 


Lys 
































280 


















(2) 


IllFOWlATIOti 


FOR 


SZQ 


ID MO: 320 

















10 



15 



20 



25 



30 (t) SEQUEUCt; CHAEUJ^CTERISTICS: 

(Al LENGTH: 803 amino acids 
IB} TYPE: amino acid 
ID) TOPOLOGY: Unear 

35 (li) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
40 lA) ORGANISM: Porphyromonas gingivalis 



(ixj FEATURE: 

(Al NA1>(E/KEY: misc_feature 
(B) LOCATION 1. . .803 

<xi) SEgUEWCE DESCRIPTION: SEQ ID MO: 326 



Cys 


Arg 


Lys 


Gin 


Lys 


Leu 


Ser 


Asn 


Pro 


T - 
— J 


Arg 


Ser 


Arg 


Tyr 


Pro Cy^ 


1 








5 










IC 










15 


Arg 


Arg 


He 


Ser 


Phe 


Arg 


Gin 


Ser 


Asp 


Lys 


Asn 


Pro 


Met 


Lys 


Val Leu 








20 










25 










30 




Arg 


Gin 


Val 


Phe 


Lou 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Leu 


Thr Gly 


Ala Cys 






35 










40 










45 






Ser 


Thr 


Thr 


Lys 


Asn 


Leu 


Pro 


Glu 


Gly 


Glu 


Gin 


Leu 


Tyr 


He 


Gly Met 




50 










55 










€0 








Gly Lys 


Thr 


Gin 


He 


Leu 


Arg 


Gin 


Asp 


Lys 


Ser 


His 


Ala 


Gly 


Gin Gin 


65 










70 








75 








80 


Ala 


Leu 


Thr 


Glu 


Val 


Glu 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 


Pro 


Asn 


Gly Ala 










85 










90 










95 


He 


Phs Gly 


Ser 


Ala 


Ser 


Ala 


Ser 


Leu 


Pro 


Lys 


He 


Pro 


Phe 


Gly Leu 








100 










105 








110 




Trp 


Leu 


Tyr Asn 


Ser 


Phe 


Val 


Gly 


Asp 


Ser 


Thr 


Val 


He 


Ser 


Lys Trp 






115 










120 










125 






He 


Phe 


Asp 


Lys 


Phe 


Ala 


Ala 


Lys 


Pro 


Val 


Phe 


He 


Ser 


Gin 


Val Lys 




130 










135 










140 








Ser 


Asp 


Ser 


Arg 


Ala 


Lys 


Val 


Ala 


Thr 


Asn 


He 


Leu 


Arg 


Glu 


His Gly 


145 










150 










ISS 








160 


Tyr 


Phe 


Asp 


Ala 


Lys 


Val 


Lys 


Ser 


Ser 


Val 


Thr 


Thr 


Leu 


Lys 


Lys Asp 










165 










no 










175 


Ser 


Leu 


Lys 


Ala 


Lys 


He 


Ser 


Tyr 


Thr 


Val 


Asp 


He"t 


Ala 


Ser 


Pro Tyr 








180 










185 










1?0 




Hie 


Tyr Asp 


Ser 


He 


He 


Pro 


Leu 


Pro 


He 


Ser 


Thr 


Phe 


Pro 


Asp Ser 






195 










200 










205 






He 


Leu 


Ala 


Tyr 


Arg 


Gin 


Thr 


Pro 


Ser 


Leu 


He 


Arg 


Lys Gly 


Asp Gin 




210 










215 










220 
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Ph, «5n L.U Al. Lis Lcu Hi.. Glu Glu Arg Gin Thr Ue Sec Al* L«u 
L^u Arg A.p A..,> Olv Tyr Tyr P,.^ Arg Pro Gin A^p He II, Tyr 

5 Glu Al, A..P Thr teu L,u VM Arc, Cly Ala Val C,- Leu Ar, Al" Ly. 
Leu S^r Glu A-^p Thr Pro Pro Gin a" Met Atg Pro Trp Irg n, Gly 

10 ^ot ^"'^ ^='' Gly Glu sfr Pro Thr Asp 

S«r Leu Glu v.l clu A.p H.t Ly. V.l Leu Tyr Tyr Arg Ly. Met Pro 
V.l Arg Pro Ly. lie Leu Ala r.y., Arg Phe lig Ph, Phe ser Gly Asn 
15 Leu Tyr Arg Gin Lys A.p A.p Glu Thr Thr Arg Ly= Ser L=u a" Arg 
Leu Gly Ala Phe S.r V.l A.p L«u A.n Ph, Leu Gin Arg A,p Ser 

20 3^0 '^'P "-'^ Thr Thr l" A.p Ly= Pro 

Trp A.P Ala Ser Leu Glu Thr Leu Phe Thr Ser Lys Ser Asn A,p Phe 
He Gly Pro Gly Leu A»n Phe Ala Leu Ala Ar| Arg Asn Val Phe Gi°y 
25 Oly Gly Glu As,. Le^ Ser Trp lie ^Jy Gly Ser Tyr Glu ^rp Glu 

Thr Gly A.,„ Arg Pro Glu A.n Ser Ser Asn Arg Lou He L'p He Asn 
30 lio ""-^ """^ ^" ^" S« Val 



50 



60 



70 



75 



440 

Val Asn Leu Ser Phe 
_ . 4SS 4gQ 

465 '•^^ "^^-^ fyr Tyr Pro Thr Thr Thr Thr 



Phe Gin Ala Ser Ala ^hr Ala X.eu Asn Arg til His Tyr Phe Ser Met" 



470 

Thr 

35 Tyr Ser Phe Gly Phe ser Thr Thr Tyr g!u Phe Gin Pro Ser lys Glu 
His Arg His Ala He Phe Pro Leu Lys Leu Asn Tyr Asn Leu Leu Gly 

40 tl"o ''"^ t',s "'^ ■''^^ '^^ 

Leu Leu Ser Leu Gl„ Ser Gin Phe Leu Ala Gin Het" Gly Tyr He Tyr 

Thr Phe Asn Lys Ser Val Ser Glu Lys Ser Pro His His Leu Trp Met° 
565 



555 

Pro His His Leu Trp 
45 Gin Phe Gly Leu Glu Ala Gly Asn lIu Leu Asn Leu He ^yr Leu 



& 1 T\ -I Ooo 590 

Ala Ala Lys Lys Tyr Ser A^p Thr Lys Asn Phe Val Gly Val Pro 

600 fin*; 
Phe ser Gin Phe He Ly. Ala Thr Gly Glu Leu Arg Tyr Ser Tyr Thr 
Ti wl3 620 

He Asp Arg Asn Gin Ser Leu Ala Thr Arg Phe Gly Thr Gly Val He 

Ti-r ser Tyr Gly Asn Met Arg Val Ala Pro "r Ser Glu Gin Phe Ty? 

55 val Gly Gly Ala Asn Ser He Arg Ala Phe Thr Val Arg Ser l" Gly 

Pro Gly Arg Phe Asn Pro Asp Ser "p Asn Gin Tyr Ser Tyr Leu Asp 

Gin val Gly Glu Phe Lys Leu Glu Ala Asn Val Glu Tyr Arg Gly Lys 



700 

Leu Phe Gly Asp Leu His Ala Ala Val Phe Leu Asp Ala Gly Asn Val 

Trp Leu Leu Arg Glu Asp Ser Ser Arg Pro G^y Gly Ala Leu Ser llu 

Tan 



65 val Gly ser Val Ser Asn Phe Leu Asn Ser He Ala Leu Gly ^hr Gly 
Val Gly Leu Arg Tyr Asp Leu Ala P^e Leu Val Val Arg v" Asp Val 



Gly Phe Gly Leu His Leu Pro Tyr Asn Thr Gly_Lys Lys Gly Tyr Tyr 
A--n He Pro Arg Phe Lys Asp Ala He Gly Ph, h!s Leu Ala Val Gly 
Tyr Pro Phe 600 
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10 
15 
20 
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30 
35 
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45 
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75 



(.-» IMFORIIATIOII FOR SEQ ID HO: 327 

(i) 3EQUEIICE CHARACTERISTICS; 

(A) LEM<3TH: 783 amino acid^ 

(B) TYPE: amino acid 
(D TOPQLOCij: linear 

fii) MOLECULE TVFE: prot-in 

tiii) HYFOTHETICAL: YES 

{vi» ORIGIIIAL SOURCE: 

(Al ORGAWISl-t: Po cphyroiBotias gingivalls 

{i>:\ FEATURE: 

(A) NAIIE/KEY: mxsc f eaLur** 

(B) LOCATIOTJ 1 . . ,783 



(xi) SEQUENCE DESCRIPTIOM: SEQ ID riO:327 





Leu 


Ser 


Pro 


Tyr 


He 


Arg 


Phe 


Pro 


Met 


Ser 


Ser 


His 


Ser 


Val 


Arg 


1 








5 










10 










15 




Tyr 


Leu 


He 


Gly 


He 


Ala 


Gly 


cys 


Leu 


Leu 


Leu 


Met 


Leu 


Ala 


Ser 


Ser 








20 










25 










30 






cys 


Ser 


Val 


Thr. 


Atq 


Tyr 


Val 


Pro 


Asp 


Gly 


Ser 


Arg 


Leu 


Leu Asp 


Arg 






35 










40 










45 








Val 


Thr 


He 


Ala 


Ser 


Glu 


Thr 


Gly 


Ser 


He 


Ala 


Leu 


Pro 


Glu 


Asp 


He 




50 










55 










60 










Arg 


Asp 


Tyr 


Thr 


Leu 


Gin 


Gin 


Pro 


Asn 


Tyr 


Arq 


Leu 


Phe 


Gly Met 


Thr 


65 










70 










75 










60 


Arg 


Trp 


Leu 


Leu 


Arg 


Val 


Tyr 


Ser 


Ser 


Ser 


Asn 


Pro 


Asn 


Scr 


Asn 


Ser 










85 










90 










95 




Trp 


Trp 


Asn 


Arg 


Ser 


Leu 


Arg 


Lys 


Met 


Gly 


Glu 


Pro 


Pro 


Val 


Leu 


He 








100 










105 










110 






Asp 


Ser 


Val 


Leu 


Thr 


Asp 


Arg 


Thr 


Ala 


Af 11 


Ary 


Leu 


Ala 


Lys 


Ala 


Met 






115 










120 










125 








Ala 


Gly 


Asp 


Gly 


Phe 


Leu 


Asp 


Ala 


Thr 


Ala 


Arg 


Ala 


Val 


Val 


Asp 


Thr 




130 










135 










140 








Gly 


Leu 


Tyr 


Lys 


Lys 


Ala 


Arg 


He 


Thr 


Tyr 


Leu 


He 


Gin 


Pro 


Gly 


Ser 


145 










ISO 










155 








160 


Arg 


Tyr 


Tyr 


He 


Arg 


Asn 


Met 


Ala 


Leu 


Asp 


Val 


Lys 


Asn 


Pro 


Leu 


Leu 










165 










170 










175 




Pro 


Pro 


Val 


Ala 


Leu 


Gly 


Asn 


Ser 


Leu 


Pro 


Ser 


Ala 


Tyr 


Lys 


Val 


Gly 








180 










185 










190 






He 


Ser 


Glu 


Gly 


Ser 


Pro 


Leu 


Ser 


Pro 


He 


Val 


Leu 


Asp 


Glu 


Glu 


Arg 






195 










200 










205 








Lys 


Ala 


He 


Ala 


Arg 


His 


Met 


Arg 


Asn 


Asn 


Gly 


Phe 


Trp 


Lys 


Phe 


Ser 




210 










2)5 










220 










Ma 


Glu 


Asp 


Val 


Tyr 


Tyr 


Glu 


Ala 


Asp 


Thr 


Thr 


Val 


Ser 


Gly Gly 


Ser 


225 










230 










235 










240 


Gly Thr 


Lys 


Sec 


Ala 


Asp 


Leu 


Lys 


Leu 


Val 


Val 


Asn 


Gly 


He Gly 


Arg 










245 










250 










255 




Tyr 


Pro 


T^-r 


Axg 


He 


Gly 


Axg 


Val 


Phe 


Phe 


Hts 


Ala 


Asp 


Tyr 


Asp 


Pro 








260 










265 










270 






Leu 


Glu 


Ser 


PiSp 


Phe 


Arg 


Val 


Gin 


Glu 


Leu 


Pro 


Arg 


He 


Asp Ser 


He 






275 










280 










285 








Ser 


Arg 


Gly 


Asp 


Tyr 


Thr 


Val 


Tyr 


Tyr 


Gly 


Ser 


Arg 


Gly 


Arg 


Tyr 


He 




290 










295 










300 










Arg 


Aid 


Ser 


Ala 


Leu 


Thr 


Arg 


Ser 


Val 


Ser 


Val 


Thr 


Pro 


Gly 


Ala 


Phe 


305 










310 










315 










320 


Phe 


Cys 


Glu 


Asp 


Asp 


Val 


Glu 


Arg 


Ser 


Tyr 


He 


Lys 


Leu 


Asn 


Ala 


Leu 










325 










330 










335 




Pro 


lie 


Val 


Arg 


Asn 


Val 


Asn 


He 


Arg 


Phe 


Val 


Glu 


His 


Asn 


Gly 


Lys 








340 










345 










3S0 






Asp 


Glu 


He 


Ala 


Leu 


Ala 


Asp 


Ser 


Ser 


Arg 


Leu 


Val 


Asp 


Cys 


Tyr 


He 






355 










360 










365 








Leu 


Thr 


Val 


Pro 


Ala 


Lys 


Ser 


Lys 


Ser 


Phe 


Glu 


A3 a 


Glu 


Val 


Leu 


Gly 




370 










375 








380 








Thr 


Asn 


Ser 


Ala 


Gly 


Asp 


Phe 


Gly 


Ala 


Ala 


Leu 


Ser 


Leu 


Gly 


Phe 


Thr 


385 










390 










395 








4 00 


A£-p Arg 


Asn 


Leu 


Phe 


Arg 


Gly 


Ala 


Glu 


Met 


Phe 


Asn 


He 


Lys 


Leu 


Lys 










405 










410 










415 




Gly 


Ala 


Tyr 


Glu 


Ala 


He 


Arg 


Lys 


Gly 


Ser 


His 


Ser 


Phe 


Met 


Glu 


Tyr 








420 










425 










430 






Gly Val 


Glu 


Ser 


Ser 


Leu 


Arg 


Phe 


Pro 


Arg 


Leu 


Leu 


Phe 


Pro 


Phe 


He 
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Sor Asp Glu Thr Arg Arg Arg Leu Arg Ala Ser Thr Giu Trp Lv^ He 

•^SO 4 55 4 60 

Gly Tyr Asn Tyr Gin Thr Arg Pro Glu Phs Asp Arg Val Tie Leu S»r 
^ ^f"^ 470 ,,75 480 

Ala Gin Lc-u Asn Tyr Ssr Trp Gin Thr Tyr Leu His A^n Arg Leu Arg 

186 490 495 

His Thr 11- Arg Leu Leu Asp Val Asp Tyr Leu Hir Leu Pro Tyr II- 

500 505 510 

At-p Pro AFP Phe Ala Gin Ser Leu Pro Pro Thr Thr Ala Leu Tyr Asn 

315 520 525 

Tyr Thr Glu Gin Phe He Leu Gly Ser Ala Tyr He Leu Arn T^r Thr 

530 535 540 

Thr Ala Scr Ser Met Glu Arg Thr Val Ser Arn Pro Phe Thr Ala Arg 
550 555 seo 

Phe Ser He Gin Thr Aid Gly Aen Leu Leu Gin Ala He Ser Tyr Lau 

565 570 575 

Thr Asp Scr Pro Lys Asp Glu His Gly Leu Tyr Lys Met Phe Gly Leu 

580 585 590 

His Tyr Ala Gin Phe Val Lys Leu Asp Leu Affp Leu Ala Lys Thr Val 
, , 595 600 605 

Leu Lou Glu Lys Asp Asn Thr Leu Ala Leu His Leu Gly Phe Gly L-u 
615 620 
9c; ^^"^ '^"^'^ ^'^^ Glu Leu Arg Tyr 

630 635 640 . ^ 

Phe Ala Gly Gly Ser Asn Ser Val Arg Gly Trp Ser Val Arg Thr L-u 

^"'S 650 655 

Gly Pro Gly Ser Met Lys Met Thr Pro Asp Lys Thr Phe Phe Asp Gin 

fi^O 665 67 0 

Met Gly Asp He Arg Leu Asp Leu Asn Val Glu T-yr Arg Thr Lys Leu 

6^5 680 685 

Phe Trp Lys Phe Arg Ala Ala Ala Phe Val Asp Ala Gly Asn Val Trp 

690 695 700 

Thr He Lys Glu Tyr Glu Asn Gin Glu Asp Gly Leu Phe Arg Phe Asp 
I «u ''^^ ■'IS 720 

Arg Phe Tyr Lys Glu He Ala Leu Ala Tyr Gly Leu Gly Leu Arg Leu 

725 730 735 

Asp Phe Asp Tyr Phe Leu Val Arg Leu Asp Ala Gly Leu Lys Ala Tyr 

''^O 745 750 

Asp Pro Gin Gin Thr Gly Arg Tyr Lys Trp Ala He Thr Arg Pro Asn 

^55 760 765 

Leu Ser Ser Asn Phe Ala Trp His He Ala Val Gly Tyr Pro Phe 
775 780 

45 {2) IHFORHATION FOR SEQ ID NO: 328 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 875 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{lii) H'lPOTHETICAL: YES 

tvl) ORIGirJAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .875 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 328 



30 



35 



40 



50 



55 



60 



65 Val Glu Ser Lys Leu Leu Cys Leu Met Arg Lys Arg He Leu Gin Leu 
^5 10 IS 

Phe Leu Thr Ala Leu Leu Leu Ala Leu Gly Sex Ser Leu Ala He Ala 

20 25 30 

Gin Thr Val Val Thr Gly Lys .Val He Asp Ser. GJ u Thr Ser Glu Pre 

3S 40 45 

Leu He Gly Val Ser Val Ser Thr Gly Gin Gly Ala Ser Leu Arg Gly 

^0 55 60 

Val Thr Thr Asp Met Asp Gly Gly Phe Arg Phe Glu Val Pro Ala Lys 
__ 65 70 75 

75 Ser Val Leu Thr Phe Arg Cys Val Gly Tyr Ala Thr Val Thr Arg Ser 
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S5 or. 

- - ?5 



G,y A,<j Cly S« Gin ..lu ;^.p il,- Thr He L.u Leu L'p Pro 

^^'^ 110 
' ^ -IAS Ala Ser Vol v«l pro 



Gin Ala IJe Gly Leu A^p Glu He Gin Va 1 H 



120 



LV. A.p Arg H.t Th. Pro V.l Pro V,l Ser A.n He HI v.l Ala A.p 
11^ Sin Ala Ala Ser Leu A.n Val Glu The Pro Olu Leu Val Ly. s-r 
10 Thr Pro ser T.r ryr Thr TKr Ly. .ly Ser i^ly .iy ,He Gly Asp Gl' 
Ar, Tnr A.n Val Ar, Gly P.e A.p Thr Tyr A.n Phe Gly Val icu i,e 
15 ^" Sfti ^" A3P Gly Ly. val Tyr Trp Ser 

A.. Trp ser Gly Leu Met A.n Gl„ Ala Ser Thr lie He Gin Ar, 

Gly Leu Gly Ala Ser Lys Leu Gly He Ser Ser Va'l Gly Gly Thr Met 
20 A.„ „e He Thr Ly= Thr Thr A.p Ala A.„ Thr Gly Gly Ser Ala T^r 
val G.y net Gly A,n A..p Gly Leu His l" Glu Ser Phe Ser lie s=r 
25 '^^^ Til <=1V rrp Al» x'l" Thr He Ala Gly llr HI, Met 

Thr Glv Leu Gly Tyr Val Ly. Gly Leu Ly. Gly Ar, Ma Phe Ser Tyr 
Phe Phe A.O val ser Lys L^., Phe A.n Glu Ar, Hi°° Thr Leu Ser Leu 
30 Thr Gly Phe Gly Ala Pro Gin Trp His A.n llf. Ar. Ser Ser Ly. ^^r" 
.er val Ala Asp Tyr Asp Lys Tyr Gly 'Al Ar, His Asn Gin III P.e 
35 lit Thr Ala Ty. Ala ^^r" As„ Thr 

Tyr His Lys Pro Gin Phe Ser Leu Asn His Phe Trp Lys Met Asp Glu 
Asn Thr ser Leu Tyr Thr Ala Un. Tyr Ala Ser LeS Ala Thr Gly Gly 
40 Gly Ar, Ar, Ala Tyr Gly Lys Asn Ser Lys T^rp Val Leu Xle Asn ^y^ 



Asn Thr Cly Gin Pro Tyr Glu Gin Thr L^-s" Val Thr Pro Asp G^y Leu 
45 ^'"^ '^-P Ala Wa Asn Ala Al, Ala Ser Asn Gly 

ser Glu Ala He Phe Ala Leu Gly Ser Asn Ser His ijs Trp Phe Gly 
I-eu Leu ser Ser Phe Lys Lys Lys Leu Asn Ser Se'r Leu Thr Leu Thr 
50 Ala Gly Tyr Asp Gly Ar, Tyr Tyr Ar, Gly ^p His Tyr Asp Lys h! 
Thr ASP Leu Leu G,y Gly Ser Tyr Tyr ^ Glu Asp Pro Lys ihr Lys 

55 IVs 

Asn Ar, Asp Tyr Thr Gly Glu n'e Met Trp His Gly "u Phe Ala Gin 

Met Glu His ser Ser Glu Trp He Asp Ala Phe C^? Ser Gly Ser He 

60 Asn Tyr Glu Leu Tvr Ara A-n « a ^ ^60 
565 ^^"^ ^^'^ Lys Ser Thr 

570 



Cly Tyr Leu Pro Gly Val Ser Pro Trp Lys Ser Phe Leu Pro ^rl Ser 

305 



65 "-^^ t',t -"^^ Hi» Val Phe 

Ala Asn Gly Gly Phe Phe Thr Ar, Ala Pro Leu Phe "y Asn He Tyr 
Ala Ala Gly Ala He He Pro Asn Asp Lys Ala "n Met Glu Lys Val 

70 Leu Thr Gly Glu Val Gly Tyr Gly Phe Thr L^n^ His Lys Asn Phe 'ofu 
Phe Asn He Asn Gly Tyr Tyr Thr Lys ^rt Met Asp Ar, Val T^^^ Ser 

75 l^l Val Tyr Leu Asn Gly Val "p Ala Val 

685 
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Hi, cv= Gly Val Gl. Al. Gl . Val ser Tyr Arg Pro Ilo Arg Gin He 

700 

Arp Arg Glv |..,t Ph, S.r Le. Gly A-.p Trp Thr Trp Gin A^n A.n 

5 v»l s,r Tvr Thr Ser Tyr Aip: Slu Ala Gly Irn Glu Thr Gly Gin lip 

730 T>c 
II. Thr Tyc 11= I.y. Gly Leu Hi, Val Gly A=p Ala Aia Gin Het Thr 

Ala Ala Val s« Ala Asp 11^ Giu Leu Phe Lys Gly Phe Hi." Val lie 

Gly Ly^ Tyr Acn Phe Leu Cly Ly, A^rn Tyr Ala Gly Phe Asn Pro Ala 

Thr Arg Aen Ala Gin Gin Tyr Glu Ala Asp Gly Lys Glu He Val Glu 

15 Ser Trp Lys Leu Pro A-.p Val Gly Leu Phe A^p Leo Ser Ala Ser ?yr 

810 815 
Asn Phe Ly:, Leu Gly Ser Leu Ser Thr Thr Phe Tyr Phe A:,„ Met A,-p 

20 Sal "'^ ^''^ ^" A-'P '^-'P Asn He He 

840 845 
UlV Ly^ L/s His Asp Glu Ala Ser Ala Leu Val Trp Tyr Gly Phs Gly 

Arg Thr Trp S-r Thr Gly li- Arg Val Asn Phe 
865 870 875 



25 



30 



(2) INFOKi-lATlOIJ FOR SEQ ID 110:329 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 60 amino acidc 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) IIOLECULE TYPE: protein 
35 (lil) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAiaSM: Porphyromonas gingivalis 
40 (ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .4 60 

^5 SEQUENCE DESCRIPTION: SEQ ID NO: 329 

Asn Arg Cy.. A^n Pro Ser Arg Gin Tyr Ph« Hi^- Leu Ser Gin Asn Asp 
Val Gly Arg Ser Leu Arg Ala Phe Leu \L His Leu Ser Met Us Phe 

50 sec He Arg Leu Phe Leu Cy^ lie Ue Phe Leu Leu Ser Wa Phe He 

Leu Pro Ala Lou Gly Gin Lys Ser Lys Gin Val Gin Arg Leu Glu Lys 
55 gQ 

55 Is" ^^'^ '^^ Thr ASP Arg Glu Leu 

Arg A^n Thr Lys Ly, Aap Ly, Gin Asp Lys Lys His L«u Asn Leu 

Leu Asn Lys Gin Val Ala Gin Arg Lys Gin Het Val Gin Leu Leu Asp 

60 Asn Glu val Lys Glu Leu Gin s.r i^p H. Asp ser Het t" Gly Val 

120 125 
Cys His Gin Leu Ser Val Glu Glu Lys Ala Arg Ser Asp Glu Tyr Ala 

13b 140 

65 lis ^" Tsl "'^ "-^^ *=P '^■^^ 

Leu Phe II. Ser Ser Ala Lys Ser Phe Asp g" Gly Met Arg Arg ie? 

Arg Phe Leu Glu Gin Tyr Ala Ser Ala Tyr Lys Leu Ala Ser ^II Arg 

7U Leu Arg Asp Thr Arg Ser Lyr Leu Glu Thr Glu- Arg Ala Thr Val Glu 

200 205 
Asp Ala Lys Lys Glu Lys Gly His Leu Leu Val He Arg Glu Glu Glu 

*1 o 220 

75 225 ''"^ ''''^ """^ '^^g Arg Gin Val Gin 

230 235 240 
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Al. Leu Oiy Ala Lys Gin Ly. A.p Leu oXu Al. Gin Leu A.g L,-. Gin 
... ... ..u M^cn Ar| Lys li^ Glu Ly.- Gin III /^a 



Z.y.^ Lyr G^n Ala Glu Al. L.u P^n Ar| Lys li. Glu Ly. Gin if 

Ly. Glu lie Siu Ai a Ala Glu Arg Arg Ala Arg Glu Glu Arc 

L.U Ala krg Glu Ala Lys Ala Lys Gi y Lys Pro Val HI Ala Glu P.o 

_ _ 2- J inn 



10 



.3Iu Arc, Lys Ala Glu The ly-. G, y oi,- xyr Ala nl't A.p Ala ^er Glu 
Arr, Al.-. Leu Ser Gly Ser Phe Ala Gin A|„ Ly' Gly Arg Leu Pro G^y 
Pro Val Arv Gly Ar, Tyr Arg He Val s"r A.p Phe Gly VaX ^i' Gin 
15 Hlr Sor Glu Leu Lys Ly, Val Gin V^l^ A.„ A.n Gly Gly li° A.p He 
Ala Val Aia Thr Gly Scr A.p Ala Thr Ser Val Phe ^p Gly Val Val 
s.r S«r val Phe Val lie Pro Gly Tyr A.n Ser Ma v., „et Val Ar, 
HI. Gly A.n Tyr lie Thr Val Tyr Ala A.n Leu Ser Ly. Val Tyr vll 
A..„ ser Cly Thr Arg Val Lys Thr Gly oln Al. L=u Gly Ar, Al'a Tyr 
25 Thr Asp Pro Ser Asn Asn G,„ Thr He He HI. Phe Glu He Trp Ly. 



20 



30 



35 



Glu Ar, ser Lys Gin Asn Pro Arg Leu Trp Lou Arg 
«5 

(2) IHPOruiATION FOR SCO ID HO: 330 

(1) SEQUEHCE CHARACTERISTICS- 

(A) LENGTH: 342 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



44i 



50 



(ii) MOLECULE TYPE: protein 
40 "^^°T"ETICAL: YES 

(vi» ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas ginqlvalis 
(ixl FEATURE: 

lAI NAME/KEY: mlsc feature 
(81 LOCATIOH 1. . .342 

SEQUENCE DESCRIPTION: SEQ ID HO: 330 
Ser Phe Tyr Gl,. Glu He Asp Arg Leu Met Lys Lys Tyr Leu Leu Tyr 
Ala ser Leu Leu Thr Ser Val Leu Leu Phe Ser Cys Ser Lys ^^n A.n 
Pro Asn Glu Pro Val Glu Asp Arg Ser He Glu He Ser He Arg Val 
ASP A.P Phe Thr Lys Thr Gly gL Ala Val Arg Tyr Glu Arg Asn Gin 
Gly Ser Ala Ala Glu Arg Leu He Thr Asn Leu Tyr Leu Leu Leu Phe 
60 ATP Gin ser Gly Ala Asn Pro Ala Lys Tyr ^yr He Thr Gly Asn Thr 
Phe Thr Gly Gly Thr Trp LOU Pro Asp Met Lys Val Lys !eu Asp 

65 tls ^VS val Tyr Val iai° Ala A.n 

val ASP Asn Ala Val Lys Thr Ala Leu Asp Ala Val "a Asn Glu Ser 
Asp Leu Gin Thr Val Lys Arg Thr Thr Ala Met Pro Trp Ser Thr Asp 
70 He Ala Ser Pro Phe Leu Met Ser Gly Asn "s Thr His Asp Phe llu 
Ala Asn Arg Leu Leu Asp Asn Val Pro ^eu Val Arg Ala He ul Lys 

75 [Ts ^"^ "-y^ "e 

200 205 
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20 



30 



35 



40 



255/490 



Zls v., A.n Glv Leu Ser GIu P,.e Lys Fho Arg Tyr VM. A..n Ph, 

A.P LV. Gl. Th. Tvr V.l V.l Ly. P,o Thr The A.p L.. 

5 II. ser ..r A.. A.„ ca, Val Trp P.o «in Jif Th. A.p T.p Th. Vm' 
rrp Olv Ala S„ ..u A.,, xnr Ser P„ 1!° .ro A.p Ala Oly 
10 ^■'"'^ '■''^ ''^^ «y Val Thr Ala Leu Arg Vll Val Thr 

Tvr L.a A.„ Glu Arg A.p S,r l^-s aly Ala Thr Val Gl'u Val Ala L.u 
Pro Ar, val A.p A.p .x,- Thr Leu Pr» Pro Pro «u Phe Cly Pro 3Io 
15 Leu Tyr Ar, Leu Pro Leu Pro A.p L,-, He Leu Ar, Asn H.s Trp ^r 
I.ys Tyr Glu Val Glu He ^35 
340 



(2) IHEXJWIATION FOR SEO ID NO:331 



(11 SEQUEtlCE CHARACTERISTICS- 

(A) LENGTH: 329 amino a^ids 
'B) TYPE: amino acid 
" ID) TOPOLOGY: linear 



(11) HOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

I VI) ORIGIHAL SOURCE- 

(A) ORQA.1ISM: Porphyronwnas gingival!, 
(iJ!) FEATURE: 

(A) NAME/KEY: raise teature 

(B) LOCATIOIJ 1 . . .329 



(xi) SEQUENCE OESCRIPTIOH: SEQ ID N0:33I 
Asn Leu Arg Thr Gin Pro Phe Phe Leu Val Asp Leu Gin Ser Met He 
Arg Thr He Leu Ser Axg Tyr Val Ser Ser A.„ Phe Trp Ser i^g aiy 
Ala Thr Phe Phe Phe Thr He Phe Pro Ala Phe He Leu IL Ala Thr 
Ala Leu Pro Ala Cy. oiy Gly Gly Thr Ala Ser Gly Ser Asp Arg Thr 
^eu Ala val Thr He Glu Pro Gin Ly, Tyr Phe Glu Ser He Ala 

A.P Lys ser Val Gin Val Val Ala Leu Val Pro Ala Gly ser A.n Iro 
Olu Glu Tyr Asp Pro Ser Pro Thr Val Lys Arg Leu Ser Ifu Ala 



55 '^^ '^^ IVs til 



Ala Ala He Arg Asp Asn A.„ Pro Ly, Leu Pro Leu Phe Glu Met Gly 
Ala Leu Ala A.p Ala Gly Ser Ala A.p Leu Ji%° Gly Ser Cys Thr 
60 A.P HI. ser „1= Thr A.p Leu Hla Ala Hi. 11^ Pro HI. Tyr Trp III 
s=r val val Gly Ala Ly. Ala Leu Ser '.I', Ala Ala Tyr Asp HI Leu 
65 ill Trp A.P Lys Gl'y His A.p 

Arg Leu A.„ Gly Arg He A,p Ser Val Ly. Arg Leu 5af Asp Thr Me. 
Phe Ala Asn Gly Lys Ala Asp Lys Ala Phe Val He Tyr His Pro Ser 
70 Leu ser Phe P„e Ala Gin Glu Phe Gly Leu ^^g' Gin He Val He III 
Olu ASP Gly Lvs Glu Pro Thr Ala Ala 'ult Leu Arg Arg Val III Asp 
75 Cln Ala Arg Ala Asp Gly Val Arg z,t Val Phe He Gin III Glu Phe 
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20 



25 



65 



Gl. Th. Ar, Gin Al. aiu A.p He Aro -3iu lie C-ly AI. Arg Pro 

V.l Arg II. A.^n Kro L,u Arg Ser Ser Trp 31u Glu Glu He Leu HX. 
5 lie Ala Ar-? Ala Leu Ala His Giu Arg ''^^ 

(2) IlirORilATIOlI FOR SEQ ID HO: 332 

li' SEOUEI ICE CHARACTERISTICS • 

fA) LENGTH: 878 ansino acid- 
(B) TYPE: amino acid 
(D» TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyroir.ona^ gin-^ivalis 
iiy.) FEATURE: 

(A) IIAHE/KEY: mi^c featur" 
iB) I,OCATION 1. . .878 

{XX) SEQUENCE DESCRIPTION: SEQ ID NO:332 
Ala A.p Scr He Arg Tyr Pro Leu Tyr Phe Phe Gly Arg Asn Arg Ly. 
30 Lys cv. Ar. Gl. Pro He Pro TKr Leu Tyr A.n Lys Asn lie 

Gly Lys Lya He Phe Phe He Leu Leu Ala Leu He Ala Phe Ser Gly 

35 so" ^'"^ ^""^ I^^ Thr Asp Ala A.n 

60 

11= lie Gly Hi. V.l Lys Asp Ser Lys Thr Sly Glu Hi. L.u Val Gly 
lie Thr 11. Ala lie Lys Gly Thr Thr Phe lly Thr Ser Th. Asp IL 
40 Thr Gly Hi. Tyr Tyr Leu Arg Asn Leu A^c, Pro Gly Glu He Ihr Leu 
He Met Arg Gly M«t Gly Tyr Lys Ser Gin Glu Arg Val v" Arg Val 

45 l^'o '^'^ ^1" Ala Glu Glu A.,p Ala 

lie Asn Leu Asp Glu Val Val He Ser Ala Asn ^rg Glu Leu Thr Leu 

Arg Arg Leu Ala Pro Thr Leu Val Asn Val Leu Asn CU, Lys Val Phe 

50 Ser Gin val Asn Ala Ser Asn Leu Ala Gin Gly Leu Ser Phe sln Pro 

Gly Val Arg Val Glu Asn Asn Cys Gl" A^„ Cys Gly Phe 11° Gin Val 
* 200 20S 

55 ' III "''^ ^^^9 Tyr Ala Gin He Leu II, Asp Ser 

Arg Pro He Met Ser Ala Leu Ala Gly Val Tyr "y Leu Glu Gin He 
Hro Ala Asn Met lie Glu Arg Val Glu Val vfl Arg Gly Gly Gly Ser 
60 Ala Leu Tyr Gly Ser Ser Ala He Ala Gly Val Va, Asn He He Thr 
Lys Glu Pro ser His Asn Ser Phe rtl Phe Asn Glu Ser lIu Ser Phe 
Thr Gly Phe Ser Lys Leu Asp ^n Asn Thr Asn Phe Asn Ala Ser He 
Val Ser Asp Asp Asn Arg Ala Gly Ala Met Val Phe Gly Gin Ala Arg 



75 



Tyr Arg Asn Hi, Trp iip Ala Asn Asn Asp oty Tyr Ser Glu Leu G^y 



70 Lys He Asp Ala Arg Ser Leu Gly Ala His Ser Tyr Leu Arg Leu Ser 
ASP Tyr Ser Lys Leu Thr Gly Glu P^e His Thr He Ser Glu Phe Arg 
Arg Gly Gly Asp Arg He Asp Leu Pro Pro His Val Va" Gly Val Ala 
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Glu Gin Thr Asp His Ser Val Phc Ssr Gly Asn Leu Lys Tyr Asp Leu 

Phe Ser Ser Asn Tyr Lys His His Phe Gin Ala Tyr Thr Ser Gly Gin 
_ 405 4j^5 

0 He Val Asn Arg Lys Ser Tyr Tyr Gly Gly lie Gly Glu lie Asp Val 
'^^O 425 430 

Asn Gly His Pro GJ y Gly Thr Glu Gly Tyr Pro He Pro Gin Asp Gin 
435 440 445 

in '^^''^ ?cX "^^^ '^^'^ "Th^ Tyr Met Gly Gly 

450 455 

He Sin Tyr Scr Tyr Asp Leu Asp Lys Phe Leu Leu Met Pro Ser Gin 

47C 475 

Leu Leu Phe Gly Ala Glu Tyr Thr Ar<j Asp Glu Leu Asn Asp V,il Met 

490 495 



15 



25 



65 



70 



Pro He Leu Scr Trp Gin Thr Gly Glu Asp Ala Asn Gly Asn Thr II*. 

505 

Pro Leu Tyr Pro Glu Leu Asp Gin Asn He Asn Asn Tyr Ser L-u Ph- 
515 520 525 

Vn '^''P ^^"^ Trp Ser He Leu Val Gly Ala 

535 540 
Arg Leu Asp Lys His Ser Glu Val Lys Asp Met lie Leu Ser Pro Ara 

555 

Thr Thr Leu Arg Phe Asn Val Asn Pro Asp He Asn Leu Arg Ala Thr 

5€5 570 
Tyr Ala Lys Gly Phe Arg Ala Pro Gin Vai Phe Asp Glu Asp Lou His 

580 585 
Val Gly Vai Vol Gly Gly Glu Ala Gin Lys Vnl Phe Asn Asp Pro Asn 

600 605 

610 615 620 

His Axg Phe Gly Asn Val Gin Thr Asm Phe L=u Val Glu Gly Phe Tvr 

635 640 
Thr Arg Leu Leu Asp Val Phe Thr Asn Glu Glu Gin Pro Asp Gin His 
qc 650 655 

JO Asp Gly He Lys Arg Tyr Thr Arg 11% Asn Gly Ser Gly Ala Lys Vai 
660 665 
Phe Gly Leu Asn Leu Giu Gly Lys Val Ala Tyr Lys Ser Phe Gin Leu 
675 680 605 

40 ^in '^^'^ ^" Tyr Asp Glu Ala Gin Giu 

690 695 700 

Trp Gly Leu Asn Thr Val Lys Asp Thr Asn Gly Ala Phe Val Thr Giu 
TIO 715 720 

Ala Asn Ala Asn Gly Gin Gin Glu Tyr Lys Asn Glu Ser Met Thr Asp 
/ler "'^^ 730 735 

40 Thr Gin He Thr Arg Thr Pro Ser Val Tyr Gly Tyr Phe Thr Leu Ala 
740 745 750 

Tyr Asn Pro Ala His Ser Trp Asn He Ala Leu Thr Gly Ala Tyr Thr 
753 760 765 

50 ""^^ 170 ^"^^ 775 ""'^ ""^^ '^'''^ ""^^ '^^^ ""^^ 

Glu Leu Asp He Met Gin Asn Asn Pro Glu He Thr Asp Giu Thr Gly 

''^O 795 800 

Lys Aia Pro Arg He Asp Giu Leu Lys Lys Thr Pro Ala Phe Phe Asp 

cc eio flis 

55 Leu Giy Leu Lys Val Gly Tyr Asp Phc His Val Phe Gin Ala Thr Giu 
820 825 830 

Val cm Leu Tyr Val Gly Met Asn Asn He Phe Asn Ser Phe Gin Lys 

835 840 845 

60 Vln '^''^ Tyr He Tyr Gly Pro 

855 860 

Thr Gin Pro Arg Thr Gly Tyr Met Gly Leu Val Val Lys Phe 
865 870 87i> 

(2) IMFORMATIOM FOR SEQ ID NO: 333 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 206 amino acids 
(D) TYPE: amino acid 
(D> TOPOLOGY: linear 



Ui) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 
75 (vi) ORIGINAL SOURCE: 
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(AJ ORGANISM: Porphy ronton as gingivalis 

(ix) FEATURE: 

(A> NAME/ KEY: misc_f eatur e 
5 (B) hOCPk710t4 1..-206 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 333 



10 



15 



20 



25 



30 



35 



60 



65 



70 



75 



Lys 


Gin 


He 


Val 


Met 


Thr 


Val 


Lys 


Arg 


Ala 


Val 


Arg 


He 


Ala 


Leu 


Leu 


I 








S 










10 










15 




Thr 


Leu 


He 


Gly 
20 


He 


Leu 


Phe 


Ser 


Ser 
25 


Pro 


Ser 


Leu 


Val 


Arg 
30 


Ala 


Gin 


Ser 


Leu 


Phe 


Ser 


Thr 


Glu 


Ills 


Val 


Leu 


Gin 


Lsu 


Tyr Asn 


Lys 


He 


Leu 






35 










40 










45 








Tyr 


Gly 


Giu 


Ser 


Ala 


Ala 


Asp 


Thr 


Val 


Ala 


Glu 


Lys 


Thr 


Ala 


Gly Glu 


50 










55 










60 










Ser 


Ala 


Phe 


Pro 


Phe 


He 


Asp 


Lys 


Leu 


He 


Asf^ 


Leu Gly Arg 


Thr 


Phe 


65 










70 








75 










80 


Leu 


Gly 


Lys 


Pro 


Tyr 
85 


Arg 


Tyr 


Arg 


Gly 


Pro 
90 


Ser 


Pro 


Trp 


Pro 


Met 
95 


Asp 


Cyff 


Ser 


Gly 


Tyr 
100 


Val 


Ser 


Tyr 


Leu 


Tyr 
105 


Ser 


Lys 


Phe 


Arp 


He 
110 


Lys 


Leu 


Pro 


Arg 


G]y 
115 


Ala 


Ala 


Ala 


Gin 


Ser 
120 


Gin 


Tyr 


Thr 


Asn 


Pro 
125 


He 


Glu 


Arg 


Cla 


Asp 
130 


Val 


Arg 


Pro 


Gly 


Asp 
135 


Leu 


Leu 


Phe 


Phe 


Lys 
140 


Gly 


Arg 


Asn 


Ala 


Axg 


Ser 


Asn 


Arg 


He 


Gly 


His 


Val 


Ala 


Leu 


Val 


Val 


Ser 


Val 


Asp 


Glu 












150 










155 










160 


Asp 


Asp 


He 


Thr 


Het 


Met 


His 


Ser 


Arg 


Asn 


Ser 


Arg Gly 


He 


Val 


He 










165 










170 










175 




Glu 


Lys 


Leu 


Asn 
180 


Arg 


Ser 


Ala 


Tyr 


Phe 
185 


Ser 


Arg 


Arg 


Leu 


Val 
190 


Ser 


Tyr 


Gly 


Arg 


Val 
195 


Pro 


Gly 


Ala 


Lys 


Arg 
200 


Val 


He 


Pro 


Arg 


Lys 
205 


Ser 







(2) INFORMATION FOR SEQ ID NO: 334 



(t) SEQUENCE CHARACTERISTICS; 

iJK) LENGTH: 4 67 amino acids 
40 tBJ TYPE: amino acid 

(D) TOPOLOGY: linear 

di] MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

(Vl) ORIGINAL SOURCE: 

(A) ORGANISM; Porphyromonas gingivalis 

50 (ix) FEATURE: 

(A) NAME/ KEY: ml sc_f eatur e 

(B) LOCATION 1. . .467 



55 





(xl) 


SEQUENCE DESCRIPTION: ; 


SEQ : 


ID NOr334 










Lys 


Gly 


Thr 


Trp 


Affn 


Arg 


Lys 


Asn 


Arg 


Arg 


Glu 


Glu 


Mot 


Lys 


Arg 


Thr 


1 








5 










10 










15 




Tie 


Leu 


Leu 


Thr 


Ala 


Leu 


Thr 


Val 


Leu 


Ser 


Ser 


Leu 


Ser 


Leu 


Leu 


Arg 








20 










25 










30 






Ala 


Gin 


Asn 


Glu 


Ser 


Glu 


Ala 


Ser 


Thr 


Asn 


Pro 


Met 


Ser 


Gly 


Lou 


Ser 






35 










40 










45 






I.eu 


Glu 


Asp 


Cys 


He 


Arg 


He 


Ala 


Lys 


Glu 


Arg 


Asn 


Leu 


Asn 


Leu 


Arg 




50 










55 










60 










Arg 


Gin 


Glu 


He 


Glu 


Gin 


Glu 


Asn 


Arg 


He 


He 


Ser 


Leu 


Asp 


Ala 


Ala 


65 










70 








75 










80 


Arg 


His 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asm 


Ala 


Gly 


He 


Gly His 


Asn 


Tyr 










85 










90 










95 


Ser 


Phe 


Gly Arg 


Ser 


Lys 


Asp 


Lys 


Thr 


Gly 


Val 


Thr 


Val 


Asp 


Arg 


Ser 








100 










105 










HO 






Ser 


Met 


Asn 


Thr 


Asn 


Leu 


Ser 


He 


Gly 


Ala 


Ser 


Val 


Glu 


Val 


Phe 


Ser 






115 










120 








125 








Gly 


Thr 


Arg 


Arg 


Leu 


His 


Asp 


Leu 


Lys 


Gin 


Gin 


Lys 


Tyr 


Asn 


Val 


Glu 




130 










135 










140 










Asp 


Gly Ho Ala 


Arg 


Leu 


Gin 


Lys 


Ala 


Arg 


Glu 


Asp 


Leu 


Ser 


Leu 


Gin 


145 










150 










155 










160 



PCT/AU98/010!23 



StJBSTTrUTE SHEET (Rule 26) (RO/AU) 



iHlJ iHii 'r»!l' r™i! »^"^ 

iLji n3 c:ii ict o Oi m u q ^ar: C3i u u 



WO 99/29870 PCT/AU98/01023 

259/490 

He Ala Ala Leu Tyr He A^n Leu Leu Phe Arg Gin Glu Met Thr Arq 



30 



50 



60 



65 



Thr Ala Glu Thr Gin Leu Ala Leu lie Arg Glu Gin Arg A,n lA Thr 

5 Ala Glu Met Val Arg Val Gly Lys Trp Ala Glu Gly Lys Leu Leu Asp 

200 205 
lie Asn Ala Gin Met Ala Ly^ A:,p gi^ Gin Leu Leu Val Gin Tyr Arg 

10 til ^^"^ ^0 '''''' ^""^ ^'^^ 

Glu His Pro Glu Ser He Ala Val Lys Ala Pro A^.p Thr Asp Val Leu 

Val Ala Glu Arg Leu Gly Ser Leu Leu Ma Pro Glu Glu He Tyr Arg 

15 Thr Ala Leu Gly Leu Ly. Pro Ala ^u His Ser Ser Glu llu Gin He 

Ala Sor Ala Arg Glu Gly Leu Ala Ser Ala Arg Ala Alf Tyr Phe Pro 
295 300 

20 los ^^"^ ^^"^ ""^^ '"^'^ l^l ""^^ 

Gly Lys Glu Tyr Ala Ala He A^n Pro Ser Ihl Ser Glu Gin Trp Lys 

^25 --^ 



75 



315 

A.<.n Asn Gly Ser iy~r Ser He Gly Leu ler Leu Asn He Pro He Phe 



or: 345 :aso 

25 ser Ala Met Gin Thr Gin Asp Arg Val Arg Ser Ser Arg Leu Gin 11. 
Jl>^ 360 365 

Arg ser Ser Glu Leu Arg Leu Val Glu Glu Lys Lys Ala Leu T>'r Lys 
^'^ 375 



Glu He Arg Gin Ala Tyr Ser Asn Ala Val Ala Ala Asp Lys Ala He 
JO-^ 390 



Ala Ala Ala Glu Asn Ser Lys Ala Ala Thr Leu Lys Ala Tyr Glu ^yr 
»i « , 410 

Ala Arg Asp Ser Phe Glu Ala Gly Arg Leu Sor Ala Tyr Glu Tyr Ala 
QC - 420 425 430 

J5 Glu Ala Lys Thr Lys Tyr Ala Leu Ser Gin Val Glu Glu Leu Arg Ala 

Lys Tyr Asp Phe He Tyr Lys Ala Lys Val Leu Asp til Tyr Gin Gly 
455 4fio 

Lys Asp Phe 
40 4 65 

(2) INFORI4ATTON FOR SEQ ID NO:33S 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 51 amino acids 
(B» TYPE: amino acid 
(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
(iiil HYPOTHETICAI.: YES 



(VlJ ORIGINAL SOURCE: 

ORGANISM: Porphyromonas gingivalis 

(Ix) FEATURE: 



(AJ NAME/KEY: misc_f eature 
(B) LOCATION 1. . . 451 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335 

Lys Thr Ser Tyr Arg Asn Asn Met Arg Phe Gin HJ s Tyr Leu He Cys 

Thr Ala Ala Val Ala Ala Leu Ala Ala l^n Pro Leu Thr Gly Gin Ser 

Aan Met Thr Leu Glu Glu Cys He ^p Tyr Ala Arg Arg His Ser Ser 

Ala Val Ala Leu Ser Ala Ala G^u Leu Glu Gin Ser Lys Ala Asp Tyr 

L|u cm Ala Val Gly Asn Phe Leu Pro Arg Val ser Ala Gly Thr Gly 

Ala S^r Trp Asn Phe Gly Arg Gly Leu A^p IL Glu Thr Asn Thr ?yr 

85 90 Qc 

Thr Asp He Asn Ser Phe Asn Asn Ser Tyr Ser He His Ala ^hr Met 



110 
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65 



The 


Leu 


Phe 


Asp 


Gly 


Leu 


Gin 


Ser 


Val 


Tyr Arg 


Leu 


Arg 


Met 


Ala 


His 






115 










120 








125 








Ala 


Arg 


Arq 


Glu 


Ala 


Ser 


Arg 


Leu 


Ser 


Val Arq 


Glu 


Gin 


Gin 


Glu 


Leu 




130 










135 








140 










Ala 


Ala 


Leu 


Gly 


Thr 


Thr 


Glu 


Ala 


Tyr 


Tyr Asp 


Leu 


Val 


Tyr 


Ala 


Arg 


145 










150 








155 










160 


Gin 


Met 


Gin 


Gl u 


Leu 


Ala 


Met 


Gin 


Lys 


Tyr Glu 


Glu 


Ser 


Ser 


Arg 


Leu 










165 










170 








175 




His 


Arq 


Gin 


Thr 


Ala 


Arg Met 


Glu 


Glu 


Leu Gly 


Met 


Lyo 


Ser 


Arg 


Pro 








180 










185 








190 






Asp 


Val 


Leu 


Glu 


Met 


Gin 


Ser 


Arg 


Met 


Ala Gly 


Asp Arg 


Leu 


Ala 


Leu 




195 










200 








205 








Thr 


Gin 


Ala 


Asp 


Asn 


Gin 


Cys 


He 


He 


Ala Leu 


lis 


Arg 


Leu 


Lys 


Glu 




210 










215 








220 










Lys 


Met 


Asn 


Phe 


Pro 


He 


Asp Asp Glu 


Leu Val 


Val 


Asp Asp Met 


Pro 


225 










230 








235 










240 


Ala 


Asp 


Ser 


Leu 


Ser 


Ala 


Asp Met 


Ala 


Glu Ssr 


Asp 


Scr 


Ser 


Ala 


Gly 










245 










250 








255 




Val 


Phe 


Ala 


Arg 


Ala 


Ala 


His 


His 


His 


Pro Val 


Leu 


Leu 


Arg 


Ala 


Lys 








260 










265 








270 






Leu 


Asp 


Glu 


Gin 


Ala 


Ala 


Thr Asp Arg 


Leu Arg 


Ala 


Ala 


Arg 


Gly Ala 
















280 








285 








Phe 


L&u 


Pro 


Ser 


Val 


Ser 


Val 


Ser Gly 


Gly Trp 


Asn 


Thr 


Gly 


Phe 


Ser 




290 










295 








300 










Arg 


Phe 


Leu 


Asn 


Gly Ser Asp Tyr Thr 


Pro Phe 


Ser 


Glu 


Gin 


Phe 


Arg 


305 










310 








315 










320 


Asn 


Arg 


Arg 


Gly 


Glu 


Tyr 


Val 


Ser 


Leu 


Asn Leu 


Ser 


He 


Pro 


He 


Phe 










325 










330 








335 




Ser 


Cly 


Phe 


Ser 


Leu 


Val 


Scr 


His 


Leu 


Arg Gin 


Ala 


Arg 


Ala 


Glu 


Arg 








340 










345 








350 






Arg 


Ala 


Ala 


He 


Val 


Arg Arg Gly Glu Ala Glu 


Arg 


Arg 


Leu Tyr 


Ser 






355 










350 








365 








Glu 


He 


Ala 


Gin 


Ala 


Met 


Ala Asp Arg Asp Ala 


Ala 


Leu 


Ala 


Ser 


T^'r 




370 










375 








380 










Arg 


Gin 


Ala 


Lys 


Glu 


His Thr Asp Ala Met Gin 


Thr Ala Tyr Glu 


Ala 


385 










390 








395 










400 


Val 


Leu 


Gin 


Arg 


Tyr 


Glu 


Glu 


Gly Leu Asn Thr 


Ala 


He 


Asp 


Leu 


Thr 










4 05 










410 








415 




Thr 


Gin 


Ala 


Asn 


Arg 


Leu 


Leu 


Asp Ala Arg Val 


Gin 


Arg 


Leu 


Arg 


Ala 








420 










425 








430 






Ala 


Met 


Thr 


Tyr 


Arg 


Leu 


Lys 


Cys 


Lys 


Leu Tie 


Ala Tyr Tyr Gly Cys 






435 










440 








445 








Leu 


Ser 


Asp 




























450 




























(2) 


TNFORJIATIOM 


FOR 


SEQ 


ID NO: 336 















10 



15 



20 



25 



30 



35 



40 



45 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 962 amino acids 
50 (B) TYPE: ajnino acid 

(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

55 till) HYPOTHETICAL: YES 

fvi} ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalls 

60 (ix> FEATURE: 

(A) tiWZ/KEri: misc_feature 

(B) LOCATIOH 1. . .962 



(xi} SEQUENCE DESCRIPTION: SEQ ID nO:336 



Ala He Phe Val Val Ser Leu Gin He Glu Lys He Thr Glu Asn Lye 
15 10 15 

Tyr Asn Ser Asp Lys Ser Met Asn Lys Phe Tyr Lys Ser Leu Leu Gin 
20 25 - 30 

70 Ser Gly Leu Ala Ala Phe Val Ser Met Ala Thr Ala Leu Thr Ala Ser 
35 40 45 

Ala Gin He Ser Phe Gly Gly Glu Pro Leu Ser Phe Ser Ser Arg Ser 

50 55 60 

Ala Gly Thr His Ser Phe Asp Asp Ala Met Thr He Arg Leu Thr Pro 

75 65 70 75 ao 
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Asp Phe A.-.n Pro Glu Asp Leu lie Ala Gin Ser Arg Trp Gin Ser Gin 

Arg Asp Gly Arg Pro Val Arg lie Gly Gin Vai He Pro Val Asp Val 
_ 1 00 2 05 Tin 

5 ASP Phe Ma s« Ly. Ala Ser His He Ser Ser He Gly A^p val A.p 

Val Tyr Arg Leu Gin Phe Lys Leu Glu Gly Ala Lys Alf He Thr Leu 

135 

10 IV5 JlS ■ry-^ 

Thr Pro A.P His Glu 11= Val Leu Gly Ala Tyr Thr Asn Ala Thr hI^ 

170 ITE 

Arg Arg Asn Gly Ala Phe Ala Thr Glu Pro Ve.1 Pro Gly Ser Glu Leu 

15 He Met A^p Tyr Glu Val Ser Arg Gl" Gly Thr Leu Pro ^Ip lie Lys 
Ti 20*^ 205 

He ser Gly Ala Gly Tyr He Phe A^p Lys Val Gly Gly Arg Pro Val 

20 22I ^^'^ *=P Ser Asp Ser Asp Cys 

Glu He Asn He A^n Cys Pro Glu Gly Ala ^p Trp Gin Ala Glu Lys 

250 '7t:E 

A.n Gly Val Val Gin HeL He Met Val Lys Gly Gin Tyr He Ser Met 
25 cy. Ser Gly A.-n Leu Leu Asn As„ Thr Lyr Gly A..p Phe Thr Pro Leu 
He He Ser Ala Gly His Cy^ A^a Ser He Thr Thr lln Phe Gly Val 

30 lol '■^'^ ^'^P ^•'^ 'T*^' «" Tyr Glu Lys 

Arg Gly Cj^s Ser Asn Gly Thr Leu Ala He Arg Gly Asn Ser He 

jZd 

He Gly Ala Ser Met Lys Ala Phe Leu Pro He Lys Gly Lys Ser Asp 
35 Gly Leu Leu Leu Gin Leu Asn Asp G^u Val Pro Leu Arg ^-r Arg Val 

360 

Tyr Tyr A..n Gly Trp Asp Ser Thr Pro Asp He Pro Ser Ser Gly Ala 

40 385 ^" ^^'^ I--" I-y- 

Lys Thr Pro Ala Leu Asn Thr Trp He Ser III Ser Gly Ser Gly G^y 
Thr Asp Asp His Phe Tyr Phe Lys Tyr Asp Gin Gly Gly Thr Glu Gly 
45 Gly Ser Ser Gly Ser Ser Leu Phe ^n Gin Asn Lys His yll Val Gly 



Thr Leu Thr Gly Gly Ala Gly il° Cys Gly Gly Thr G^u Phe Tyr Gly 
50 «| ^" T,^P ^" ■'y- i^P C31y Asn Thr Ser 



Arg Met Asp He Tyr ilu Asp Pro Gin Asn A^n Gly Gin Thr Thr 
48^ 490 AQ^ 

Leu Asn Gly Thr Tyr Arg Asp Gly Tyr Lys Pro Leu Pro Ser wll Pro 

55 Arg Leu Leu Leu Gin Ser Thr Gly l^p Gin Val Glu Leu ^n Trp Thr 



520 



Ala Val Pro Ala Asp Gin Tyr ^^o Ser Ser Tyr Gin v" Glu Tyr His 



60 sis ""^ sVo "^^ '^"^ '•^^ •Tvr 

ser Asp Ala He Asp Glu S=r He He Gly "r Gly He He Arg Tyr 
Glu val ser Ma Arg Phe He Tyr Pro llr Pro Leu Asp Gly vJi Glu 

65 ser Tyr Lys Asp Thr Asp Lys Thr III Ala Asp Leu Ala ifc Gly Asp 
He Gin Thr Lys Leu Lys Pro Asp Val Thr Pro Leu Pro Gly Gly Gly 

70 «s ^" ^he Leu ser G^^ Leu Val Ser Arg 

Phe Gly Glu Ser Pro Asn Pro Val Phe Lys Thr Ph. Glu Val Pro Tyr 
Val ser Ma Ala Ma Ma Gin Thr Pro l^n Pro Pro Val Gly v" Val 

75 He Ma Asp Lys Phe Met Ala Gly Thr Tyr Pro Glu Lys Sa Ala He 
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10 



15 



20 



25 



30 



35 



40 



45 



65 



70 



75 







67 5 










680 










605 








a 


Ala 


Val 


Tyt 


Val 


Met 


Pro 


Ser 


Ala 


Pro 


Asp 


Ser 


Thr 


Phe 


His 


Leu 




690 










695 










7 00 










Phe 


Leu 


Lys 


Sec 


Asn 


Thr 


Asn 


Arg 


Arg 


Leu 


Gin 


Lys 


Val 


Thr 


Thr 


Pro 


7 0S 










710 










715 










720 


Ser 


Asp 


Trp 


Gin 


Ala Gly Thr 


Trp 


Leu 


Arg 


He 


Aen 


Leu 


Asp 


Lys 


Pro 










725 










730 










735 




Phe 


Pro 


Val 


Acn 


Asn 


Asp 


His 


Met 


Leu 


Phe 


Ala 


Gly 


He 


Arg 


Met 


Pro 








740 










74S 










750 






Asn 


Lys 


Tyr 


Lys 


Leu 


Asn 


Arg 


Ala 


He 


Arg 


Tyr 


Val 


Arg 


Asn 


Pro 


Asp 






755 










760 










7 65 








Asn 


Leu 


Phe 


Ser 


Tic 


Thr 


Gly 


Lys 


Lys 


He 


Ser 


Tyr 


Asn 


Asn 


Gly Val 




770 










775 










780 










Ser 


Phe 


Glu 


Gly 


Tyr 


Gly 


He 


Pro 


Ser 


Leu 


Leu 


Gly Tyr 


Met 


Ala 


He 


785 










7 90 










795 










800 


Lys 


Tyr 


Leu 


Val 


Val 


Asn 


Thr 


Asp Ala 


Pro 


T^y? 


He 


Asp Met 


Ser 


Leu 










805 










810 










81S 




Val 


Gin 


Glu 


Pro 


Tyr 


Ala 


Lys 


Gly Thr 


Asn 


Val 


Ala 


Pro 


Phe 


Pro 


Glu 








820 










825 










830 






Lea 


Val 


Gly 


lie 


Tyr Val 


Tyr 


Lys 


Asn 


Gly Thr 


Phe 


He 


Gly 


Thr 


Gin 






835 










840 










845 








Asp 


Pro 


Ser 


Val 


Thr 


Thr 


Tyr 


Ser 


Val 


Ser 


Asp 


Gly Thr 


Glu 


Ser Asp 




850 










B55 










800 










Glu 


Tyr 


Glu 


He 


Lys 


Leu 


Val 


Tyr 


Lys 


Gly 


Ser 


Gly He 


Ser 


Asn 


Gly 


865 










870 










875 










880 


Val 


Ma 


Gin 


He 


Glu 


Asn 


Asn 


Asn 


Ala 


Val 


Val 


Ala 


Tyr 


Pro 


Ser 


Val 










885 










890 










995 




Val 


Thr 


Atp 


Arg 


Phe 


3er 


He 


Lys 


Asn 


Ala 


His 


Met 


Val 


Kis 


Ala 


Al<-i 








900 










905 










910 






Ala 


Leu 


Tyr 


Ser 


Leu Asp Gly Lys Gin Val 


Arg 


Ser 


Trp 


Asn 


Asn 


Leu 






915 










920 










925 








Arg 


Asn 


Gly Val 


Thr 


Phe 


Ser 


Val 


Gin 


Gly 


Leu 


Thr 


Ala 


Gly Thr 


Tyr 




930 










935 










94 0 










Met 


Leu 


Val 


Met 


Gin 


Thr 


Ala 


Asn 


Gly 


Pro 


Val 


Ser 


Gin 


Lys 


He 


Val 


945 










950 










955 










960 


Lys 


Gin 































(2) INFORMATION FOR SEQ ID HO: 337 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 702 amino acids 

(B) TYPE: amino acid 
(Dl TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

50 (vi) ORIGINAL SOURCE: 

(A) ORGAWISM: Porphyromonas gingival is 

Ux) FEATURE: 

(A) HAIIE/KEY: inisc_f eature 
55 <D) LOCATIOti 1...702 



60 





Ixi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 337 






Trp 


Lys 


Lys 


Ser 


Asn 


Pro 


Gin 


Tyr 


Arg 


Gly 


Aig 


Thr 


Ala Asn Tyr 


Arg 


1 








5 










10 






15 




Asp 


His 


Tyr 


Asn 


Thr 


Glu 


Ser 


Tyr 


Leu 


Ser 


Leu 


Arg 


Ary Asn Gin 


Thr 








20 










25 








30 




Met 


l.ys 


Tyr 


Leu 


He 


Arg 


Leu 


Phe 


Leu 


Ser 


Leu 


Met 


Leu Leu Ser 


Leu 






35 










40 










45 




Trp 


Thr 


Gly 


Cy.-? 


Thr 


His 


Glu 


Glu 


Leu 


Ser 


lie 


Cys 


Asp Gly Glu 


Asn 




SO 










55 










60 






Thr 


Leu 


Val 


Leu 


Arg 


Val 


Glu 


Thr 


Gly 


Lys 


AJ a 


Pro 


Asn Ala Arg 


Ala 


€5 










70 










75 






80 


Thr 


Glu 


Pro 


Gly 


Gin 


Gly 


He 


Tyr 


Asn 


Glu 


Asu 


Lys^Val Gly Ser 


He 










85 










90 






95 




Ser 


Val 


Leu 


Phe 


Tyr 


Leu 


Glu 


Gly 


Gin 


Leu 


Arq 


Trp 


Gin Val Lys 


Ser 








100 










105 








110 




Thr 


Asp 


Tyr 


Gin 


He 


His 


Glu 


Gly 


Ala 


Tyr 


He 


He 


Pro Val Lys 


Glu 






115 










120 










125 




Gin 


Met 


Arg 


Pro 


Leu 


Phe 


Asn 


Gly 


Asn 


Astn 


Asn 


Phe 


Ser He Tyr Val 
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130 135 140 

Val Ala Asn Leu Asp Phe Asn Ala Pro Ala Thr Glu Ala Ala Leu Ser 

iSO 155 
Gin Phe Val Val Glu Lys Ser lie Glu Val Ser Ser Thr Thr Ala Pro 
) 165 170 

Ala Asp Ph9 Val Met Leu Ala His Gly Asn Lyc Gin He Asn M-t Ala 

180 185 190 

Thr Thr Glu Gly Lys Leu Leu Gly Asp Tyr Lys Leu Lys Arg Val Ala 

195 200 205 

Ala Lys He Arg Met He Lys Pro Thr He Asn Val Gin Gly Tyr Glu 

210 215 22^^ 

Val Val Gly Asn He Gin Ala Lys Phe Arg Asn Ser Val Thr Lys Gly 
225 230 23S 240 

Phe Leu Thr Thr Glu Ala Gin Glu He Pro Ala Ala Ala Ser Tyr Lys 
^5 245 250 255 

Thr Ser Glu Tyr Leu Asp He Ala Glu Ser Ala Pro Ala Asn Ser II- 

260 2S5 270 

His Phe Tyr Scr Tyr Tyr Asn Lys Trp Thr Lgu Ser Thr Pro Glu Lvs 

275 2S0 285 

Arg Pro Glu Phe Phe He Met Val Lys Phe Lys Lys Thr Gly Gin Pro 

290 235 300 

Asp Asn Thr Ala Lys Pro Tyr Tyr Tyr Axg Val Pro Leu Glu Ser Gin 



10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



70 



305 310 



315 320 



Asp Asn Gin Val Lys Ser Asn Val Leu Tyr Asn Leu Asn Val Lys H- 

325 330 335 

Glu He Leu Gly Ser Leu Gin Glu Pro Glu Ala Val Ser Val Asn Glv 

340 345 350 

Thr Leu Ala He Glu Glu Trp He Leu His Gin Asp Ala Phe Asn Leu 

355 360 365 

Pro Ala Thr Asn Tyr Leu He Val Glu Gin His Glu He Phe Met Asn 

370 375 380 

Asn Val Asn Thr Tyr Ser Val Lys Tyr Gin Thr Ser Gin Lys Pro He 
"^^^ 390 395 

Ser He Ser He Gin Ser Val Thr Phe Ser Tyr Val Ser Ser Asp Glv 

405 410 415 

Thr Gin His Asn Asp Leu Val Ala Ser Ser Ser Asd Gin Tyr Pro Thr 

425 * 4S0 

He Thr Ser Asp A^n Thr Ser He He He Thr Ser Lys He Pro Val 

435 445 
Asn Asn Val Pro Lys Lys Tie Val Phe Glu Val Thr Asn Gly Val Ala 

450 45S 460 

Gly Leu Lys Glu Thr Val Thr Val Leu Gin Tyr Pro Ala Gin Phe He 
470 475 4go 

Val Asn Thr Leu Gly Thr Ala Ser Ala Trp Arg Pro Asp Gly Ser Leu 

495 490 
Ala Pro Gly Leu Asn Asn Lys Ala He Tyr His Val Val Val Leu Val 

500 505 510 

Pro Pro Glu Asn Leu Phs Glu Asp Gly Thr Gin Thr He He Gly Tyr 

515 520 525 

Pro Pro Thr Glu Thr He Ser Phe Hjs Lys Lys Glu Asn Asn Thr Tyr 

"0 535 540 

Pro He Val Trp Ser Asp Thr Asn Thr Thr Lys Gin Asp Leu Glu Thr 
S45 550 555 

Ser Arg Met He Ser Pro Ser Phe Glu Leu Ala Ser Gin Leu Gly Ala 

565 570 575 

Thr Leu Pro Met Pro Tyr Leu Glu Tyr Trp Pro Gly Thr Ser Tyr Leu 

580 ses 590 

Leu Asp Tyr Ser Gly Asn Tyr Asn Asn Lys Arg Tyr Ala Leu Phe Asn 

595 600 605 

Cys Ala Phe Tyr Trp Glu Lys Arg Lys Val Asn Asn Glu Glu He Lvs 

610 615 620 

Phe Asp Asp Trp Arg Leu Pro Thr Glu Ala Glu He Lys Leu He Asp 

630 635 
Lys Leu Gin His Asn Glu Gin Ser Ala Val Gin Ala He Met Thr Glv 

€45 650 655 

Asn Tyr Tyr Trp Aap Ser Tyr Ser Ala Asn Gly Ser Tyr Lys Met Gin 

660 665 670 

Gly Gly Gly Gly Gin Gly Asn Ser Ser Lys Ala T^^r Val Arg Cys Val 

675 680 685 

Arg Asp Val Lys Lys Pro He Arg Asp Lys Lys Ser Gly Lve 
690 695 700 

(2) INFORMATIOTJ FOR SEQ ID HO: 338 

75 (i) SEQUEUCE CHARACTERISTICS: 



SUBSlllU TH SHEET (Rule 26) (RCVAU) 
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10 



15 
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(A) Lfc:NGTH: 13i;> amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

5 (11) HOLECULE TYPE: protein 

(iii) HYPOTHETirAL: YE:> 

(vi) ORIGINAL SOURCE: 

(A) ORGAllIiM: Porphyi omonas glngivalis 

(Ix) FEATURE: 

(A) NAIIE/KEY: mlscf eature 

(B) LOCATION 1...1312 

(xi} SEQUEr^CE DESCRIPTION: SEQ ID NO:338 



Lys Arg Val Asp Ser Tyr Glu Cys Arg Gin Lys Ala Cys Lys Cys Ala 

20 He cys Val lie Gin Lys Phe Thr A.^n Val Lys Leu Asn Asp Met Arg 
20 25 30 

Lys He J.r.u Ser Phe Leu Met Met Cys Ser Leu His Leu Gly Leu Gin 

35 40 45 

25 ^^"^ "^^"^ '^''^ ^" Ala Ala Leu Pro Ser 

^" 55 €0 

lie Gly lie Gin dlu Ser Ser Cys Thr Arg He Thr Ph« Glu Val Val 

''O 75 8c 

Phe Pro Gly Phe Tyr Ser Val Glu Lys Arg Glu Gly Asn Gin Val Ph- 

85 90 95 

Gin Arg Ilo Ser Met Pro Gly Cys G)y Ser Phe Gly Asn Leu Gly Glu 
100 105 110 



30 



50 



j-uo 110 

Ala Glu Leu Pro Val Leu Ly:. Lys Met He Ala Val Pro Glu Phe Ser 
115 125 

35 '^^'^ ^^'^ Thr Glu Thr Phe Asp Asn 

130 135 140 

Tyr Asn He Tyr Pro Asn Pro Thr Tyr Val Val Glu Glu Leu Pro Glu 

150 155 , gQ 

Gly Gly Thr Tyr Leu Val Glu Ala Phe Ala He Asn Asn Asp Tyr Tyr 

40 Ser Gin Asn Val Ser Leu Pro Ser Thr His Tyr Val Tyr Ser Gin Asp 
18t) 185 190 

Gly Tyr Phe Arg Ser Gin Arg Phe He Glu Val Thr Leu Tyr Pro Phe 
195 20O 205 

45 l^n ^""^ ^'"^ "^^^ ^^l*^ Leu Phe Ala Lys Lys He Glu 

210 215 220 

Val Thr He Thr Phe Asp AFn Pro Gin Pro Pro Leu Gin Lys Asn Thr 
230 235 240 

Gly He Phe Asn Lys Val Ala Ser Ser Ala Phe He Asn Tyr Glu Ala 

250 255 
A=p Gly Lys Ser Ala He Glu Asn Asp Met Val Phe Ser Arg Gly Thr 

260 265 270 

Thr Thr Tyr He Ser Gly Asn Val Ala Ser Asn Leu Pro Gin Asn Cys 
2^5 280 28- 

5'^ lo^ '^y'' Asn Val Asn Gin Gin Pro 

290 29S 300 

His Asp Glu He Lys Arg Leu Cys Glu His Arg Ala Phe Tyr Asn Gly 

Phe Asp Val Ala Ala Val Ser He Lys Asp Val Leu Asn Ser Phe Pro 
Rn 330 335 

OU ser Asn Ala Thr Ser Tyr He Asn Glu Thr Lys Leu Lys Asn Phe Tl- 

345 350 
Arg S«r Val Tyr Asn Gin Ser Asn Ala Lys Arg Thr Leu Asp Gly Lys 
JJ>^ 360 

65 ■^^'^ Ser Lys Tyr Leu Ala 

375 390 
A^p Thr Arp Asn Thr Lys Val Pro Thr Ser He His Asn Val Ser 

; 395 400 

Leu He Pro Sor His Pro Thr Phe Gly Ser He Cys Ala Ser Asp Tyr 
7n r.^ "^^^ **10 415 

/U Phe Phe Ser Cys Val Ser Pro Leu Asp Thr Val Gly Asp Leu 
420 425 430 



Phe Phe Ser Cys Val Ser Pro Leu Asp Thr Val Gly Asp Leu Phe He 

'*25 430 

Gly Arg Phe Ser Val Thr Asn Ala His Glu Leu His Asn Leu He Glu 
'*35 440 

75 ^^'^ I5n ^''^ '^^"^ Ala His Lys Asn 

<55 450 
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lie Leu Tyr Ala Giu Gly r.ys Gly Cys A^p Ala Pro lie Leu Arg Leu 

475 460 
Phe Leu Lys Glu Tie Ala Ser Gly Tyr Thr Val Asn Ser He L-u Lys 

) Ser Asn Gin Val Ser Ala Ho Asp Ser He Phe A.^p Cys Leu A^n Asn 
500 505 
Gly Ser Hii- His Phe Tyr Phe Asn Thr His Gly Met Pro Thr Val Tro 

S20 525 
Gly lie Gly C,ln Gly Leu Asp Val Asn Thr Leu Thr Ala Arg L«u Asn 

3J0 535 540 

Asn Thr Ser Ser Gin Gly Leu Cys Thr Ser Leu Ser Cys Ser Ser Ala 
550 

Val Ala A.^p Ser Thr He Arg Ser Leu Gly Glu Val Leu Thr Thr Tyr 

565 570 
Ala Pro Asn Lys Gly Phe Ser Ala Phe Leu Gly Gly Ser Axg Ala Thr 

585 

Gin Tyr Ala Val Tyr Leu Glu Gly Pro Cys Pro Pro Ser Glu Phe Tyr 
595 600 605 



20 



35 



Glu Tyr Leu Pro Tyr Ser Leu Tyr His Asn Leu Ser Thr Val Val Gly 

610 615 620 

Glu Mec Leu Leu Ser Ser He He Asn Thr Asn Ser Val Asp Thr Tyr 
630 €35 e3o 



Ser Lys Phe Asn Phe Asn Leu Leu Gly Asp Pro Ala Leu Asn II 
64 5 650 



e Met 



25 Al. His Gly Met Glu Val Ser Asn Cys He Thr Lou Pro Asn tin Thr 

660 665 

He He Ser Ser Pro He Thr He Lys Asn Gly Gly Cys Leu Lys He 
6/0 680 685 

30 t\a ^^''^ ""'^ ""^^ ^^'^ '"■'^^ ""'y Gin Val 

690 695 700 

Met Ser Gly Gly Thr Leu Glu He Gly Asn Gin Ala Lys He Ser Gly 

715 

Glu Thr Gly Ala Asn Pro Thr Phe He Thr Val Tyr Gly Asp Gly L-u 

''25 730 735 

Ala He Asn Lys Gin Val Glu He Asp Asn He Asp Arg Leu Asn Leu 

7^0 7/15 750 

PhQ Ser Thr His Ser Val Met Pro Lys Phe His Phe Asp Ser Val Lys 

760 765 

40 ^^'^ "^^"^ '^^^ -'^-^ Val Glu He Ser 

770 775 780 

Asn Ci's Glu Phe Thr Asn Arg Ser Asp He He Ser Lys Asn Cys Asp 

795 800 
Leu Ser Val Glu Asn Ser Het Phe Ser Ser Ser Gly He Thr Val Phe 
AC 810 BIS 

Lys Pro Met Ala Thr Ser Ser He Thr Gly Leu Ser Thr Lys Ala Lys 

825 830 
He Thr Asp Asn Thr Phe Phe Ala Thr Gly Asn Phe Ala Tyr His H» 
... ... . S-IO 845 



"50 '"^'^ aV^ '^^'^ ^'^^ '^^^ Asn Ala Ala He Lys L^ 

3U 850 855 960 



eu 



Asp Asn He Pro Glu Tyr Tyr He Ser Gly Asn Lys He Val Asn Cys 

875 flflo 
Asp Glu Ala Leu Val Leu Asn Asn Ser Gly Asn Arg Thr Asn Arg Leu 
cr 890 895 

5b His Asn Ho Thr Arg Asn Val He Lys Asn Cys Arg He Gly Ser Thr 
300 905 91 0 

Leu Tyr Asn Ser Tyr Gly He Tyr Asa Arg Asn Lys He Ser Asn Asn 
915 920 925 

60 "^^ 930 ""'^ "^""^ 935 '^'^ ''^'^ ""''^ '^y^ 

Ala Pro Val He Asn Glu Glu Asp Lys Gin Thr Phe He Ser Asn Arg 

950 055 

Thr Trp Gin I.eu Tyr Ser Ser Acn Cly Thr Phe Pro Lgu Asn Phe His 

CC _ 970 075 

b5 Tyr Asn Ser Leu Gin Gly Gly Asp Thr Asp T>.r Trp He Tyr Asn Asp 

' 985 9Q0 

Thr Tyr Thr A^n Arg Tyr He Asp Val Ser Asn Asn His T^p Gly Asn 

33=> 1000 1005 

70 ^0^0''^'' ^^'^ '^^l.f " ^""^ "^^^ P^^' Asp Leu Pho 

101 5 1020 
He Trp He Pro Phe Trp Asp Gly Leu Pro 



75 



To^s ^"^ ^ ^^'^ Asn Gly Arg Ser Gly Asn 

1030 10J5 1 

Ser Ser Ala Glu Al^ Val Glu Phe Gin Thr Ala Leu Asp Cys He Gly 

1045 1050 10«;5 

Asn Sor Asp Tyr Leu Ser Ala Lys Val Ala Leu Lys Het Met Val Glu 
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1060 106b J 070 

Thr Tyr Pro Glu Scr Asp Pha Ala lie Ala Ala i>eu hys Glu Leu Phe 

1075 1080 1085 

Arg lie Glu Lys Met Ser Gly Asn Asp Tyr Glu Gly Leu Lys Asp Tyr 
5 1090 1095 1100 

Phe Arg Ser Asn Pro Thr He He Ser Ser Gin Asn Leu Phe Pro Thr 
1105 1110 1115 3120 

Ala Asp Phe Leu Ser Ala Arg Cys Asp lis Val Cys Glu Asn Tyr Gin 

1125 1130 1135 

Ser Ala He Asp Trp Tyc Glu Asn Arg Leu Asn Ser GLu He Scr Tyr 

1140 1145 1150 

Gin Asp Ser Vsl Phe Ala Val He Asp Leu Gly Asp He Tyr Trp Asn 

1155 1160 1165 

Met Gin Leu Asp Ser Leu Arg Gly Thr Gly He Asp Leu Asn He Leu 

1170 1175 H80 

ser Cys Glu Gin Arg Lys Ser Leu Glu Ser His Gin Asn Val Lys Asn 
1185 1190 1195 1200 

Tyr Leu Leu Ser Thr Leu Pro Glu Ser Thr Gly Thr Leu Leu Pro Pro 

1205 1210 1215 

Leu Glu Cys Asn Lys Ser Ser Leu Asp Lys Ser Lys He He Ser He 

1220 1225 1230 

Scr Pro Asn Pro Ala Lys Ala Val Val Thr He He Tyr Tyr Thr Asp 

1235 1240 1245 

Asn Pro Ser Cys Ser Val He Lys He Tyr Gly He Asn Gly Ala Ser 

1250 1255 1260 

Ala Asp He The Gly Lou Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser 
1255 1270 1276 1280 

He Gin Fhe Asn Thr Ser Asn Phe Asp Pro Gly Phe Tyr Leu Val Thr 

1285 1290 1295 

Leu Asn Val Asp Gin Lys He He Asp Thr Glu Lys Leu Arg He Lys 
1300 1305 1310 



(2) INFORMATION FOR SEQ ID NO: 339 

tl) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 938 amino acids 

(B) TYPE: amino acid 
(Dl TOPOLOGY: Linear 

(i±) MOLECULE TYPE: protein 



(Hi) HYPm'HETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAT^ISM: Porphyroraonas gingival is 

(iX) FEATURE; 

{A> HAJ-IE/KEY: misc_feature 
tB) LOCATION 1. . .938 

(xl) SEQUEIJCE DESCRIPTXOM: SEQ ID MO: 339 



Ser 


Glu 


Asn 


Tyr 


Arg 


Tyr 


Gly 


Lys 


He 


Thr 


Asn 


Gin 


He 


Met 


Ala 


He 


1 








5 










10 










15 




Met 


Met 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 








20 










25 










30 






Trp Ala 


Ala 


lift 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly 


Met 


Asn 






35 










40 










45 








Ala 


Ser 


Cys 


Lea 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 




50 










55 










60 










Ser 


Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 


He 


Asp Als 


65 










70 










75 










80 


Asp 


Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val 










85 










90 










95 




Pro 


Tyr 


Gly 


His 


Asn 


Gly 


Leu 


Cys 


Thr 


Tyr 


Ser 


His 


He 


Arg 


Ser 


Gly 








100 










105 










110 






He 


Ser 


Thr 


Ala 


Gly 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Arn 


He 


Glu 


Gly Ala 






115 








120 










125 








Lys 


Arg 


Val 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu 




130 










135 










14"0 










His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 


145 








150 










155 










160 


Val 


Leu 


Leu 


Phe 


Asp 


Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val 










165 










170 










175 




Trp 


Arg 


Arg 


Arg 


He 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


He 


Ala 
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15 



20 



— xo J 1 go 

Trp Arc, His Tyr Lys Val Thr Asp Ser His Thr Glu Phe Leu Ly= l.eu 

200 205 

5 ^f? ^" P~ P-^o Ala Thr 

^ 215 220 

A^p Phe Thr Vai Asn He Gly Gin Asn Val Gly Arg Leu Thr Trp 

230 235 7do 

A^n Tyr Pro Glu A^p Tyr Gin Pro Glu Gly Ly^ Gly Asn Glu Glu Leu 
in 250 255 

10 Gin Leu Ser Gly Tyr Asn He Tyr Ala Asn Gly Thr Leu Lea Ala Gin 
^, , 265 270 

He Lys A.^p Val Ser He Leu Glu Tyr Val A^p Ser Thr Tyr Ser Leu 

SI" Tyr Cys Val Thr A^a Vai Tyr Asp 
■^•'^ 295 ^ 

Glu Ser He Glu Ser Ser Thr Val Cys Gly Thr Leu His Tyr Ala Thr 

310 3j^5 - 

Asp Ala He Leu Tyr Glu Asn Phe Glu Asn Gly Pro Val Pro Asn Gly 

325 335 
Trp Leu Val He Asp Ala Asp Gly Asp Gly Phe Ser Trp Gly His Tyr 

340 3.15 

Tyr Asp Ala Phe Pro 

' 360 " 365 



Leu Asn Al. Tyr Asp Ala Phe Pro Gly His Asn Gly Gly His Cys Ser 

•^^^ ' ^60 365 

Leu Ser Ala Ser Tyr Vai Pro Gly He Gly Pro Val Thr Pro Asp A^n 



25 370 



' -' 380 

Tyr Leu He Thr Pro Lys Val Glu Gly Ala Lys Arg Val Lys Tyr Trp 
390 395 A Cin 

Val Sor Thr Gin Asp Ala Asn Trp Ala Ala Glu His Tyr Ala Val Het 

4 05 ail 

Ala ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe Glu 
420 



on 410 aiei 

30 Ala ser Thr Thr Gly Thr Ala Val Gly Asp Phe Val He Leu Phe 
420 425 

Glu Thr Met Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Arg Thr H 

35 4IS ""'^ ''^'^ 4^ '"^^ "^^P cys 

Thr TKcp He Tyr Phe Leu Lys Leu Asp Asp He Thr Val Phe Gly Thr 
Pro Ala Ser Glu Pro Glu Pro Val Thr Asp Phe Val Val Ser Leu He 

40 Glu Asn Asn Lys Gly Arg Leu Lys Trp ^n Tyr Pro Asn Gly ^yr Glu 

505 

Pro Asp Lys Thr Asp Asp Lys Asp Pro T.ou Gin Leu Ala Gly Tyr Asn 

520 

45 I30 ""^^ ""^^ '^^'^ 5I5 ^""^ Thr Val 

Leu Glu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Gin Val Glu 

Val Glu Tyr Cys Val Thr Ala Val Tyr Asn Asp Asn He Glu Ser Gin 



KA 570 575 

50 ser val cys Asp Lys L«u He Tyr Asp Ser Gin Ser Asp He lie Leu 

58S 590 
Tyr Glu Gly P|,e Glu Ala Gly Ser He Pro Glu Gly Trp Leu Leu He 

600 go5 
55 ""^^ fxl ""^^ ""^^ ""^P "^ll '^P Tyr Pro Trp Thr Met 

Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser tlo Ser Tyr Leu Pro 

Met He Gly Vai Leu Thr Pro Asp Asn Tyr Leu Vai Thr Pro Arg Itu 
an 650 fitic 

60 Glu Gly Ala Lys Leu Val Lys Tyr Trp Val Ser Ala Gin Asp Ala Val 

665 

Tyr Ser Ala Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 

680 

65 ''"^ Ho ""^ Thr Met Thr Ala Lys Ala 

695 700 
Asn Gly Ala Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys 

Tyr 11= Ala Trp Arg His Tyr Asp Cys Thr ^p Met Phe Phe Leu Leu 
7n ^ "^30 73c 

/U Leu Asp Asp He Thr Val Tyr Arg Ser Thr Glu Thr^Val Pro Glu Pro 

745 7S0 

val Thr Asp Phe Val Val Ser Leu He Glu Asn Asn Lys Gly Arg L-u 

75 i?o ^" T**-^ Asp Asp Lys 
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hys 


Pro 






Thr 


Gly 


Tyr Asn 


He 


Tyr 


Ala 


Asn Gly 


Ser 


Leu 


785 








790 








795 








800 


Leu 


Val 


His 


Tie Gin 


Asp 


Pro 


Thr Vul 


Leu 


Glu 


Tyr 


He Asp Glu 


Thr 








80S 








810 








815 




Tyr 


Ser 


Ser 


Arg Asp 


Gly Gin 


Val Glu 


Met 


Glu 


Tyr 


Cys Val 


Thr 


Ala 








820 






825 








830 






Val 


Tyr 


Asn 
835 


Asp Asn 


lie 


Glu 


Ser Gin 
8^0 


Ser 


Val 


Cys 


Asp Lys 
845 


Leu 


Asn 


Tyr Thr 


lie 


Thr Ser 


I.su 


A-p 


Asn lie 


Gin 


Ser 


Asp 


Thr Ser 


Leu 


Lys 




850 








8S5 








860 








lie 


Tyr 


Pro 


Asn Pro 


Ala 


Ser 


Tyr Val 


Val 


Arg 


He 


Glu Gly 


Leu 


Ser 


665 








870 






87 5 








880 


Arg 


Ser 


Lys 


Ser Thr 


He 


Glu 


Leu Tyr Asn 


Ala 


Leu 


Gly He 


Cys 


He 








885 








890 








895 




T.eit 


Arg 


GXu 


Glu Thr 


His 


Ser 


Glu Lys 


Thr 


Glu 


He 


Asp Va] 


Ser Arg 








900 






905 








910 






Leu 


Asn 


Asp 


Gly Val 


Tyr 


Leu 


lie Lys 


Val 


Val 


Gly Gly Asn 


Lys 


Thr 






915 








920 








925 






Thr 


Thr 
930 


Glu 


Lys Val 


Glu 


Tie 
935 


Lys Arg 


Pro 













(2) lUrORIlATIOM FOR SEQ ID NO: 34 0 

li) SEQUEilCE. CHARACr ERISTICS: 

1A> LEHGTH: 60€ amino acids 
(B) TYPE: amino acid 
(D} TOPOLOGY: Linear 

(ii) MOLECULE TYPF.: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISI-l: Porphyromonas gingivalls 

t ix ) FEATURE : 

(A) HAI-IE/K5Y: mis cofeature 

(B) LOCATIOH 1 . - -606 



(xll SEQUENCE DESCRIPTIOM: SEQ ID NO: 340 



He 


He 


Phe 


Cys 


Thr 


He 


His 


His 


Ser 


Glu 


Leu 


Glu 


He 


Met 


Asn 


Ser 


1 








5 










10 










15 




He 


Met 


Lys 


Tyr 


Gin 


Lsu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


Ala 


Leu 


Ser 


Val 








20 








25 










30 






Ser 


S-r 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 


Lys 


Arg 


Pro 






35 










40 










45 








Asp 


Thr 


Leu 


Arg 


Arg 


Glu 


Leu 


Thr 


He 


Val 


Asn 


Asp 


Gin 


Thr 


Val 


Glu 




50 










55 










60 










Met 


Glu 


His 


Ala 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu 


Pro 


65 










70 










75 










eo 


Arg 


Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


Phe 


Gly 


Phe 










85 










90 










95 




Val 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


A^n 


Leu 


Pro 


Asn 


He 


Leu 








100 










105 










110 






Pro 


Thr 


Glu 


Gly 


His 


Met 


Lys 


His 


Arg 


Gly 


Tyr 


Leu 


Aen 


He 


Gly He 






115 










120 








125 








Gly His 


Thr 


Lgu 


Asn 


Gin 


Arg 


Met. 


Asp 


Ala 


Gly Tyr 


Arg 


Leu 


He Asp 




130 










135 










140 










Ala 


Glu 


Gin 


Glu 


Arg 


Leu 


Aen 


l.eu 


Phe 


Leu 


Ser Ti'r 


Arg 


Gly Met 


Lys 


145 










150 










155 










160 


Ser 


Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp 


Arg 


Axg 










165 










170 










175 




Met 


I Let 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val 


Leu 








180 










185 










L90 






Ala 


Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


Asn 


Hie 


-Tyr 






-Ksn 


Tyr Gly 


Arg 






195 










200 










205 








Gly Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 




210 










215 










22Q. 










Thr 


Pro 


Gin 


Met 


Asp 


Asn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu 


Gly 


225 










230 










235 










240 


Ala 


Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


lie 


Arp 


Tyr 


Arg 


Phe 


Phe Arg 










245 










250 










255 




Ser 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Met 


Lys 


Ala 


Leu 


Thr 


Glu 


His 



260 265 270 
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Thr Pro Glu Lci: Asn Val Ihi: Met Ser Asn Glu Leu Ser Asp A^p lie 

Lys Leu Gly Val Glu Val Arg Thr Gly Gly Leu Ph« Phe Ala Lys A.n 

c 2 95 300 

3ol "° IiS Lou 

Tyr Tyr Val Glu Gly Ala Pro Thr II, Gly P^e Val Gly Asp s.r A^p 

330 ^-ic- 

Asn net Gin Trp A»n He Gin Ala Gly Val Gly 11= Ser Ser Hl= Phe 
^ 345 



15 



20 



25 



30 



40 



Glv Ala Lys Gly Arg Leu Phe Phe Trp Pro Lys Leu A^p Ala Ser Leu 

360 365 
Ser lie Phe Pro Ser Trp Ary Val Tyr Ala Lys Ala Phe Gly Gly Val 

375 33Q 
lie Arg Asn Gly Leu Ala Asp Val Met Gin Glu Glu Met Pro Tyr Leu 

Met Pro Asn Thr lie v;i Leu Pro Ser Arg A^n Ala Leu Thr Ala Gl^n 

^ 410 ,11 = 

Leu Gly val Lys Gly Asn lie Ala Asp Val Val Arg M-t Glu Val Tyr 

Gly Asp Pho ser Lys Leu Thr Gly v" Pro Phe Tyt Thr Pro Thr Leu 

Pro Leu Tyr Asn Pro Ser Asp tl° Tyr Gin Tyr Asn Vaf Ser Phe Leu 



Pro He Tyr Ala Asp Gly s"er Arg Trp Arg Ala g" Gly Lys I.eu Glu 



Tyr Ser Tyr Arg Asp Hot L^u Arg Phe Leu V^r Asp Ala Ser Tyr GlJ 



4 90 



Ly^ Trp Asn Lou A.p Gly Gly Leu Val Ala Ser Met Gin Pro A^p Leu 

505 51 n 

He Leu Lys Ala Glu Val Gly Val His Pro He Ala Pro Leu Asp Val 

Arg Leu Arg Tyr Thr Gin Leu Asn Gly Arg Tyr Arg Tyr Ser Phe Gly 

35 sor Ala Gly Ser Glu Ala Leu Gly He Gly Asn VaJ His Leu Leu Ser 

Ala Asp Val Ser Tyr Lys Leu Lys Lys Asn "u Ser Leu Tyr Leu Lys 

He Asp Asn Met Leu Ala Glu Thr Thr Glu Leu He Gly Tyr Tyr Pro 

Met Gin Pro Phe His Cys Phe Ala Gly Phe Ser Trp Thr Phe 



600 605 
{2) INFOWIATIOH FOR SEQ ID NO: 341 

(i) SEQ{JBlfCE CHARACTERISTICS: 

(A) LENGTH: 357 amino acids 
(BJ TYPE: amino acid 
5Q (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

I ill) HYPOrrilETICAL: YES 

55 (vi) ORIGIHAL SOURCE: 

{A) ORGAIUSM: Porphyromonas glnglvalls 
lix) FEATURE: 
en NAME/KEY: misc feature 

°" <B] LOCATION 1...357 

(xi) SEQUENCE DESCRIPTIOIJ: SEQ ID NO:341 

Asn ser Hi. Ala Thr Val Ser Val II. Cy:. Ser Met Met Glu Lys Cys 

lie Phe Ala Hi. Tyr Pro His A.n Leu Val Phe Met He Arg ^y. Hi. 

Phe Gly lie He Leu Gly Phe Leu Ser Leu Val Phe Ser aL Gly Ala 

70 Gin Gin Glu Ly3 Gin Val Phe Hl^s Phe Leu A.n Leu Pro Ala Thr Ala 

60 

Gin Ala Leu Ala Ala Gly Gly Lys Ala He Thr He Val Asp Asp Asn 
Pro Gly Leu Ala Phe Glu Asn Pro Ala Leu Gly Tyr Glu Ser Gi°y 
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10 



15 



20 



25 



30 



35 



55 



60 



65 



70 



75 



Gly Arg 


Ala 


Phe 


Leu 


Ser Tyr 


Leu 


Tyc 


Tyr 


Met 


Ser 


Gly 


Ser 


His 


Met 








100 








105 










110 






Gly 


Asn 




Cys 


Tyr Ala Ser 


Ser 


Val 


Gly 


Glu 


Arg 


Gly 


Met 


Trp 


Gly 






lie 

115 








120 










125 








Val 


Gly 


Met 


Arg 


Phe 


Leu Asn 


Tyr 


Gly Ser 


Met 


Gin 


Gly 


j Vr 


Asp 


Gin 




130 








135 










140 










Acn 


Ala 


lie 


Ala 


Thr 


Gly Ser 


Phe 


Ser 


Ala 


Ser 


As^p He 


Ala 


Val 


Gin 


143 










150 








155 










160 


Gly 


Phe 


Tyc 


Ser 


His 


Glu Leu 


Ser 


Asn 


His 


Phe 


Arq Gly 


Gly 


Val 


Ser 










165 








no 










lis 




Lftu 


Lys 


Ala 


Leu 


Tyr 


Ser Ser 


lie 


Glu 


Thr 


Tyr 


Ser 


Ser 


Phe 


Gly Leu 








180 








IftS 










190 






Gly 


Val 


Aiip 
19b 


Val 


Gly 


lie Ser 


Tyr 
200 


Tyr 


Asp 


Asp 


Asp 


Lys 
205 


Gly 


Tyr 


S«r 


Ala 


Ser 


Ma 


Leu 


Phe 


Lys Asn Val 


Gly 


Ala 


Gin 


Leu 


Lys 


Gly 


Tyr 


Asn 




210 








215 










220 










Giu 


Glu 


Arg 


Glu 


Pro 


Leu Asp Trp Asp 


Phe 


Gin 


Leu Gly 


Phe 


Ser 


Arg 


225 










230 








235 










240 


.Ser 


Phe 


He 


A^n 


Ala 
245 


Pro Pha 


Arg 


Leu 


His 
250 


lie 


Thr 


Leu 


Phe 


Asn 
255 


Leu 


Asn 


Pro 


His 


Tyr 
260 


Phe 


Lys Arg 


Lsu 


Val 
265 


Pro 


Arg 


Asp 


Leu 


Sec 

2"7 0 


Lys 


Met 


G] n 


Lys 


Pha 


Leu 


Arg 


His Phe 


Ser 


He 


Gly Ala 


Glu 


Phe 


Thr 


Pro 


Ser 






275 








280 










285 








Glu 


Arg 


Phe 


Trp Val 


Gly Leu 


Gly 


Tyr 


Thr 


Pro 


Gin 


He 


Ala 


Gin 


Asp 




290 








295 










300 










Phe 


Glu 


Val 


Glu 


Gly Gly Pisn 


Lys 


Trp 


Gly 


Gly 


Leu 


Ser 


Ala 


Gly 


Val 


305 










310 








315 










320 


Gly 


Ph« 


Thr 


Ser Gly 


Val Val 


Arg 


Val 


Gly Val 


Ser 


Ala 


Ala 


Thr 


Tyr 










325 








330 










335 




His 


Pro 


Ala 


Ala 


Leu 


Ser Phe 


Ket 


Cys 


Ser 


Val 


Gly He 


Arg 


Leu 


Asp 








340 








345 










350 






Asp 


Lys 


Ser 
3S5 


He 


Phe 























(Z( IWFORHATION FOR SEQ ID NOi342 



|i) SfcQUEnCE CHARACTERISTICS: 

(A) LENGTH: 337 amino acids 
40 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

45 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

<A) ORGAiaSH: Po rphyromanas gingival is 

50 (ix) FEATltRE: 

(A) UmC/KEY: inijfc_f eature 
(Q) LOCATION 1. . .337 



(Xl) SEQUENCE DESCRIPTIOM: SEQ ID HO:342 



Pro 


Gin 


Pro 


Val Gly 


Leu 


Lys 


Glu 


He 


The 


He 


Lys 


Pro 


Mot 


Cys 


Leu 


1 






5 










10 










15 




Glu 


Pro 


He 


He Ala 


Pro 


He 


Ser 


Ser 


Glu 


Leu 


Leu 


Glu 


Gin 


Glu 


Leu 








20 








25 










30 






Thr 


Ala 


Aap 


Arg Phe 


Leu 


Arq 


Met 


Thr 


Asn 


Lys 


Ala 


Gly 


Asn 


Glu 


He 






35 








40 










45 








Tyr Val 


Phe 


Thr Ala 


Glu 


Glu 


Ala 


Pro 


His 


Cys 


Met 


Lys 


Glu 


Val 


Gly 




50 








55 










60 










Arg 


Leu 


Arg 


Glu Glu 


Ala 


Phe 


Arg 


His 


Tyr 


Gly 


Gly 


Gly 


Thr 


Gly 


Lys 


65 








70 










75 










80 


Ala 


He 


Asp 


He Asp Glu 


Phe 


Asp 


Thr 


Met 


Pro 


Gly 


Ser 


Tyr 


Lys 


Gin 








85 










90 










95 




Leu 


He 


Val 


Trp A£p 


Pro 


Gin 


Asn 


Lys 


Ala 


He 


Leu 


Gly 


Gly 


Tyr 


Arg 








100 








105 










110 






Phe 


He 


Tyr 


Gly Arg 


Asp 


Val 


Ala 


Phe 


Asp 


Thr 


As'p 


Gly 


Lys 


Pro 


Leu 






115 








120 










125 








Leu 


Ala 


Thr 


Ala Glu 


Met 


Phe 


Arg 


Phe 


Set 


Asp 


Ala 


Phe 


Leu 


His 


Asp 




130 








13S 










140 










Tyr 


Leu 


Pro 


Tyr Thr 


Val 


Glu 


Leu 


Gly 


Arg 


Ser 


Phe 


Val 


Ser 


Leu 


Gin 


145 








150 










155 










160 
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10 



20 



25 



30 



40 



Tyr Gin Ser Thr Arg Met G3 y Thr Lys Ala He Phe Val Leu A^p jKsn 

lie Gly Ala Lea 

-J- Lys Val Thr Met 

200 205 



Leu Trp A.p Gly ill Gly Ala Lea Thr Va^ Val A^n Pro Glu Ala Leu 
Tyr The Tyr Gly Ly. Val Thr H« Tyr Lys Asp Tyr Asp Arg Arg Ala 



Arg Asn Lou 11= Leu Tyr Phe Leu Arg Lys His Phe Ser Asp Pro Glu 

Gly Leu Val Lys Pro lie His Pro Leu Pro He Ifu He Ser Ala Glu 

ZJO 235 
Asp Glu Ala Leu Phe Ser Ser Ser Asp Phe Asp Thr Asn Tyr Lys T^r 

250 ot^ 
Leu A.n He Glu Val Ary Lys Leu Gly He A^n He Pro Pro Leu Val 

260 265 270 

Ser Ala Tyr He Ala Leu Ser Pro Glu Met Arg Val Phe Gly Thr Ala 

Val A.n Glu Ser Phe Gly Glu vll Glu Glu Thr Gly He Phe Tie Ala 
Val Gly Lys He Leu Glu gIu Lys Lys Gin Arg »ls He Glu Ser Phe 
He Leu Ser Arg Asn Glu Lya Lys Gly Leu Asp Ser Ser Asn Gly 
Ser 335 



(2, IMFORltATION FOR SEQ ID HO: 343 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 566 amino acids 

(B) TYPt:: amino acid 
(D) TOPOLOGY: linear 



(11} MOLECULE TYPE: protein 
35 (iii) HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 
lix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .566 



^5 SEQUENCE DESCRIPTION: SEQ ID NO:343 

Tyr Asp Gly Ala Arg Leu Val Tyr Thr Leu Phe Arg Asn Arg Asn Asp 



H« His Pro Met Lys Thr He Val Arg Tyr ..er Arg Leu Pro vfl Al. 
50 I^u Phe Phe Cys Leu Leu Gly Ala Val His Leu Ser Val Glu Ala Gin 



Met Leu Asn Thr Pro Phe Glu l;u Sec Asp Gin He Val Leu Ser Pro 

55 Is"" '^^ ■'^^ ^'^ =1" Thr Lys Glu Lys Arg 

Gly Ala Asp Leu Phe Pro Leu Ser Asp Lys Leu Arg Asp Ser Ala ?yr 



val Arg Phe Gly Ser Ala Tyr Gly Asp lie Ala Gly Asp Tj^r Leu Pro 

60 Tyr Asn Gly Asn Asn Tyr Ser Ser Le! Set Leu Glu Ser My Gly Arg 
T 120 1 9 c ^ 

He ser Val Arg Asn Tyr Gly Thr Leu Gin Gly Ser Ala Ser Tyr Ser 

65 "-^^ tlo Arg Asn Ala Glu 

160 



Ala Tyr Tyr Pro Tyr i^u Val Ser Asp Ser ihr Gly Gly Asp Tyr "s 



Phe Glu Asp Tyr Arg Leu Ala Gly Tyr Tyr Ser Phe Arg Ala g" Arg 
70 Leu Pro Leu Gly He Gly Phe Ser Tyf Arg Gly Glu Val ^fa Tyr Arg 

Leu Thr Asp Pro Arg Thr Thr Asn Thr Thr Gly Ala Leu Glu Leu Ser 

215 220 

75 2^ ''''' '^^ ^« 
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15 



20 



25 
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Ala Ala Tyr Leu Tyr His Arg Gin His Leu Thr Gin Tyr Asn Trp Arg 

250 255 
^ro Gly Gin Gin Asp Lys Phe Phe Val Ser Tyr Gly Phe Gly GIm Val 
c 265 270 

t3 Asp Val ser Arn Ser Pro lie Trp Phe Gly He S^r Arg Met Asn Tyr 
2^5 280 285 

Val Asn Gly Trp Lys Leu Ser Ser Arg Leu Asp Thr Arg Arg Gly Asp 
290 295 300 

in ?ic ^^"^ '^^'^ ^^'^ "^y^ Leu Asp Thr Glu Glu Arg 

-lU 305 310 

Ser Ser lie Asn Leu Phe Ala Leu Leu Tyr Asn Arg Leu Arg L-u Tyr 

^25 330 335 

Gly Ser Trp His Leu Scr Asp Phe Asp Phe Ser Phe Ser Ala Asp Tyr 

345 350 
Ala Leu Arg Gin Gly He Glu Arg lie Tyr Glu Asp Tyr Lys Pro Asp 

360 36S 
Asp Asn Tyr His lie Tyr Asp Leu Arg He Leu Ala lie Arg Arg Trp 

375 380 
Tyr Met Leu A.sn Glu Phe Ser Ala Gin Ala Gin Ala Ser Tyr Arg He 

390 395 ^ 

Arg Thr Asp Arg Gly Cys Ala Leu Arg Val Ser Ala Gly Ser Asp Phe 

405 4io 415 

Tyr Gly Tyr A^p Glu Thr Tyr Arg Lys His Gly His His Thr Met Ser 

^20 425 430 

Gly Met Leu Arg Pro Phe Ala Gly He Ala Tyr Asp His Ala Gly S-r 

435 44 0 4. J 5 

Lys Leu Asp Phe Gly Leu Ser Leu Ser Ala Ala Tyr Arg Hyt Val Leu 
450 455 

qn T^c "^^ ^^"^ "^'-^'^ ^^"^ "^^^ "^^^ ^51n Lys Glu Gin Leu Asp Tyr 

475 480 
Gin Leu Ala Tyr Leu Pro Tyr Ala Tyr Arg Asn Arg Glu Gly Val Glu 

485 490 
Val Arg Ser Ser Leu Tyr Val Ser He Pro Met Gin Asn Thr His Arg 
Qc ^^^^ 505 510 

J:J Leu Met Thr Glu Leu Arg Leu Tyr Gly Asp Leu Met Lys Arg Lys Asp 
SIS 520 525 

Gly He Ala Tyr Gly Lys Thr Pro Gly Val He Ser His He Leu Ser 
530 535 540 

AD t^S ^" '^^'^ 'i'hr He Gly Ala He Cys 

S5S 560 

Asn He Ser Tyr Leu Phe 
565 

(2) IHFORMATIOtJ FOR StLQ ID HO: 34 4 

(i) SKQUEtlCE CHARACTER I ST ICS: 

(A) LENGTH: 819 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il» HOLEC'JLE TYPE: protein 

(iii) HYPOTHETICAL: YES 

55 tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



45 



50 



60 



70 



(ix) fXATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .aTg 

(xi> SEQUENCE DESCRIPT10^^: SEQ ID NO: 34 4 



65 i'''^ ^^'^ 10^ ^'^^ '^^^ 



Ala S-r Cys Ser Asn Arg Asn Ala Trp Met Ala Glu Asn Lys Pro Ser 

25 35 

ser Pro Glu Pro Asp Asn Thr Gly Val Gly Asn Ser Pro Sor Asp Tyr 

35 40 45 

Leu Ills Gly Glu Ala He He Pro Pro Leu Ser Ser'ieu Ser Asn Ph« 

t , 60 

Asn Asp Lys Arg Phe Met Lys Lys Leu His Met He Ala Ala Leu Ala 

Val Leu Pro Phe Cys Leu Thr Ala Gin Ala Pro Val Ser Asn Ser IL 

85 90 Qc 
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He Asp Ssr Leu Ser Acn Vai Gin Leu Gin Thr Val Gin Val Val Ala 

100 105 no 

Thr Arg Ala Thr Ala Lys Tljr Pro Val Ala Tyr Thr Asn Val Arg Ly«i 
- lis 120 125 

5 Ala Glu Leu Ser Lys Scr Asn Tyr Gly Arg Asp Tie Pro Tyr Leu Leu 
130 135 140 

MeL Leu Thr Pro Ser Val Val Ala Thr Ser Asp Ala Gly Thr Gly Tie 
143 150 155 

Gly Tyr Ser Gly Phe Arg Val Arg Gly Thr Atp Ala Asn Arg He Asn 
lU 165 170 175 

He Thr Thr Asn Gly Val Pro Leu Asn Asp Ser Glu Ser Gin Ser Val 

180 185 190 

Phe Trp Val Asn Met Pro Asp Phe Ala Ser Ser He Glu Asp Leu Gin 

19S 200 20S 

Val Gin Arg Gly Val Gly Thr Ser Thr Asn Gly Ala Gly Ala Phe Gly 

210 215 220 

Ala Ser Val Asn Met Arg Thr Asp Asn Leu Gly Leu Ala Pro Tyr Gly 
225 230 235 240 

Arg Val Asp Leu Ser Gly Gly Ser Phe Gly Thr Phe Arg Arg Ser Val 

245 250 255 

Lys Leu Gly Ser Gly Arg He Gly Arg His Trp Ala Val Asp Ala Arg 

260 265 270 

Leu Ser Lys He Gly Ser Asp Gly Tyr Val Asp Arg Gly Ser Val A^p 

275 280 295 

L»u Lys Ser Tyr Phe Ala Gin Val Gly Tyr Phe Gly Ser Asn Thr Ala 

250 295 300 

Lsu Arg Phe Tie Thr Phe Gly Gly Lys Glu Val Thr Gly He Ala Trp 
310 315 320 

7\sn Gly Leu Ser Lys Glu Asp Glu Ala Lys Tyr Gly Arg Arg Tyr Asn 

325 330 
Ser Ala Gly Leu Met Tyr Val Asp Ala Gin Gly Val Pro His Tyr Tyr 

340 345 350 

His Asn Thr Asp Asn Tyr Glu Gin Arg His Tyr His Ala He Met Thr 

360 . 365 

His Scr Phe Ser Pro Ser Val He Leu Asn Leu Thr Ala His Tyr Thr 

370 375 380 

Ala Gly Tyr Gly Tyr Thr Asp Glu Tyr Arg Thr Gly Arg Lys Leu Lys 
390 395 400 

Glu Tyr Ala Lsu Gin Pro Tyr Val Glu Asn Ser Val Thr Val Lys Lys 
^ 405 410 415 

Thr Asp Leu He Arg Gin Lys Tyr Leu Asp Asn Asp Phe Gly Gly Leu 

420 425 430 

He Gly Ser Leu Asn Trp His Thr Gly Ala Trp Asp Leu Gin Ph- Gly 

435 440 445 

Ala Ser Gly Asn He Tyr Lys Gly Asp His Phe Gly Arg He Thr Tyr 

450 455 460 

lie Lys Ly- Tyr Asn Gin Pro Leu Ala Pro Asp Phe Glu Tyr Tyr Ara 
465 470 475 480 

Asn Arg Ala Asp Lys Arg Glu Gly Ala Ala Phe Ala Lys Ala Asn Trp 

485 490 495 

Gin He Thr Pro Glu Leu Asn Met Tyr Ala Asp Leu Gin Tyr Arg Thr 

500 505 510 

He Gly Tyr Thr He Asn Gly He Thr Asp Glu Tyr Asp Glu Val Gin 

515 520 525 

Gly Ser Met Gin His He Asp Leu Asp Lys Thr Phe Arg Phe Leu Asn 

530 535 540 

Pro Lys Ala Gly Leu Thr Tyr Ser Phe Asp Asp Ala His Thr Ala Tyr 
545 550 555 

Ala Ser Val Ala Val Ala His Arg Glu Pro Asn Arg Thr Asn Tyr Thr 

565 570 575 

Glu Ala Gly Ho Gly Gin Tyr Pro Thr Pro Glu Arg Leu He Asp Tyr 

580 585 590 

Glu Leu Gly Tyr Arg Tyr Ala Ser Pro Leu Leu Ser Ala Gly Val Gly 

595 600 605 

Leu Tyr Tyr Met Gin Tyr Lys Asp Gin Leu Val Leu Asp Gly Arq Leu 

610 615 620 

Scr Asp Val Gly Gin Met Lsu Thr Ser Asn Val Pro Asp Ser Tyr Arg 
625 630 635 640 

Met Gly Leu Glu Leu Thr Leu Gly Trp Gin He Leu Pro Arg Leu Leu 

645 650 655 

Arg Trp Asp Ala Ser Phe Thr Met Ser Arg Asn Lys He Asp Arg Tyr 

660 665 670 

Val Gin Tyr Thr Ser Val Tyr Asp Ala Asp Tyr Asn Trp Leu Glu L-u 

€■^5 680 685 

Lys Glu Glu Thr Leu Glu Ser Thr Asp He Ala Tyr Ser Pro Asn Val 



15 



20 



25 



30 



35 



45 



50 



55 



60 



65 



70 



75 
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10 



15 



20 



25 



50 



55 



60 



65 



70 



75 





690 








695 










700 








lie 


Ala 


Gly Ser 


Met 


Leu 


Thr 


Leu 


Ser 


His 


Ala 


Gly Phe 


Glu 


Met 


Ala 


705 






710 










715 








720 


Trp 


Thr 


Ser Axg 


Phe 


Vai 


Ser 


Lys 


Gin 


Tyr 


Leu 


Asp Asn 


Thr 


Gin 


Arg 






725 










730 








735 




Ser 


Asp 


Arg Met 


Leu 


Ser 


Ser 


Tyr 


Trp 


Vai 


A^n 


Asp Leu 


Arg 


Leu 


Gly 






740 










745 








750 






Ty-r 


Val 


Leu Pro 


Val 


His 


Phe 


Val 


Lys 


Arg 


Val 


Ala Leu 


Gly Val 


Gin 






755 








760 








765 








Leu 


Asn 


Asa Leu 


Phe 


Asn 


Leu 


Met 


Tyr 


Ala 


Ser 


Ai»n Ala 


Tyr 


He 


Tyr 




770 








775 










780 








PiSp Ala 


Gly Tyr 


Val 


Gin 


Ala 


Ser 


Gly 


Glu 


Leu 


S-r Ala 


Tyr 


Aia 


Asp 


785 








790 










795 








aoo 


Leu 


Arg 


Tyr Tyr 


Pro 


Gin 


Ala 


Gly 


Phe 


Asn 


Aia 


Leu Gly Scr 


Leu 


Thr 








OOb 










aio 








815 




He 


Asp 


Phe 

























(2) IIlFOfU'lATIOM FOR SEQ ID NO: 345 

ti) SEQUEtlCE CHARACTERISTICS: 

lAi LENGTH: 532 amino acids 
(B) TYPE; amino acid 
(D» TOPOLOGY: linear 

(iij MOLECULE TYPE: pcotein 

(ill J HYPOTHETICAL: YES 

30 <vi) ORIGINAL SOURCF,: 

(Al ORGANISM; Porphyromonas gingivaiis 

( iX ) FEATURE : 

(A) NAt>lE/ICF.Y: misc_featuce 
35 (B) LOCATION l.,,532 

(xi( SEQUENCE DESCRIPTION: SEQ ID NO:345 

40 



45 



Tyr 


Arg 


Arg 


Ser 


Gly 


Arg 


Val 


Cys 


Pro 


Arg 


He 


Leu 


Arg 


Asn 


Lys 


Arg 


1 








5 










10 










IS 




Ser 


Tyr 


Ala 


Asp 


Thr 


Arg 


Leu 


Gin 


Pro 


Arg 


Arg 


Ala 


Val 


Arg 


Leu 


Arg 








20 










25 










30 






Ala 


Gly 


Thr 


Arg 


Thr 


Lys 


Met 


Lys 


Arg 


Arg 


Phe 


Leu 


Ser 


Leu 


Leu 


Leu 






35 










40 










45 








Leu 


Tyr 


He 


Leu 


Ser 


Ser 


He 


Ser 


Leu 


Ser 


Ala 


Gin 


Arq 


Phe 


Pro 


Met 




50 










551 










50 










Val 


Gin 


Gly 


He 


Glu 


Leu 


Asp 


Thr 


Asp 


Ser 


Leu 


Phe 


Ser 


Leu 


Pro 


Lys 


€5 










70 










75 










80 


Arg 


Pro 


Trp 


Arg 


Ala 


He 


Gly 


Lys 


Thr 


He 


Gly 


Val 


Asn 


Leu 


Ala 


Val 










85 










90 










95 




Trp 


Gly 


Phe 


Asp 


His 


Phe 


He 


Met 


Asn 


Glu 


Asp 


Phe 


Ala 


Asp 


He 


Ser 








100 










105 










HO 






Trp 


Gin 


Thr 


He 


Lys 


Ser 


Asn 


Phe 


Gin 


Thr 


Gly 


Phe 


Gly Trp Asp Asn 






115 










120 










125 








Adp 


Lys 


Phe 


Val 


Thr 


Asn 


Leu 


Phe 


Ala 


His 


Pro 


Tyr 


His 


Gly Ser 


Leu 




130 










135 










140 










Tyr 


Phe 


Asn 


Ala 


Ala 


Arg 


Ser 


Asn 


Gly 


Leu 


Ser 


Phe 


Arg 


His 


Scr 


Ala 


145 










ISO 










155 










160 


Pro 


Phe 


Ala 


Phe 


Phe 


Gly 


Ser 


Leu 


Met 


Trp 


Glu 


Leu 


Leu 


Met 


Glu 


Asn 










165 










170 










175 




Glu 


Pro 


Pro 


Ser 


He 


Asn 


Asp 


Leu 


Cys 


Ala 


Thr 


Thr 


He 


Gly Gly 


He 








180 










185 










190 






Ala 


Leu 


Gly 


Glu 


Met 


Gly 


His 


Arg 


Leu 


Ser 


Acp 


Leu 


Leu 


He 


Asp 


Asn 






19b 










200 










205 








Arg 


Thr 


Thr 


Gly 


Trp 


Glu 


Arg 


Met 


Gly 


Arg 


Glu 


Val 


Ala 


He 


Ala 


Leu 




210 










215 










220 










He 


Asn 


Pro 


Met 


Arg 


Phe 


Leu 


Asn 


Arg 


Leu 


Thr 


Ala 


Gly Glu Val Thr 


225 










230 








235 










240 


Ser 


Val 


Gly 


Ser 


Arg 


Set 


Gly 


Gin 


He 


Phe 


Gin 


Ser 


Val 


Pro 


He 


Asn 










245 










250 










255 




He 


Val 


Vai 


Asp 


Ala 


Gly 


Phe 


Arg 


Phe 


Leu 


Ala 


Asp 


Lys 


Arg 


His 


Ala 








260 










265 










2T0 






Arg 


Thr 


Gly 


Ala 


Thr 


Ala 


Leu 


Thr 


Leu 


Asn 


Leu 


Arg 


Phe 


Asp Tyr 


Gly 






275 










280 










285 








Asp 


Pro 


Phe 


Arg 


Ser 


Glu 


Thr 


Phe 


Ser 


Pro 


Tyr 


Asp 


Phe 


Phe 


Gin 


Phe 
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295 

Lys Ala Gly Leu Ser Phe Ser Glu Ser Gin Pro Leu Leu Ser Gin He 

Asn Leu He Gly lie Leu Ser Gly Cys Gin Leu Leu Ala His Glu 

-^25 330 ^ 

Thr V.l L,u val Gly Gly Leu Phe Gin Hi. Phe A=p Tyx Tyr "n S« 

345 350 
Glu Lys Arg lie Ser Lys A3n Ser Glu Glu Val Leu Val Thr Pro Tvr 
in 360 365 ^ 

Arg lie Ser Gin Val Ala Ala Leu Gly Gly Gly Leu lie Phe Gin His 
375 

His Gly Lys Phe Arg Arg Arg Pro Leu Glu Leu Tyr Ala Glu Thr Tyr 

390 395 A 

Leu A^n V=,l Val Pro Met Gly Ala s« Leu Ser Asp Hi. Tyr Asn Vst 



20 



25 



Asp A^n Arg Asp Tyr Asn Leu Gly Ser Gly Leu Ser Gly Lys ^eu Tyr 

<20 425 130 

Leu Gly Al^ Thr Tyr Asn Asp Leu Trp Ser Trp Leu Leu Gly Val Glu 

ser Tyr Ar, Leu Tyr Thr Trp He Gly Tyr Glu GJu Pro His Gin Lys 

ten Thr A.,p Val Ser Ser Pl.e Met Val Gin Gly tep Glu Ser Lys Al. 

Arg Leu Leu Val Thr Ser Ser Glu Phe Ala til Hi, Pro Gly Pro 
4-85 4 Mac, 

His val Ala lie Val Ala Arg Arg Phe He Arg Lys Thr Ala 'Jyr Gin 

Phe Tyr Pro Asn Val Ser Phe A^p ^hr Gly Asp lie Gin Leu Arg Val 

Phe 
530 

(2) INFORMATION FOR SEQ ID NO: 34 6 



in 520 coc 

30 Gly Phe His Phe 



35 (ij SEQUENCE CHARACTERISTICS: 

lA) LENGTH: 300 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



40 



45 



50 



(ii) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyroroonas ginglvalls 
(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .300 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:346 



Lys Arg Arg Lys Lys Gin Met Lys Arg Leu He Val Phe Leu Ala Met 

55 Gly Gly Leu Leu Phe Thr Lou Ala Asn Gin Glu Ala A.,n ^hr Ala 

^° 25 30 



60 



ser Asp Thr Pro Lys Lys Asp Trp Thr He Lys Gly Val Thr Gly Leu 
Asn Ala Ser Gin Thr Ser Leu ihr Asn Trp Ala Ala Gly Gly Glu Asn 
Thr val Ala Gly Asn Leu Tyr Leu A^n He A^ aL Asn Tyr Leu Lys 
Asp Lys Trp Ser Trp Asp Asn Gly Leu Arg ^hr Asp Phe Gly Leu Ihr 
65 T,.c Thr Thr Ala Asn Lys Trp Asn Lys ler Val Asp Lys He G^u Leu 
Phe Thr Lys Ala Gly Tyr Glu lie Gl'y Lys His Trp Tyr G^y Ser Ala 



70 



Leu Phe Thr Ph= Leu Ser Gin Tyr Ala Lys Gly Tyr Glu Lys Pro Ser 



135 



Asp His Leu Thr Gly Val L^^ Hi, He Ser Asn P^e Phe Ala Pro Ala 
Tyr Leu Thr Leu Gly He Gly Ala Asp Tyr "s Pro Asn Glu Lys p" 
75 ser Leu Tyr Leu sL Pro Thr Thr Gly tys Leu Thr Val Val 111 Asp 
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A.P TV. L,u sec Ser Leu Cly Ma .He GXy Val Ly. V=l .x. 
5 M=. P.e Ola Le. cx, ^x. Leu Va, v,l OX, Sec Asn He Asn 

L.U Met GXu ;..n VaX ..eu XXe The Lys ;^Xa IL" Phe Phe Sec .Xa 

rye The HU, ASP Phe OXy A.n IXe A.p lie ^3" Trp CXu Ma Met L^." 
10 Ala Met Lys He A«n T dk- , ™^ 253 

260 "^^'^ Thr Asn Leu 

lie Tyr Asp TVsp Aso Val i-^^ ^ic n ,^ 

275 '^^P *='y Lys lie Gin Phe 



15 290 '''' ""'^ P^^e 



285 



290 

(2) IHfORJ.tATlOH FOR SEQ ID HO r 34 7 

9n SEQUEUCE CHARACTERISTICS- 

tA) LENGTH: 221 amino acids 
(BJ TYPE: amino acid 
(DJ TOPOLOGY: linear 

25 MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi ) ORIGINAL SOURCE: 
30 ORGANISM: Porphyromonas glnglvalls 

(Ix) EXATURE: 

(A) NAI-IE/KEY: misc feature 

(B) LOCATION 1...221 

35 (xi) SEQUENCE DESCRIPTION: SEQ ID NO; 347 

Thr Arg GIu Ser Val Leu Hi. Cys Arg Thr Ly. L.u Lys Ly. GIu Arg 
Lys Met Ly. Ly. Met He Leu Ala Ala Thr Met Leu Leu Ala ^hr He 
Oly P.e Ala A.n Ala Gin Ser Arg II. Ala Leu Arg Leu ^ Ala A.n 
Phe val Gly Ser A.n Leu Met Gin Lys Val Ala A.n Jhr Ser Val A.n 
45 A.n Ly. Met He Val Cly Le^. Arg Val Gly Ala ^l^a Ala GIu Phe Ala 
Leu ser A.n A.p Gly Phe Tyr Leu Ala Pro Gly Leu Al. Tyr Th. Met 
Arg Gly Ala Lys Met Olu Ser Leu Ser Gl'u Thr Thr Thr Arg Leu Hi. 
Tyr .eu Gl. He Pro Val A.n Ala Gly Met Arg Phe Ser Itl Ala A.p 
A.n Met Ala He Ser Leu GIu Ala Gly Pro Tyr Phe ^tf Tyr Gly Val 
55 Ala Gly Thr He Lys Thr Ly, Val Ala Gly Val ^ir Ala Ser Val A.p 
Ala Phe Gly A.p Asn Gly Tyr Asn Arg Phe L'^ Leu Gly Leu Gly tt^. 
50 ser Ala Ala Leu Ser Tyr Asp Arg Tyr Tyr Val Gin He Gly T^r GIu 
His Gly Leu Leu A.n Met Leu Lys Asp Ala Pro Asp Ly. ^h'x Ser Leu 
Arg Asn Hi. A.p Phe Phe Val Gly Leu Gly Val Arg pSe' 
65 220 
(2) INFORilATION POR SEQ ID NO: 348 

(i) SEQUENCE CHARACTERISTICS- 
70 t^' LENGTH: 248 aniino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(11) MOLECULE TYPE: protein 

^5 (ill) HYPOTHETICAL: YES 
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tvi) ORIGTMAL SOURCE; 

(A> ORGANISH: Porphyromo nas gingiva llr 

( ix ) FEATURE : 

(A) NAI1E/KEY: misc_f eatur^ 
(BI LOCATIOJJ 1 . , .24 0 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 34 8 

lie Lys Arg lie Glu Met Lys Arg lie Phe The Val Ala Leu Val Leu 

^ ^ 10 15 

Leu Ala Ser Val Thr Met Ala He Gly Gin Sec Arg Pro Ala Leu Arg 

20 25 30 

Val Asp Ala Asn Phe Val Cly Ser Asn Gin Ser Met Lys Arg Asp Gly 

35 40 45 

Tyr Val Trp Asp Thr Lys Met Asn Val Gly Leu Arg Val Gly Ala Ala 

SO 55 60 

Ala Glu Pl»e Met He Gly Ser Arg Gly Phe Tyr Leu Ala Pro Gly Leu 

"'O 75 ao 

Asn Tyr Thr Met Lys Gly Ser Lys Thr Glu Trp Asp He Pro Glu Met 

85 on g5 

Val Pro Gly Thr Tyr He Thr Met Val sir Thr Arg Leu His Tyr Leu 

100 105 lie 

Gin Leu Pro H- Asn Ala Gly Met Arg Phe Asp Leu Met Asn Asp Hc^t 
115 120 125 

130 '^^^ '^^^ ''"■''^ '^'^^ 

Thr Tyr Arg Gin Lys Leu Glu Gly Trp Lys Pro A^n Asn Tyr Ser Thr 
l'*^ 150 155 

Glu Phe Phe Gly Pro Thr Leu Gly Gly Pro Thr Asn He Arg Trp Asp 

165 i-;5 
He Gly Ala Asn He He Ala Ala Phe His Tyr Lys Arg Tyr Tyr He 

ISO 185 190 

Gin He Gly Tyr Glu His Gly Phe Val Asp He Val Ser Gly Gly Glv 

200 205 
Ser Asp He Pro Arg Leu Asn Asp Asn Arg Gin Ser Ser Ser Thr Thr 

210 21S 220 

Ala Leu Arg Glu Lys Gly Asn Asn Glu Tyr Ala Tyr Asn Arg Asp Phe 
230 235 240 

Phe Val Gly He Gly Tyr Arg Phe 
245 

(2) INFORMATIOr^ FOR SEQ ID NO: 34 9 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 211 amino acids 

(B) TYPE; aiRino acid 
(D) TOPOLOGY: linear 

Hi) MOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

fvi) ORIGINAL SOURCE: 

(A) ORGANISH: Porphyromonae gingivalis 

(1 X ) FEATURE : 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .211 

(Xl) SEQUENCE DESCRIPTION: SEQ ID NO:349 

Lys Arg Lys Ser Met Lys Arg Met Leu Leu Leu Leu Val Val Leu Leu 

*^ 10 15 

Tyr Gly He Ala Gly Arg Leu Ala Ala Gin Asp Val He Arg Pro Trp 

'^O 25 30 

Ser Leu Gin Val Gly Ala Gly Tyr Ser Asp Thr Glu Asn He Pro Gly 

Gly Phe Thr Tyr Gly Phe Tyr T.eu Gly Lys Arg nVt Gly Scr Phe Leu 

Glu Val Gly Leu Ser Met Tyr Asn Ser Thr Arg Gin Thr Ala Asn Asn 

75 BO 
Ala Asp Ser Phe Ala Ser Asn Glu Gly Asp Gly Ser Phe Gin Vai Asn 
05 90 Qt 
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45 



50 



55 



60 



65 



70 



75 



Met 


Ser 


Ser 


Pro 


Asn 


Glvj 


Lys Trp 


Ser 


Phe 


Phe 


Asp 


Ala Gly 


Ser 


Ala 








100 










105 








110 






Am 


Cys 


Tyr 


Het 


He 


Val 


Val 


Gly Val 


Asn 


Pro 


Lea 


His Leu 


Phe 


Trp 






115 










120 










125 




Gin 


Asn 


Ser 


Arg 


His 


Asn 


Leu 


Phe 


Leu 


Ala 


Val 


Gin 


Ala Gly Leu 


Ser 




130 










135 










I'lO 










hys 


His 


Asn 


He 


His 


Phe 


He 


Tyr 


Gly 


Asp 


Lys 


Gly Ala 


Lys 


Val 


145 










150 










155 






160 


Ser 


He 


Tyr 


Thr 


Asn 
165 


Ser 


Asn 


Thr 


Tyr 


He 

no 


Gly 


Tyr 


Gly Ala 


Arg 
175 


Val 


Ala 


Tyr 


GlU 


Tyr 


Gin 


He 


His 


Lys 


Asn 


Val 


Gly Ala 


Gly Ala 


Ala 


Val 








180 










185 








190 






Met 


Tyr 


Asp 
195 


His 


Gly 


Asn 


Lys 


Met 

200 


Leu 


Thr 


Ala 


Met 


Ala Thr 
205 


Leu 


Ser 


Thr 


His 
210 


Phe 



























10 



15 



(2) INI:OWlATIO!4 FOR SEQ ID HO: 350 

20 (1) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 953 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

25 (ill UOLECVJLE T^PE: protein 

liii) H/POTHETICAL: YES 

(vi) ORIGINAL SOimCE: 
30 (A) ORGAJIISM: Porphyrcmonas gingivalls 

(ix) FEATURE:, 

(A) WW'IE/KEY: misc feature 

(B) LOCATION 1. . . 953 

35 



40 





(Xl) SEQUENCE DESCRIPTION: ; 


SEQ : 


ID NO: 350 










He 


Arg 


net 


Arg 


Val Ser 


Asp 


Leu 


Cys 


Ser Arg 


Leu 


Ser 


Trp 


Leu 


Leu 


1 








5 








10 










15 




Pro 


Val 


He 


Leu 


Val Gly 


Leu 


Leu 


Cys 


Ala 


The 


Leu 


Val 


Ala 


Ala 


Glu 








20 








25 










30 






Arg 


Pro 


Het 


Ala 


Gly Ala 


Val 


Gly 


Leu 


His 


His 


Arg 


Arg 


His 


Ala 


Ala 






35 








40 










45 








Leu 


Ser 


Asp 


Ser 


Thr Ala 


Lys 


Asp 


Thr 


Val 


Pro 


Leu 


Ala 


Lys 


Pro 


He 




50 








55 










60 








Pro 


Asp 


Ser 


Ala 


Phe Arg Asp 


Ser 


Leu 


Pro 


Ala 


Asp 


Ser 


Thr 


Gly 


Ser 


€5 








70 










75 








80 


Hec 


Arg 


Gin 


Asp 


Ser Val 


Tyr 


Asp 


Asp 


Glu 


Phe 


Glu 


Leu 


Glu 


Asp 


He 










as 








90 










95 




Val 


Glu 


Tyr 


Glu 


Ala Ala 


Asp 


Ser 


He 


Val 


Leu 


Leu 


Gly Gin 


Asn 


Arg 








100 








105 










110 






Ala Tyr 


Leu 


Phe 


Gly Lys 


Ser 


Tyr 


Val 


Ser 


Tyr 


Gin 


Lys 


Ser 


TVrg 


Leu 






115 








120 










125 






Glu 


Ala 


Asn 


Phe 


Met Tyr 


Leu 


Asn 


Thr 


Asp 


Ser 


Ser 


Thr 


Val 


Tyr Thr 




130 








135 










140 










Arg 


Tyr 


Val 


Leu 


Asp Thr Ala 


Gly 


Tyr 


Pro 


Ket 


Ala 


Phe 


Pro 


Val 


Phe 


145 








150 










155 










160 


Lys Asp Gly 


Glu 


Gin Ser 


Phe 


Glu 


Ma 


Lys 


Asn 


Phe 


Thr Tyr 


Asn 


Phe 










165 








170 










175 




Arq 


Thr 


Glu 


Lys 


Gly He 


He 


Ser 


Gly 


Val 


He 


Thr 


Gin 


Gin 


Gly Glu 








leo 








105 










190 






Gly 


Tyr 


Leu 


Thr 


Ala Gly Lys 


Thr 


Lys 


Lys 


Met 


Pro 


Asp Asn 


He 


Met 






195 








200 










20S 








Ph- 


Met 


Gin 


Gly Gly Arg 


Tyr 


Thr 


Thr 


Cys 


Asp 


Asn 


His 


Asp 


His 


Pro 




210 








215 










220 










His 


Phe 


Tyr 


He 


Asn Leu 


Ser 


Lys 


Ala 


Lys 


Val 


His 


Pro 


Glu 


Lys 


Asp 


225 








230 










235 








240 


He 


Val 


Thr 


Gly 


Pro Val 


Asn 


Leu 


Val 


He 


Alii 


Asp 


Het 


Pro 


Leu 


Pro 










245 








250 










255 




He Gly Leu 


Pro 


Phe Gly 


Tyr 


Phe 


Pro 


Phe 


Ser 


Asn 


Lys 


Tyr 


Ser 


Ser 








260 








26b 










270 






Gly 


He 


Leu 


Met 


Pro Thr 


Tyr 


Gly 


Glu 


Asp Asn Arg 


Tyr 


Gly 


Phe 


Tyr 






275 








280 










285 






Leu 


Arg Asn 


Gly 


Gly Tyr 


Tyr 


Phe 


Ala 


Phe 


Ser 


Asp Tyr 


He Asp Leu 




290 








295 










300 
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Ala Leu AX9 Gly Glu Tie Phe Ser Lys Gly Ser Tcp Gly He S^c Ala 

Gin Ser Ly.^ Tyr Lys Lys Arg Tyr Ly^ Tyr Asn Gly Ser Phs Glu Ala 

330 

Asn Tyr Leu Val Ser Lys Ser Gly A^p Lys Tyr Val Pro Gly Asp Tyr 

Ser Lys Thr Thr Ser Leu A^n He A^g Trp Thr His Ser Gin Asp Pro 

360 

Lys Ala A^n Pro Leu Gin Thr Leu Ser Ala Asn Val Asn Phe Ala Thr 

375 380 
Gly Ser Tyr Phe Gin Asn Ser Leu Asn Thr Thr Tyr Asp Val Acn Ala 

Arg Thr Ala Thr Thr Arg Ser Ser Ala Val Ser Tyr Ser Arg Lys Phe 

410 415 
Pro Gly Thr Pro Phe Ser He Thr Gly Ser Met Asp He Ser Gin Asn 

425 430 
Met Arg Asp Thr Thr Val Ser Leu Thr Leu Pro Asn Leu Ser He Asn 

Met Ser Thr Arg Tyr Pro Phe Lys Arg Lys Thr Arg Val Gly Pro Glu 
1^ 455 460 



Ser Tie Leu Thr Lys Glu Lys Asp Leu Leu Oln Ser Asn Leu Val Arg 



Arg Trp Tyr Glu Lys Leu Ser Val Gly Tyr Ser Gly Gin Leu Arg Asn 
470 A'jc ' 

Lys 

'^^^ 490 

Asp .rp Lys Asn Gly Met Arg His Ser Val Pro He Ser Leu Thr Val 

50.'> 

Pro Leu Leu A:.p Tyr He Asn Leu Thr Met Gly Val Asn Tyr Asn Glu 

Trp Trp Tyr Thr Lys Gly lie Arg Lys Ser Trp Asn gIu Asp Lys Lys 

535 540 
Thr Phe Leu Pro Ser Asp Thr Thr Tyr Lys Phe Arg Arg Leu Tyr Asp 

Tyr ser Leu Ser Ala Gly Leu Ser Thr Thr Leu Tyr Gly Met Phe Lys 

570 575 
Pro Trp Lys Pro Phe Ser Phe Gly Gly Asn Lea He Met He Arg His 

595 500 
Arg Phe Thr Pro Thr Val Ser Phe Ser Tyr .let Pro A»p Phe Thr Lys 

600 605 
Arc, Arg Tyr Gly Phe Trp Glu Leu Leu Glu His Thr Asp Gin Asn Gly 

615 620 
Lys Leu His Thr Leu Leu Tyr Ser Pro Tyr Phe Glu Gin He Phe Gly 
630 635 *;y)n 

Al^ Pro Ser Met Gly Asn Ala Gly Ser Val Asn Phe Ser Phe Asp 



Asn Leu Glu Ala Lys He Lys Sex Lys Sor Asp Ser Thr Gly l" Lys 
Lys Ho ser Leu He Asp Gin Phe Thr Trp Ser Thr Ser Tyr Asn Het 



680 685 
Phe Ala Asp Ser He Arg Trp Ser Asn He Ser Ala Ser Leu Ala Leu 

695 700 
Ar| L=u ser lys Ser Phe Thr Leu Arg Leu Ser Gly Leu Phe Asp Pro 

715 720 



Tyr Leu Thr Lys Tyr Tyr Glu Gly Clu Asp Gly Lys He He Pro T^-r 



730 735 
Gly Lys Gly Leu Ala Arg 

He Ser Thr Gly Thr Ser Phe Ser ^^r Thr Leu Asn Lys Glu Ser Leu 



Lys Ser Asn Asp Leu Arg He Phe Asn Gly Lys Gly Leu Ala Ar^ Leu 

745 750 



760 765 
ser Gly Leu He Ala Leu Phe Ser Gly Lys Lys Glu Arg Arg Asp Glu 

Lys Lys Asn Thr Gly Ala Thr Pro His Glu Gly Asp Asp Ala Ala Asp 

He Leu Glu Gly Gly Arg Pro Gin Asn Glu Sor G,y Gly Ser Leu LeC 

Glu Arg Asn Arg Gin Gly Gly Ala Val Gin Asp Gly Tyr pite Ala 

Tyr Ser He Pro Trp Ser Leu Ser p" Asp Tyr Ser Trp "n He Ala 

Thr Asp Tyr Asn Arg Tyr Asn Va? Asn Lys Met Glu ml Tyr Tyr Arg 

865 '^'^ tVo ^" ^-^^ ^" 

Trp Ser Phe Gly Phe Asn Ala Asn Tyr Asn Phe Asp Leu Lys Lys He 

890 (J o^. 

Thr Ser Leu Thr Cys Asn Val Thr Arg Asp Het His Cys Trp lla He 
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905 

Ser Ala Ser Phe He Pro He Gly Ala Tyr Lys S=r Tyr A^n Phe Val 

920 925 
He Ser Val Lys Ser Ser Leu Leu Gin Asp Leu Lys Tyr Gin Gin Ser 
935 

Asn Arg Pro He Thr Asn Thr Trp Tyr 

(2) INFORI-IATIOM FOR SEQ ID NO: 351 

(i) SEQUENCE CHARACTERISTICS- 

W LENGTH: 1251 amino acid^ 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

111) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISl-I; Pocphyromonas gingivalis: 

{ix) EtATURE: 

(A) NAUE/KEY: misc featur* 

(B) I-OGATION 1 . . .1251 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO: 351 

Lys Trp Lys Leu Ala Cys Ala Phe Asp Cys Ala Cys Cys Phe Asp Pro 

Phe Val Val Thr Asn Glu Val He He lilt Met Lys Arg Tyr Thr He 

He Leu Ala Val Phe Leu Leu Phe Cys Thr Val Phe Thr Phe Gin He 

40 45 
Lys Ala Arg Pro Tyr Glu Arg Phe Ala Asp Val Glu Lys Pro Trp He 

Gin Lys His Ser Met Asp Ser Lys Leu Val Pro ^a Asn Lys Gly Asa 

Leu He Gin Ala Glu He Val Tyr Gin Ser Val Ser Glu His Ser ^p 

90 Qc 

Leu Val He S« Pro Val fen Glu He Arg Pro Ala Aarv Arg P^e Pro 

105 110 
ser His Arg Lys Ser Phe Phe Ala Glu Asn Leu Arg Ala Ser Pro Pro 

120 125 
Val Val Pro Val Ala Val Asp Lys Tyr Ala Val Pro Val Ala Asn Pro 

Met Asp Pro Glu Asn Pro Asn Ala Trp Asp Val Thr Leu Lys He Thr 

Thr Lys Ala Val Thr Val Pro Val Asp Val vll Met Val He Asp cfn 

Ser ser Ser Met Gly Gly Gin Asn He Ma Arg I.eu Lys Ser All He 
i?*^ ^ 185 190 

Thr Glu Gly Val Arg He Ala tZ Val Ser Tyr Asp H^s Glu Pro Hi 



Ala Ser Gly Gin Arg Phe Val Lys Lys Met Leu Pro Lys Gly Thr Al. 
i»3 200 



Arg Leu Ser Asp Phe Thr Ly^ A^p Thr Ala Phe Leu Cys Gin Ly= He 
ZJO 235 o>n 

Axg Ala Leu Thr Pro lie Trp Gly Thr HLs Thr Gin Gly Gly Leu Ly" 
^••S 200 255 



Met Ala Arg Asn He Met Ala Thr Ser Thr Ala Val Asp Lys His He 

He Leu Met Ser Asp Gly Leu Ala Thr Glu Gin Tyr Pro vll Lys Asn 

260 285 
Val Thr Thr Ala Asp Phe He Gly Lys Thr Gly Asn Ala Asn A.,p Pro 

He Asp Leu Val He Gin Gly Ala He Asn Phe Pro Thr Asn Tyr Val 

ser A^n Asn Pro Ser Thr Pro Leu Thr Pro Tyr Pro Thr His Ser 

ser Lys Val Gly Arg Arg Asn Leu Pro Glu Ser Lys Phe Asp Tyr Ser 

^45 350 
Asn Leu Ser Ala Arg He Thr Phe Asp Gly Val Ala Gly Ala Leu Val 

365 

Tyr Glu Pro Arg Phe Pro His Pro Tyr Tyr Tyr Tyr Phe Pro Cys Asn 
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3''0 375 3Qn 

Ala Ala lie Asn Glu Ala Gin Phe Ala Lys Asn Ser Gly Tyr Thr II- 
390 

His Thr IlG Gly Tyr Asp Leu Gly Asp Phe Ala L-u Ala Asn Asn Ser 
405 

Leu Lys Leu Thr Ala Thr Asp Glu A^n His Phe Phe Thr Ala Thr Pro 

420 425 
Ala Asn Leu Ala Ala Ada Phe Asp Aan He Ala Gin Thr He Asn 
435 

Gly lie Gin Arg Gly Glu Val Thr Asp Phe Vai Ala Pro Gly Phe Il« 

450 455 ^^rj 

Val Lys Asn Leu Thr Gin Ser Gly Asp Val Thr His Leu Leu Asn Val 

470 475 
Ser Asn Gly Thr Val Hl^ Tyr Asp Val Ser Thr L-- Lys Leu Thr Trp 



t 4 95 

The Thr Gly Thr He Leu Ser Ser Ser Glu Ala Thr He Thr Tyr Arg 

500 505 " 



4B5 490 495 

Tyr 

510 

He Tyr Ala A^p Leu Asp Tyr Ha Gin Asn Asn A^p H- Pro Vai A-n 

51S 520 525 

Thr Thr Ser Ala He Gly Pxro Asp Leu Gly Gly Phe Asp Thr Asn Thr 
53b 

Glu Ala Lys Leu Thr Tyr Thr Asn Ser Asn Gly Glu Pro Asn Gin Gin 



Leu He Phe Pro Arg Pro Thr Val Lye Leu GlJ Tyr Gly Val II- itl 
565 57g 

590 

Asp Phe Phe Leu Pro Ser Gly Gly Gly His Ho Val Pro Lys Trp He 

Lys Leu Asp Lys Thr Thr Glu Ala Leu Gin Tyr Tyr Ser Val Pro Pro 

630 635 g 

Thr Asn Thr Val He Thr Thr Ala Asp Gly Lys Arg Tyr Arg Phe Val 

645 650 655 

— ' woa 670 

Lys Lys Pro Ala Gly Asn Ala Tyr Phe Ala Tyr Lys Leu Leu Asn Tyr 

5'0 can ^^.^ •* 



560 



Arg His Tyr Val Leu Val Asn Lys Asp Gly Gin Pro He Gin Ala Asm 

!^80 535 

Gly Thr Val Val Ser Ser Leu Ser Glu Ala His V^i Leu Gin Ser Gin 

595 600 



oou 655 
Glu Val Pro Gly Ser Thr Pro Asn Pro Gly Gin He Gly He Ser Trp 
6t>0 665 



675 680 685 

Trp Met Gly Gly Thr Thr Asp Gin Gin Ser Glu Trp Asp Val Thr Ser 

690 695 700 

Asn Trp Thr Gly Al« Gin Val Pro Leu Thr Gly Clu Asp Val Glu Phe 

Ala Thr Thr Glu Asn Phe Gly Ser Pro Ala V^l Ala Asp Leu His III 
'25 

Pro Thr Thr Asn Pro Lys He He Gly Asn Leu He Asn Asn Ser Asp 

Lys Asp Leu Val Val Thr Thr Ser Ser Gin Leu Thr He Ian Gly Val 

Val Glu Asp Asn Asn Pro Asn Val Gly Thr He Val vfl Lys Ser Ser 

775 7eo 
Lys Asp Asn Pro Thr Gly Thr Leu Leu Phe Ala Asn Pro Gly Tyr Asn 

795 nno 
Gin Asn Val Gly Gly Thr Val Glu Phe Tyr Asn Gin Gly Tyr Asp Cys 
^05 810 

Het Tyr Arg Arg Ser Trp Gin Tyr Phe Gly 

825 

Val Asn Glu s« Gly Phe Pro He Asn A.p Val Gly Gly Asn Glu Thr 

840 845 
V,l A^n Gin Trp Val Glu Pro Phe Asn Gly Asp Lys Trp Arg ero Al. 

855 86^ 
Pro Tyr Ala Pro Asp Thr Glu Leu Gin Lys Phe Lys Gly Tyr Gin II, 

Thr Asn Asp Val Gin Ala Gin Pro Thr Gly Val Tyr Ser Ph. Lys gJv 
885 090 89S 



810 815 

Ala Asp Cys Gly Het Tyr Arg Arg Ser Trp Gin Tyr Phe Gly He Pro 

3Z0 82^ 



Met He eye Val Cy^ Asp Ala Phe Leu Asn Leu Thr Arg Thr Ser Gly 

90S Qio 
Val Asn Tyr Ser Gly Ala Asn Leu He Gly Asn Ser Tyr Thr Gly Ala 

315 920 -. 

He Afp He Lys Gin Gly He Val Phe Pro Pro Glu Val Glu Gin Thr 
935 



val Tyr Leu Phe Asn Thr Gly Tht Arg Asp Gin Trp Arg Lys Leu Asn 

955 o<;o 

Gly ser Thr Val Ser Gly Tyr Arg Ala Gly Gin Tyr Leu Ser Val Pro 

965 n-T-** 



970 975 
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I.yc Asn The Ala Gly Gin Asp Asn Leu Pro Asp Arg He Pro Ser Met 

980 965 990 

His Ser Phe Leu Val Lys Met Gin Asn Gly Ala Ser Cys Thr Leu Unk 
995 1000 1005 

5 He Leu Tyr Asp Lys Leu Leu Lys Asn Thr Thr Val Asn Asn Gly Asn 
1010 1015 1020 

Gly Thr Gin He Thr Trp Arg Ser Gly Asn Ser Gly Ser Ala Asn Met 
1025 1030 1035 1040 - 

Pro Ser Leu Val Met Asp Val Leu Gly Asn Glu Ser Ala Asp Arg Leu 
10 1045 1050 10S5 

Trp He Phe Thr Asp Gly Gly Leu Ssr Phe Gly Phe Asp Asn Gly Trp 

1060 1065 1070 

Asp Gly Arg Lys Leu Thr Glu Lys Gly Leu Ser Gin Leu Tyr, Ala Met 
1075 1080 1085 

15 Ser Asp He Gly Asn Asp Lys Phe Gin Val Ala Gly Val Pro Glu Leu 
1090 1095 HOC 

Asn Asn Leu Leu He Gly Phe Asp Ala Asp Lys Asp Gly Gin Tyr Thr 
1105 1110 1115 1120 

Leu Glu Phe Ala Leu Ser Asp His Phe Ala Lys Gly Ala Val Tyr Leu 
20 1125 1130 1135 

Hi.5 Asp Leu Gin Ser Gly Ala Lys His Arg He Thr Asn Ser Thr Ser 

IIJO 1145 1150 

Tyr Ser Phs Asp Ala Lys Arg Gly Asp Sr.c Gly Ala Arg Phe Arg Leu 
1155 1160 1165 

25 Ser Tvr Gly Cys Asp Glu A«rn Val Asp Asp Ser His Val Val Ser Thr 
1170 H75 1180 

Asn Gly Arg Glu He lie He Leu Asn Gin Asp Ala Lsu Asp Cys Thr 
1185 H90 1195 1200 

Val Thr Leu Phe Thr lie Glu Gly Lys Leu Leu Arg Arg Leu Lys Val 
30 1205 1210 1215 

Leu Ala Gly His Arg Glu Val Met Lys Val Gin Thr Gly Gly Ala Tyr 

1220 1225 1230 

He Val His Leu Gin Asn Ala Phe Thr Asn Asp Val His Lys Val Leu 
1235 1240 1245 

35 Val Glu Tyr 
1250 

(2) IM FORMAT I ON FOR SEQ ID HO s 352 

40 (i> SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 426 amino acids 
tBJ TYPE; amino acid 
(D) TOPOLOGY: linear 

45 (ill ttOLECULE TYPE: protein 

(ill I HYPOTHETICAL; YES 

tvH ORIGIHAL SOURCE: 
50 (Al ORGAhllSM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) HAf-fE/KEY: misc feature 
£B) LOCATION H - . 426 

55 



60 



65 



70 



75 





(xl) 


SEQUENCE DESCRIPTIOM: SEQ ID HO: 352 








Thr 


Met 


Glu 


Val 


Lys 


Lys 


Asn 


Thr 


Val 


Val 


Leu 


Arg 


Lcii 


Leu 


He Trp 


1 








5 










10 










15 


Phe 


Val 


Ala 


I.l e 


Leu 


Leu 


Phe 


His 


Ser 


Ser 


Arg 


Leu 


Trp 


Gly 


Gill Glu 








20 










25 










30 




Gly 


Glu 


Gly 


Ser 


Ala 


Arg 


Tyr 


Arg 


Phe 


Lys 


Gly 


Phe 


Val 


Asp 


Thr Tyr 






35 










40 










45 






His 


Ala 


Val 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 


Asp 


Phe 


Met 


Ser 


Ser 


Arg Thr 




50 










55 










60 








Arg 


Val 


Arg 


Gly 


Glu 


Leu 


Glu 


Arg 


Ser 


Phe 


Gly 


Asn 


Ser 


Lys 


Val Ala 


65 










70 










75 








80 


Val 


Ser 


Val 


Asn 


Ala 


Thr 


Tyr 


Asn 


Ala 


Leu 


Leu 


Lys 


Asp 


Glu 


Thr Gly 










85 










90 










95 


Leu 


Arg 


Leu 


Aig 


Glu 


Ala 


Phe 


Phe 


Glu 


His 


Gin 


Glu 


Glu 


His 


Trp Gly 








100 










105 










110 




Lsu 


Arq 


Leu 


Gly 


Arg 


Gin 


He 


Val 


He 


Trp 


Gly 


Ala 


Ala Asp Gly Val 






115 










120 










125 






Arg 


He 


Thr 


Ajp 


Leu 


He 


Ser 


Pro 


Met 


Asp 


Het 


Thr 


Glu 


Phe 


Leu Ala 




130 










135 








140 
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Gin Asp Tyr Asp Asp lie Arg Met Pro Val Asn Ala Leu Arg Phe Ser 
ISO iss 160 

Val Phe Asn Glu Ser Met Lys Val Glu Val Vsl Val Leu Pro Val Ph- 

165 170 175 

Glu Giy Tyr Arg Leu Pro Val Asp Pro Arg Asn Pro Trp Asn He Phe 

180 185 190 

Ser Leu Ser Pro He Ala Gin Gly Met Asn He Val Trp Lys Glu Glu 

195 200 205 

Ala Gly Lys Pro Ala Phe Lys Val Ala Asn lie Glu Tyr Gly Ala Arg 

210 215 220 

Trp Ser Thr Thr Leu Ser Gly He Asp Phe Ala Leu Ala Ala Leu Hi^- 
225 230 235 240 

Thr Trp Asn Lys Met Pro Val He Glu Val Gin Gly He Val Pro Thr 

245 250 255 

Glu He He Val Ser Pro Arg Tyr Tyr Arg H-t Gly Phe Val Gly Gl" 

260 265 270 

Asp Leu Ser Val Pro VaL Gly Gin Phe Val Phe Arg Gly Glu Ala Ala 

275 280 295 

Phe Asn He Asp Lys His Phe Thr Tyr Lys Ser His Ala Glu Gin Glu 

290 295 300 

Gly Phe Gin Thr He Asn Trp Leu Ala Gly Ala Asp Trp Tyr Ala Pro 
305 310 315 

Gly Glu Trp Met He Ser Gly Gin Phe Ser Met Glu Ser He Phe Aru 

325 330 335 

Tyr Arg Asp Phe He Ser Gin Arg Gin His Ser Thr Leu H- Thr L-u 

34 0 34 s 350 

Am Val Ser Lys Lys Phe Phe Gly Ser Thr I.eu Gin Leu Ser Asp Ph- 

355 360 365 

Thr Tyr Tyr Atp Leu Thr niy Lys Gly Trp Phe Ser Arg Phe Ala A] a 

370 375 380 

A?p Tyr Ala Leu A^n Asp Gin He His Leu Met Ala Gly Tyr Asp Trp 
385 390 



395 400 



Phe Ser Ser Lys Gly Ser Gly He Phe Asp Arg Tyr Lys Asp Asn Ser 

^OS 410 415 

GltJ Leu Trp Phe Lys Ala Arg Tyr Ser Phe 
420 425 

(2) lI/EXJW'iATIOM FOR SEQ ID NO: 353 

(1) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 464 amino acids 
IB) TYPE: amino acid 
{D) TOPOLOGY: linear 

fli) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION 1 ... 464 

(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 353 



Tyr Gly 


Lys 


Arg 


Arg 


Lys 


Leu 


Gly Thr 


Ser 


Val 


Arg 


Pro 


Ser 


Val 


Leu 


4. 


Gin 






5 










10 








IS 




Thr 


He 


Arg 
20 


Phe 


He 


Leu 


Asp 


Leu 
25 


His 


Leu 


He 


Thr Asp 

30 


Phe 


Phe 


Glu 


Gly 


Leu 


Arg 


Val 


Asn 


Pro 


He 


Gly Ala 


Ala 


Ala 


He 


Val 


Ala 


Phe 


He 




35 










40 










45 






He 
50 


Asp 


Leu 


Leu 


Leu 


Leu 
55 


Cys 


Cys 


Ser 


Ala 


Phe 
60 


Met 


Ser 


Ser 


Cys 


Glu 

65 

Glu 


Val 
Arg 


Ala 
Asn 


Tyr 
His 


Phe 
Ser 


Ser 

70 

Ser 


Leu 

Asp 


Lys 
He 


Pro 
Ala 


He 
Leu 


Asp 

75 
Ser 


Leu 
Asn 


Gin 
Leu 


Asn 
Leu 


He 

Asp 
95 


Arg 

80 

Asn 




Asn 


Gin 




85 










90 










Ser 


Leu 


Leu 


Ala 


Thr 


He 


Leu 


He 


Gly 


AJ9n 


Asn 


Val 


He 


Asn 


Val 


Ala 




100 










105 










110 




He 


Val 


He 


Leu 


Ser 


Asn 


Tyr 


Ala 


He 


Glu 


Gin 


Thr 


Phe 


Val 


Phe 




lis 










120 










125 






Ser 
130 


Ser 


Pro 


He 


He 


Gly 
135 


Phe 


Leu 


He 


Gin 


Thr 
140 


He 


Leu 


Leu 


Thr 
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10 



15 



20 



25 



30 



35 



40 



50 



55 



60 



65 



70 



75 



Thr 


Val 


l>eu 


Leu 


Leu 


Phe 


Gly Glu 


He 


Leu 


Pro 


Lys 


Val 


Tyr 


Ala Arg 


145 










150 










155 








160 


Lys 


Asn 


Pro 


Leu 


Gin 


Tyr 


Ser 


Arg 


Phe 


Ser 


Ala 


Ala 


Ala 


Met 


Ser Val 








165 










170 










175 


lie 


Tyr 


hys 


He 
180 


Leu 


Ser 


Pro 


Phe 


Ser 
185 


Lys 


Leu 


Leu 


Val 


Lys 
190 


Ser Thr 


Gly 


lie 


Val 


Thr 


Arg 


Gly He 


Ser 


Ly.«t 


Lys 


Lys 


Tyr 


Asp 


Met 


Ser Val 




1 95 










200 










205 






Asp Glu 


Leu 


Ser 


Lys 


Ala 


Val 


Ala 


Leu 


Thr 


Thr 


Thr 


Glu 


Gly Glu Pro 




210 










215 










220 








Glu 


Glu 


Lys 


Glu 


Met 


He 


Asn 


Glu 


He 


He 


Lys 


Phe 


Tyr 


Asn 


Lys Thr 


225 










230 










235 








240 


Ala 


Cys 


Glu 


He 


Met 
245 


Val 


Pro 


Arg 


He 


Asp 

250 


He 


Val 


Asp 


Val 


Asp Leu 

255 


Sec 


Trp 


Pro 


Phe 


Arg 


Lys 


Met 


Leu 


Asp 


Phe 


Val 


Val 


Ser 


Ser Gly Tyr 








260 










265 










270 




Ser Arg 


Leu 


Pro 


Val 


Ser 


Glu 


Gly S-r 


Glu 


Asp 


Asn 


He 


Lys 


Gly Val 






275 










280 










265 






He 


Tyr 


He 


Lys 


Asp 


Leu 


He 


Pro 


His 


Met 


Asp 


Lys 


Gly Asp 


Glu Phe 




290 










295 










300 








A.9p 


Trp 


His 


Pro 


Leu 


He Arg 


Lys 


Ala 


Tyr 


Phe 


Val 


Pro 


Glu 


Asn Lys 


305 










310 










315 








320 


Ax-q 


He 


Asp 


Ajsp 


Leu 


Leu 


Glu 


Glu 


Phe 


Arq Ala 


Asn 


Lys 


Val 


His Val 










325 










330 










335 


Ser 


lie 


Val 


Val 


Asp Glu 


Phe 


Gly Gly 


Thr 


Cys 


Gly Leu 


He 


Thr Met 








340 










34 5 










350 




Glu 


Arp 


Tie 


Leu 


Glu 


Glu 


He 


Val 


Gly Glu 


He 


Thr 


Asp 


Glu 


Tyr Asp 






355 










360 










365 






Glu 


Glu 


Glu 


Leu 


Pro 


Phe 


Lys 


Val 


Leu 


Gly Asp 


Gly 


Ser 


Tyr 


Leu Phe 




37 0 










375 










380 








Glu 


Gly 


hys 


Thr 


Ser 


Leu 


Ser 


Asp Val Arg 


His 


Tyr 


Leu 


Asp 


Leu Pro 


385 










390 










395 








400 


Glu 


Asn 


Ala 


Phe Gly Glu 


Leu 


Gly Asp Glu Val 


Asp 


Thr 


Leu 


Ser Gly 










405 










410 










415 


Leu 


Phe 


Lgu 


Glu 


He 


Lys 


Gin 


Glu 


Leu 


Pro 


His 


Val 


Gly Asp Thr Ala 








420 










425 










430 




Val 


Tyr 


Glu 
435 


Pro 


Phe 


Arg 


Phe 


Gin 
440 


Val 


Thr 


Gin 


Met 


Asp 
445 


Lys 


Arg Arg 


He 


He 
4 60 


Glu 


He 


Lys 


He 


Phe 
455 


Pro 


Phe 


Glu 


Arg 


Thr 
460 


Trp 


Glu 


Val Glu 



(2 1 IMF0RI4ATIOH FOR SEQ TO NOt354 



(IJ SEQUEHCE CHARACTERISTICS: 
45 (A) LENGTH: 266 amino acids 

(B) TYPE: amino acid 
(D! TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi» ORIGIt4AL SOURCE: 

(A) ORGAmSM: Porphyroitvonas glnqivalis 

FEATURE: 
tAJ UAME/KEY: misc_feature 
(BJ LOCATION 1. . .266 





(xil SEQUEHCE DESCRIPTIOH: ! 


SEQ ID NO: 354 










He 


He Tyr 


Arg 


Ser 


Thr 


Met 


Lys 


Leu 


Leu 


Lsu Tyr 


Leu 


Leu 


Leu 


Val 


1 






5 










10 






15 




Leu 


Ser Thr 


Leu 


Ser 


Pro 


Met 


Tyr 


Ser 


Gin 


Met Leu 


Phe 


Ser 


Glu 


Asn 






20 










25 








30 






Leu 


Thr Het 


Asn 


He 


Asp 


Ser 


Thr 


Lys 


Thr 


He Gin 


Gly Thr 


He 


Leu 




3b 










40 








45 








Pro 


Val Leu 


Asp 


Phe 


Lys 


Thr 


Glu 


Lys 


Glu 


Asn Val 


Phe 


Thr 


Phe 


Lys 




50 








55 


















Asn 


Thr Ala 


Asn 


Leu 


Asn 


Leu 


Leu 


He 


Lys 


His Gly 


Gin 


Val 


He 


Arn 


65 








70 








Ih 








80 


Leu 


He Asn 


Lys 


Leu 


Glu 


Phe 


Ser 


Thr 


Tyr 


Gly Asn 


Lys 


Val 


Thr 


Val 








85 










90 








95 




Ser Gly Gly 


Tyr 


Val 


His 


Thr 


Glu 


Tyr 


Arg 


Tyr Leu 


Leu 


His 


His 


Val 






100 










105 








110 
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Phe GlLi Val Tyr Pro Tyr Val Glu Ser Gin Trp Ala Glu Ser Arg Gly 

HeL hys Tyr Lys Val Ser Thr Gly Leu Gin Ser Arg Tyr Arg Leu Val 

5 A^n Ser Asp Asn Cys Leu Met Phe Ala Thr Leu G^y Val Phe Phe Glu 

Phs Glu Lys Trp Glu Gin Pro Ala Thr Ser Leu Phe Ala Gly Thr iyr 

Ala Tyr Ser Arg Ser He Ly. Ser His [eu Ser He Sec Phe ^rg HK- 

190 

Arg Leu Gly Glu His Trp Glu Phe Thr Thr Thr Ala Tie Hi3 Gin Gly 

200 205 
LV5 Pro A^p Ser Tyr Phe Lyn Lys Ala Arg Phe Gly Gly Ala He Arp 
210 215 220 



15 



20 



30 



45 



60 



Leu Lys Tyr His He Thr Pro Thr He Gly He Arg Gly Ala Tyr Arg 

235 240 

He He Tyr A^p Thr Ala Pro He Val Pro Val Arg Lys Asp Tyr A^n 

2^*5 250 7SS 

Thr Val Asp Val Gly He Asp He Ser Ph- 
260 265 



{21 IMPOIUiATlOH FOR SLQ ID MO: 355 

„_ <il SEQLfEHCE CHARACTERISTICS: 

^•J (A) LENGTH; <>n7 



(A) LENGTH: 907 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) H-ZPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 
35 t'^' ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .907 

40 (kL) SEQUEJ4CE DESCRIPTION: SEQ ID NO: 355 

Thr Tie Cys Val Ala Phe Leu Ser Ala Pro Val Ala Ala Leu Arg Glu 
Ser Pro Pro Met Gly Ala Glu Arg Lys ^hr Pro Ser Leu Leu Pro Leu 
Leu Phe Glu His Ser Asp Lys Gly Lys Gly Phe Asp Tyr Arg Leu Phe 
Thr Ser Asn Lys Leu Lys Val Phe Ala Thr Gly Asn Ser Arg Tyr He 
50 His Asn Lys Pro Thr He lie Gin Ala Met Lys t^g He Val Leu Ser 
Ser Phe Leu Phe Val Leu Ser He Leu Ser Zeu Met Ala Gin Asn l^n 

55 ^""^ Too '^^^ Ser ler Gly 

110 ^ 



Glu Pro Val Pro Tyr Ala Thr Val Ser He Arg Leu Thr Gly Ala Asp 

Thr Thr Gin Val Phe Arg Gin til Thr Asp Gly A.n Gly Tyr Phe Val 

135 l40 
Il« Gly Leu Pro Ala Ala Pro S«r T^^ His Leu Thr Ala Ser Phe Val 
1 50 155 1 -n 

Gly Met Lys Thr His Thr Het Gin He Ser Arg Gly Asn Gly Gin ml 

65 "-''^ tVo ^ilt »-v« °" 

Ser Thr Val Thr Val Ser Ala Ala Pro Leu Val Lye H^t Glu 11- 

* 205 

Asp Ar, Leu Ser Tyr A^n Met Lys Asp Asp Pro Ala Ala Lys Thr Asn 

/U Asn Leu Leu Glu Met Leu Arg Asn Val Pro Leu Val Thr Val Asp Gly 

235 7Art 
Gin Gly Asn He Gin Val Lys Gly Ser Ser Asn Phe Lys He His LeS 

75 "^"^ Ho ^" ""'^ ^" ^" ^" "l "he 

265 270 
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27 5 










280 
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2 90 
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Tyr 


He 
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420 








425 
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L-u 
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Thr 


Gin 


Met 
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Gly 


Ser 


Tyr 


Glu 


Lfeu 
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Al A 






435 










440 
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Asp 


Tyr 


Gin 


His 


Ser 


Thr 


Arg 


Leu Pro 


Gly Glu 


Leu 


Leu 


Thr 


Val 


Ser 




450 










455 
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Tyr 


Arg 
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Thr 
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Asn Asn 
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Thr 


Phe 


lie 
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Gin 












470 








475 
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Trp 


Lys 


Arg 


Asp 
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Leu 
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Thr Ala 


Asn 


Thr 


He 


Gin 


Tyr 


Ala 


Gly 










485 








490 










4 95 




Gin 


His 


Ser 


Lys 


Ser 


Asp Ala 


Gly Het 


Asp 


Glu 


His 


Thr 


Ala 


Gin 


Val 








500 
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Thr 
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Leu 
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Gly Leu 
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Thr Ser 
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He 
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€25 
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865 870 875 880 

Arg Thr lie Val Asn A^p Asp Leu Lys Gin Thr Ser Scr Gin Gly Gin 

885 890 895 

Gin Gly Gly Cily Gin Gly Asn Pro Thr Gly Asn 
5 900 905 

(2) INFORIIATION FOR SEQ ID HO: 356 

(1) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 4S0 amino ac;ids 

(B) rrPE: amino acid 
(DJ TOPOLOGY: linear 



15 



20 



(il) MOLECULE TYPE: protein 

(111) HYPOTHETICAL: YES 

(yH ORIGINAL SOURCE: 

(A) ORGA^JTSM: Porphyromonas gingiva lis 

(Ix) FFATURE: 

(A) NAUE/KEY: misc_feature 

(B) LOCATION 1. . . 450 



25 (xi) SEQUENCE DESCRIPTIOt^I : SEQ ID HO:3S6 

Trp Cys Gin Ser Asp Pro lie Pro His Cys Hi^ Ala Val Tyr Arq Asn 
^ 5 10 IS 

Ala Gin Gly His Gin Gin Gly Arg Ser Asp Gin Asp Arg Leu Pro Pro 
30 20 23 30 

Val Arg Gly Tyr Arg Tyr Ala Arg Cys Leu Gly Arg Tyr Glu Gly Phe 

35 40 45 

Gly Glu Arg Met Arg Leu Ser Ala lie Leu lie Ala Leu lie Val Met 
SO 55 eo 

35 Leu Pro Ala Val Leu Ser Gly Gin His Tyr Tyr Ser Met Ala Gly Glu 
65 70 75 GO 

Arg Leu Glu Thr Asp Ser lie Arg Pro Asn Giu Leu Ser Ala Ser lie 

as 90 95 

Arg Ser Ala Leu PVie Phe Arg Asn Asn Glu Tyr Asn Ala Arg Ser Val 
40 100 105 110 

Lys Gly Tyr Thr Leu Pro Gly Ala Arg Val Ser Ala Phe Ala Ser Tyr 

13 5 120 125 

Ser Leu Pro Ala Ala His Gly Val Lys Leu Ser Leu Gly Val Ser Thr 
130 135 140 

45 Leu Asn Tyr Trp Gly Ala Ser Arg Tyr Pro Ala Gly lie Ala Tyr Scr 
1*15 150 155 160 

Asp Leu Pro Tyr Trp Thr Asp Tyr Asn Asp Tyr Val Arg Leu Arg lie 

165 170 175 

Leu Pro Tyr Val Gin Ala Met Leu Lys Pro Thr Ala Thr Thr Ala Leu 
50 180 185 190 

Met Leu Gly Asn He Ala Gly Gly The Ala His Gly Leu lie Glu Pro 

1?5 200 205 

He Tyr Asn Pro Glu Leu Asp I.eu Thr Ala Asp Pro Glu Ala Gly Val 
210 215 220 

DO Gin Phe Arg Gly Asp Trp Thr Arg Phe Arg Met Arp Val Trp Val Asn 
225 230 235 240 

Trp Met Ser Met He Phe Lys Asn Asp Asn His Gin Glu Ser Phe Val 

245 250 255 

Phe Gly Leu Ser Thr Thr Ser Lys Leu Leu Ser Gly Glu Gly Lys Trp 
60 260 265 270 

Arg Leu Glu Leu Pro Leu Gin Ala He Ala Thr His Arg Gly Gly Glu 

275 280 285 

Tyr Asn Trp Ala Gin Gin Asp Thr Val His Thr Trp Val Asn Gly Ala 
290 295 300 

bo Vai Gly Leu Lys Leu Ser Tyr Arg Pro Arg Thr Asp Lys Pro Met Gin 
305 310 315 320 

He Trp Gly Ser Ala Tyr Gly Val Ala Ala Leu Ser Ser Gly Gly Tyr 

325 330 335 

Phe Pro Tyr Glu Arg Gly Trp Gly Gly Tyr Leu Se-r Leu Gly Met Asp 
/O 340 345 350 

Leu Glu His Phe Ala Phe Arq Thr Asp Tyr Trp Tyr Gly Arg His Tyr 

355 360 365 

Val Ser Pro Phe Ala Ala Pro Phe Ala Asn Ser Leu Thr Tyr Asp Lvs 
370 375 380 

75 Gin Pro Leu Thr Asn Gly Trp Gly Asp Tyr He Arg Leu Tyr Ala Asp 
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395 

Tyr Ser Trp Arg Met Ala Arg Ser Vai Ser Leu Ala Ala Val Ala Arg 
405 

Val Trp Phe Gin Pro Ser Asp Arg Phe Ala Met Ser Hi:s Ala L«u Glu 
420 jot 



10 



20 



30 



Asn His 
450 



'*20 425 430 

Leu ThL- Met Arg He Asp Pro Lys Phe Pro He Ala Phe Leu Lys G1-, 



(2) IHrORMATION FOR SfQ ID NO: 357 



(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 447 amino acids 
(D) TYPE: amino acid 
(D) TOPOrxxSY: linear 



(ii) MOLECULE TYPE: protein 
(iii> HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingival! s 

25 (ix) FEATURE: 

(A) HAME/KEY: misc_feature 

(B) LOCATIOTJ 1. . .447 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357 

Pro Phe Ser Pro Pro Trp Val Cys Ala Gly Ala Ala Ser Ser Val Arg 

Thr Arg Pro Cy^ Ser Thr Val Ser Ser Thr Ala Ala Ser Ser tils Arg 
or: 25 30 

35 Arg ser Glu His Met Arg Ser Ala A^p Ser Ser Pro Ala Tyr Ser Pro 
35 40 45 

He Gly Ser Lou Ser Ser Ser His Ser Phe Asp Ser He Pro Asp Gin 
55 go 

40 fis^ ^^"^ i*^" ^^"^ Il« cys Gly 

^0 75 80 

Met Pro Ala He Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu 

85 90 95 

Pro ser Leu Ala Glu Arg Val Phe Gly Leu Glu Gin Lys Gin Lys Lys 
Ai? . -^^^ 105 110 

4b Leu Lys Val Tyr Leu Gly He Gin Ser Phe Tyr Asp Gin Pro Leu Val 
115 120 125 

IKsp Asp Glu Ser His He Gly His Phe Lys Val Gin Glu Leu Arg Met 

50 ^45 ""^^ "^"^ ""^^ ""'^ ISO ^""^ "^"^ ^^"^ ""^P 

Arg Leu Asn Arg Ala Ala Asp Gly Thr Ser Phe Ala Asp Asn Leu Ser 

Asm Ala Ho Asp He Ala Gly Val Asp T^ His Pro Asn Asp III Val 

185 190 
t>b Ser Phe Phe Phe Gly Arg Gin Tyr Ala Arg Phe Gly Gly He Glu Tyr 

200 205 
Asp Het Asn Pro Val Glu lie Tyr Gin Tyr Ssr Asrp Leu Val Asp Tyr 

60 2?S ''^'^ "^^ '''^ '^'^ P'^^ Ala "p A^n Phe His Pro 

230 235 7jn 

Glu Gin Gin Leu Gin Leu Gin Val Leu Asn Ala Tyr Asn Asn Arg Ph^ 

, 250 255 

Ala Asp Arg Tyr His Val Thr Pro Asp Val Ala Thr Ala Thr Ser Tyr 

65 Pro Leu Leu Tyr Ser Ala Gin Trp ^n Gly Thr Leu Leu G^y Gly Ala 
-^'5 280 285 

Leu Hi^ Met Arg Tyr Ala Val Ser Met Ala His Cln Ala Gin Glu Arg 
295 300 
70 Tos ''^^ '''''' ^""^ ""'^ Asn- Pro Gly Lys Arg 

He Asn Gly Tyr Leu Asp Leu Thr Tyr Ser He Glu Gly Leu Asp Ifp 

325 ,-,c: 

Lys Gly He Met Thr Ala Arg Tyr Gly Lys Gly Lys Thr Leu Thr A^p 

TK -^^^ 345 350 

75 val Lys Tyr Tyr Ala Leu Val Ser Lys Trp Asn Phe Arg He Ph, A..p 
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355 360 365 

(Sin Val Asn Leu Phe I.eu Lya Gly Met Tyr Glu A.<?n Gly Tyr Ala Pro 

370 375 380 

Ala Gin Tyr Gly Glu Ser Ser His Thr Arg His Ser Tyr Gly Tvr M-t 

385 390 J J - 



395 400 



Giy Gly Val Glu Tyr Tyr Pro Thr Glu Thr Acn Phe Arg Leu Phe Val 

405 410 415 

Thr Tyr lie Gly Arg His Tyr Arg Tyr Ser Ala Thr Glu Thr Glu Ser 

420 426 430 

Thr Asn Ala Leu Arg A] a Gly Leu He Tyr Gin IIr Pro Phe Leu 
435 445 

(21 INFOWIATION FOR SEQ ID HO: 358 

(j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 amino acids 

(B) TYPE: aroino acid 
ID) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGIfJAL SOURCE: 

(A) ORGAHISM: Porphyromonns gingivalis 

<ix) FEATURE: 

(A> KAME/KEY: mloc_feature 

(B) LOCATION 1. . ,227 

(xl) SEQUETICE DESCRIPTIOW: SEQ ID MO: 358 

Tyr Lys Gin Ala He Met. Lys His Leu Phe Lys Ser Thr Leu Val Leu 

^ S 10 15 

Leu C^'s Ala Leu Ser Phe Ser Gly Thr Tyr Thr Phe Ala Gin Glu Asn 

20 25 30 

Asn Thr Glu Lys Ser Arg Phe Asp Phe Ser Val Arg Leu Gly Gin Glv 

35 40 45 

Tyr lie Ala Gly Ser Thr Thr JKsn Leu Met Tyr Gly Tyr Thr Ser Ala 

SO 55 go 

Asn Asp Arg Leu Leu Ser Gly Ala He Tyr Leu Gly Leu Thr Pro Ser 

70 75 aQ 

Lys Lys Glu Asn Ala Thr Gly Val Ala Phe Arg Phe Leu Ser Pro Ser 

85 90 95 

Pro Gly Tyr Tyr Val Asp He Ser Gly Lys Glu Asn Thr Leu Asn T/r 

100 105 110 

Ala Phe Tyr Val Val Gly Ala Tyr Asn Arg He Ala He Pro 11- Arg 

115 120 125 

Pro He Lys Asn Phe Asn Phe He Phe Ser Thr Glu Val Gly Met Ala 

130 135 140 

Trp Met Ser Arg Hts Glu Gin He Tyr Asn Ser Thr Ser Gin Thr Trp 
150 155 160 

Asp Lys Gin Arg Lys Ser Arg Ser Gly Leu Asp Phe Gly Leu Gly Met 

155 170 
His Leu Gin Unk His He Asn Lys Thr Val Tyr Phe Met Ala Gly Thr 

100 IBS 190 

Asp Leu Thr Ser Cys Met Phe Gly Lys Arg He Asn Asp Tyr Gin Gin 

195 200 205 

Lys Asp Axg Thr Phe He Ala Leu He Asp Asn Ser He Gly He Glv 

210 215 220 

Leu Asn Leu 
225 

(2) INFORI-IATION FOR SEQ ID NO: 359 

(i) SF.QlfEMCE CHARACTERISTICS: 

(A) LENGTH: 4 06 amino acidi= 

(B) TYPE: amino acid 
iD) TOPOLOGY: linear 

til J riOLECULE T-fPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



{A} ORGAI-JISH: Porphyromonas gingivaiis 

<ix) FEATURE: 

(A) NAME/ KEY: misc feature 

(B) UXy^TlOM 1. . .4 06 

SEQUENCE DESCRIPTION: SEQ ID NO: 359 

lie Phe He Asp Pro Asp Lys Asn Thr Lys Gin Asn Glu Arg 
^ S 10 

He He Lys Lys Met Leu Lys Asn Lys Leu Ala Pro Leu Ala 

20 25 30 

Phe Leu Phe Ala Pro Lys Ala Met Lys Ala Gin Glu Gin L«u 

3^ 40 45 

Val His Thr Scr Val Pro Ser Leu Asn He Scr Pro Asp Ala 

SO 55 60 

Ala Gly Met Gly Asp He Gly Val Ala Thr Thr Pro Asp Ala 

65 70 75 

Gin Tyr Trp Asn Pro Ser Lys Tyr Ala Phe Met Asp Thr Lys 

85 90 

He Ser Phe Ser Tyr Thr Pro Trp Leu Ssr Lys Leu Val Asn 

105 no 

Ala i.eu Hot Gin Met Thr Gly Phe Tyr Lys Leu Gly Thr Asp 

120 125 
Gin Ala He Ser Ala Ser Leu Arg Tyr Phe Thr Leu Gly Lys 

130 135 i:jo 

Thr Ph« Asp Glu Leu Gly Glu Scr Met Gly Glu Ala His Pro 
l'*^ ISO 155 

Phe Ala Val Asp Leu Gly Tyr Ser Arg Gin Leu Ser Glu Asn 

165 170 
MeL Ala Val Ala Leu Arg Tyr He Arg Ser Asp Gin Ser Thr 

185 190 
Thr Gly Glu Asn Gin Ala Gly Asn Ala Phe Ala Ala Asp He 

195 200 205 

Tyr Leu Gin Lys Tyr Val Leu Leu Gly Asn Ala Glu Ser L«u 

210 215 220 

Leu Gly Phe Asn Val Lys Asn He Gly Thr Lys He Ser Tyr 
225 230 235 

Gly Val Thr Ser Phe Phe H- Pro Thr Ser Leu Asn Leu Gly 

245 250 
Leu Leu Tyr Pro He Asp Asp Tyr Asn Ser He Asn Phe Asn 
260 265 270 

Leu Scr Lys Leu Leu Val Pro Thr Pro Pro He Met Asp Gin 

2^5 2B0 285 

Gin Ala Gly Tyr Glu Ala Ala Leu Lys Lys Tyr Gin Glu Thr 

2^0 295 300 

He Ser Gly He Phe Ser Ser Phe Gly Asp Ala Pro Gly Gly 
305 310 

Glu Glu Phe Ary Glu He Thr Trp Gly Leu Gly Ala Glu Tvr 

325 330 ^ 

Asp Asp Lys Phe Phe Val Arg Ala Gly Tyr Ser Tyr Leu His 

345 350 
Lys Gly Asn Leu Gin Tyr Phe Thr Ala Gly Ala Gly Phe Lys 

360 365 
He Phe Arg He Asp Ala Ser Tyr Leu Leu Ser Thr He Gin 
370 375 

Pro Leu Asp Gin Thr Leu Axg Phe Thr Leu Ala Phe Asp Met 
390 

Leu Arg Asn Leu Phe His 
405 



Acn Met 
15 

He Leu 

Asn Val 

Arg Ala 

Tyr Ser 
90 

Ala Gly 

95 

Asp He 

Glu Asn 

Leu Glu 

Asn Glu 
160 
Phe Ser 
175 

His Asn 

Ala Gly 

Trp Ser 

Asp Gly 
240 
Thx Gly 
255 

Leu Glu 

Asn Asp 

Ser Ser 

Leu Lys 
320 
Ser Tyr 
335 

Pro Thr 

Met Asn 

Ser Asn 

Asp Gly 
400 



65 



70 



75 



(2) iriiXJRHATION FOR SEQ 10 NO: 350 

fi) SEQUEl^CE CHARACTERISTICS: 

(A) LENGTH: 452 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) nOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyronvjjnas gingiva lis 
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(ix» FEATI.IRE: 

(A) MAME/KEY: misc_f ea ture 

(B) LOCATICT-f 1, . . 4S2 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360 

Leu Tie lie Glu l^ys Giu Met Lys Thr Thr Val Gin Gin He He Leu 

IS 10 15 

Ciis Leu Ala Leu Met Met Ser Gly Val Leu Gly Gly Asn Ala Gin Ser 

20 2b 30 

Phe Trp Glu Glu He Ala Pro Pro Phe He Ser Asn Glu Pro Asn Val 

35 40 45 

Lys Tyr He He Pro Asn Met Gly He Acp Ser Lys Gly Thr He Tyr 

50 55 60 

Val Thr Val Thr Lys Arg He Gin Gin Gly Ala Atn Tyr Thr Ser Giu 
65 70 75 80 

Gin Leu Gly Met Tyr Tyr Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp 

85 90 95 

Lys His Asp Pro Tyr Phe Asp Asp Lys He Val Ala TKsp He Gin Thr 

100 105 110 

Asp Ala Tyr Gly Arg Val Tyr Val Cys Thr Thr Ser Ser Arg Asp Gin 

115 120 125 

Glu Tyr Gin Leu Tyr He Asn Glu Gin Asn Glu Trp Arg Cys He Phe 

130 135 140 

Lys Thr Ser Val Ser Thr Tyx Glu His Gly Met Ala Val Phe Arg Ser 
1^5 150 155 

Ser Thr Gly Val Thr Tyr He Gly Thr Arg His His He Phe Ala Ser 

16b 170 175 

Gly Val Asn Asp Phe Glu Phe Asn Thr He Tyr Glu Asp Ser Thr Pro 

180 185 190 

Met Ser Cys Axg Phe Ala Glu Ala Thr Asn Ser Gly Thr He Tyr Leu 

195 200 205 

Ala Leu Met His Glu Thr Thr Met Ser Thr Thr He Leu Thr Tyr Gin 

210 215 220 

Asn Gly Glu Phe Val Asp He Ser Glu Ser Glu Leu Ser Asn Ser II- 
225 230 235 240 

He Ala Ser Met Cys Ser Asn Lys Glu Gly Asp He He Ala Leu Val 

245 250 255 

Thr Ser Tyr Thr Gly Phe Met Ser Gly Thr Leu Ala He Arg Lys Ala 

260 265 270 

Asp Glu Gly Lys Trp Gin Leu Val Gly Gly Asp He Gin Asn Ala He 

275 280 205 

Val Gin Asn He Cys Met Met Asp Asp Asn Lys He Ala Cys Glu Val 

290 295 300 

Phe Gly Thr Pro Asn Gly Val Asp Gly Arg Thr Ara Val Cys Val Ser 
305 310 315 ' 320 

Asp Ala Ser Val Phe Asp Phe Glu Trp Tyr Glu Asp Glu He Tyr Gly 

325 330 335 

Gly Leu He Dhe Asp Thr Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu 

340 345 
Tyr Ala Lys Phe Gly Gly He Met Leu Arg Ser Lys Glu Ser Phe Il« 

355 360 365 

Thr Ser Phe He Ser Pro Tlir Val Val Gin Gly Val Asp Val Tyr Thr 

370 375 380 

Leu Ala Gly Lys He Arg lie Glu Ser Glu Thr Pro Val Ser Glu Val 
385 390 395 400 

Leu Leu Phe Asp Leu Ala Gly Arg Met Val Leu Arg Gin Thr He Asp 

405 410 415 

Asn Lys He Tyr Ser Asp lie Asp Thr Asn Gly Leu Lys Arg Ser Gly 

420 425 430 

He Tyr Val Val Ser Val Arg Leu Ser Ser Gly Gin Val Phe Ser His 

435 440 445 

Lys Val Gin Val 
450 

(2) IHFORI-IATXON FOR SEQ ID NO: 361 

(i) SEQtiEHCE CHARACTERISTICS : 

(A) LENGTH: 331 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

tlii) HYPOTHETICAL: YES 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromon^is gingival is 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION I. . , 331 



10 



15 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361 

Gly Leu Tyr Gly Mat Ser Val Val Pro He He He Tyr Leu Cys Gly 
^5 10 IS 

He Ser Asn Tyr Ala Arg Leu Met He He Arg Cys Leu He Arg Arg 

20 25 30 

Pro Arg Thr Val Leu Phe Gly Leu He Phe Val Val Gly Leu Phe Ser 

35 40 45 

Ala Met Ala Gin Glu Lys Lys Asp Ser Leu Ser Thr Val Gin Pro Val 
50 55 60 

9n ^-"^ ^'^'^ Leu Leu Ser He Asp His 

^0 65 70 75 80 

Pro Val Leu Pro Ala Ser Phe Gin Asn Thr Arg Thr Leu Lys Arg Phe 

85 90 95 

Arg Asp Lys His Leu Ser Asp Ala Leu Leu Asn Gly Leu Lys Pro His 
on 105 110 

ZD Arg Ser Ser Leu Gin Leu Asn Glu Glu Leu A^^n Phe Ala Ala Glu Arg 

120 125 
Arg Acp Phe Val Ser Pro Leu I.eu Gin Thr Arg His Ala Ala Gly Val 

130 135 
Leu Ser Trp Arg Pro Thr Asp Arg Met His Phe Tyr Thr Ser Gly Asn 

150 155 
He Gly Leu Gly His Asp Leu Leu Thr Gly Val Arg Lys Asp Phs Gly 

165 170 175 

Trp Asn Ala Gly Ala Asp Phe Leu Leu Ser Gin Asn Leu Thr Ala His 

180 185 190 

Val Gin Gly Gly Trp Gin Gin Asn Phe Gly Phe He Pro Met Thr Ala 

195 200 205 

Val Asn Gly Gin Leu Arg Trp Gin Ala Thr Glu Arg Leu Ser Phe Thr 
210 215 220 

225 230 235 240 

Thr Phe Ser Leu Lys Gly Ser Ala Arg Tyr Glu Val Met Asp Asn Val 

245 250 255 

Phc^ Val Asn Gly Phe Gly Ser Tyr Pro Leu Tyr Ser Ser Thr Arg Ser 
ytr 265 270 

Gly Leu Asn Met Ala Val Pro Met His Gly Phe Gly Pro Gin Tyr Gly 
275 280 285 

fo^ ^^'^ '^^'^ P*^** <^ly Ala Val Gly 

2^0 295 300 

*m ^""^ '^^''^ '^^^ "^^P "^^^ Arg Arg Trp Glu Thr His Tyr 

315 

Phe Ala Tyr Pro Val Phe Tyr Gly Asp Lys Lys 
32b 330 



30 



35 



55 



60 



65 



70 



75 



(2) IHrORMATlON FOR SEQ ID NO: 362 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGl'H: 329 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) fiYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

( Ix ) FEATURE : 

(A) NAME/KEY: mlsc feature 

(B) lOCATIOJJ 1. . .32 9 

Ixl) SEQUENCE DESCRIPTION: SEQ ID NO: 362 

Glu Thr Asn Ser Trp Val Ser Ser Asp Cys Asn Ser Thr Thr Met Lys 
^5 10 15 
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Thr Asn Arg Arg Tyr Ala Phe Val Leu Pro Leu Leu Leu Leu Thr Gly 

20 25 30 

Leu Leu Ala Trp Gly Gin A^p Ser Ser His Gly Ser Asn Thr Ala Ph» 

35 40 45 

Ala Thr Asp Ser Ser Ser Arg Glu Leu l»ro Thr Glu Gin Scr Ala Tyr 

50 55 60 

Arg He His Ser Ala Tyr Met Val Gly Gly Gly Gly Ser lie Thr Arg 
65 70 75 80' 

Asp Thr Tyr Leu Ser Pro Leu Arg Tyr Gly Gly Trp Thr Leu Asn Leu 

85 90 95 

Leu Gly Glu Lys Thr Phe Pro Leu Lys Ala Ser Asp Sex Arg Trp Het 

100 105 110 

He Arg Thr Gly His Glu Leu Asp Phe Ala Leu Met. Asp Asn Fro Ala 

115 120 125 

Asn Asn Ala Mis phe Tyr Ser Leu Leu Tyr Asn Gly Ser Ala Ala Ala 

130 13b 140 

Leu Tyr Arg Leu Gly Ala Lys His Leu Arg Ala AJ a Trp Met Asp Asn 
145 ISO 155 ISO 

Leu Arg Leu Ala Phe Gly Pro Gly Leu Glu lie Gly Leu Gly Gly He 

165 170 175 

Tyr Ser Thr Arg Asn Gly Asn Asn Pro Ala Thr Lsu Lys Leu Tyr Thr 

180 185 190 

Asn Ala He Ala Gin Ala Ser He Gly Tyr Tyr Val Pro Ser Glu Thr 

195 200 205 

Phe Pro Leu Tyr Phe Arg Leu Leu Ser Gin He Asn Leu Phe Gly Il« 

210 215 220 

Ala Tyr Gly Asn Gly Phe Gly Glu Ser T/r Tyr Glu Asn Phe Leu Leu 
225 230 235 240 

Asn Asn Gly lis Ala Gly Ser Leu His Phe Thr Tyr Pro Gly Lys Phe 

245 250 255 

Thr Arg Phe Thr Thr Leu He Thr Ala Asp He Pro He Arg Asn Phe 

260 265 270 

Cys Thr Leu Arg Val Gly Tyr Arg Tyr Ser His Leu Gly Ser Ser Leu 

275 280 285 

Asn Ala Leu Asp Thr Arg He His Ser His Thr Ala Phe He Gly Phe 

290 295 300 

Val Thr Giu Phe Tyr Arg Phe Arg Gly Arg Lys Ala Met Asn Thr Gly 
305 310 315 320 

Arg Arg Thr Ser Leu Tyr Tyr HI a Asp 
325 

{?.) IHtrORidATION FOR SEQ ID NO: 363 

(i) SEQ'-IENCE CHARACTERISTICS: 

(A) LENGTH: 319 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

<vl) ORICIHAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

(Ix) FEATURE: 

(A) HAl IE/KEY: mi«c_f eature 

(B) LOCATION 1...319 





(xi) SEQUENCE DESCRIPTION: 


SEQ 


ID NO: 363 










Dhe 


Thr Ser 


Gly 


Thr 


He 


Phe 


Val 


Arg 


He 


Al,i 


He 


Ser 


Gly 


Arg 


Val 


1 






5 










10 








15 




Val 


Cys TVrg 


Glu 


Val 


Glu 


Asp 


Asn 


Gly Arg 


Arg 


Giu 


Ala 


Val 


Arg 


His 






20 










25 










30 




Asp Gly Met 


Val 


Val 


Thr 


Leu 


Leu 


Val 


He 


Val 


Gly 


He 


Val 


Val 


Val 




35 










40 








45 








Val 


Arg Tyr 


Ser 


Leu 


Arg 


Val 


His 


Val 


His 


Lys 


Thr 


Gly 


Thr 


Val 


Val 




50 








55 








60 








Ser 


Ala Ala 


He 


Phe 


Gly 


Phe 


He 


Leu 


Leu 


Gly 


Lys 


Thr 


Val 


Pro 


cys 


65 








70 










75 










80 


Asp 


Thr Arg 


Asn 


Phe 


Phe 


Ser 


Ser 


Glu 


Ser 


Asp 


Glu 


Pro 


Glu 


Ser 


Arg 








85 










90 










95 


Val 


Ala Thr 


Glu 


He 


Ala 


His 


Leu 


Cys 


Glu 


He 


Gly 


Phe 


Gin 


He 


His 






lOO 










105 








ilO 
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Ala Ser Ser He His Val Ala Val Arg Thr Asp Phe Gly Gin Ala Gly 

120 125 
He His Cys Pro Met Ala Thr Asp Ala Ser Ala Thr Glu Phe Asp Arg 

130 135 
Ser Ala Glu Cys Ala Glu Arg Thr Ser Ala Gin He Asp Thr Ala II- 
150 155 160 



10 



15 



Arg Ser GXn Ser Gin He He Arg Thr His He Asp Thr Cys Pro Ly- 

i65 170 
Sei Ser Gly Thr He Gly Gly Ser Thr His Thr Ser Leu His Leu Lvs 

135 190 
Val Phe Asp Gly Arg Gly Glu Val Gly His He His Pro Lys Asp Gly 
IP 200 205 ^ 

Leu ^rg Phe Gly Val Val Glu Gly Tyr Ser He Gly Ser Tyr Val Asp 

-1*^ 215 220 

Ala He Gly He Gly Ala Thr His Ala Lys Ala Gly He Ser Asp Thr 
230 235 240 

Arg Thr Gly He Ala Gly Gly Tyr His Gly Gly Ser Gin His Gin Gin 

245 250 255 

He Gly Asp Val Thr Thr He He Gly Leu Gly Lys Phe Gly Leu Ala 
^ 260 265 270 

Asr. Val Gly Val Ser Asp Arg Gly Phe Arg Arg Gly Ala Ser Ser Tyr 

280 285 
Asp Leu Tyr Gly Leu Glu Leu His He Ala Lys Thr He Tyr Leu Al^ 
295 300 



25 



40 



45 



V^l Gly Asp Gly Gys Leu Cys Arg Gin Ala Glu Arg Gin Asp Gly 



315 

(2) IM FORMATION FOR SEQ ID NO: 364 

30 (i) szQUzncz characteristics: 

(AJ LENGTH: 614 amino acids 
(B) TYPE: amino acid 
(D) TOPOTOGY: linear 

35 Ui) MOLECULE TYPE: protein 

(iii» HYPOTHETICAL: YES 



(vi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas gingival is 

(ix> FEATURE: 

(A) NAtlE/KEY: mlfic_f eature 
IB) LOCATION 1. . .614 

(::i) SEQUENCE DESCRIPTlOf^: SEQ ID HO: 364 

Pro Tyr Asn Val Gin Ser He Ser Asn Lys Thr He Lys Lys Gin Met 

50 Glu Asn Leu Ly3 Asn He Gin Pro Arg Glu Arp Phe Asn Trp Glu Glu 
20 25 30 

Phe Glu Ala Gly Gly Val His Ala Ala Val Ser Arg Gin Glu Gin Glu 

3S 40 45 

Ala Ala Tyr Asp Lys Thr Leu Asn Thr He Lys Glu Lys Glu Vai Val 

55 60 
Met Gly Arg Val Thr Ala He Asn Lys Arg Glu Val Val He Asn Val 

70 75 
Gly Tyr Lys Ser Glu Gly Val Val Pro Ala Thr Glu Phe Arg Tyr Asn 

bU Pro Glu Leu Lys Val Gly A^p Glu Val Glu Val T^^r He Glu Asn Gin 
. . lOS 110 

Olu Asp Lys Lys Gly Gin Leu Val Leu Ser His Arg Lys Gly Arg Ala 

120 125 

65 ^30 ^^"^ 135 

Val Lys Gly Tyr Val Lys Cys Arg Thr Lys Gly Gly Met He Val Asp 
ISO 

Val Phe Gly He Glu Ala Phe Leu Pro Gly Ser Gin He A^p Val Arg 

70 Pro He Arg Asp Tyr Asp Ala Phe Val GUi Lys Thr Met Glu Phe Lys 

185 190 

He Val Lys He Asn Gin Glu Tyr Lys Asn Yal Val Vai Ser His Lys 

200 205 

75 ^^"^ ^-'^ ^^"^ lie lie Gly 

215 220 
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hys Leu Glu Lys Gly Gin Val Leu Gla Gly lie Val Lys Asn lie Thr 
225 230 235 240 

Ser Tyr Gly Val Phe lie Asp Leu Gly Gly Val Asp Gly Leu lie His 
245 2S0 255 

D He Thr Asp Leu Ser Trp Gly Arg Val Ala His Pro Glu Glu lie Val 
260 265 270 

Gin Leu Asp Gin Lys He Asn Val Val He Leu Asp Phe Asp Glu Asp 

275 280 285 

Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met Pro His Pro Trp 
10 290 295 300 

Asp Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Lys Val Lys Giv Lvs 
305 310 315 320 

Val Val Val Mot Ala Asp Tyr Gly Ala Phe Val Glu He Ala Gin Gly 
325 330 335 

lb Val Glu Gly Leu He Hlc Val Ser Glu Met Ser Trp Thr Gin His Leu 
340 345 
Aig Ser Ala Gin A.-p Phe Leu His Val Gly Asp Glu Val Glu Ala Val 

355 360 365 

He Lsu Thr Leu Asp Arg Glu Glu Arg Lys Met Ser Leu Gly Leu Lvs 
20 370 375 380 

Gin Leu Lys Pro Asp Pro Trp Ala Asp He Glu Thr Arg Phe Pro Val 
385 390 395 400 

Gly Ser Arg His His Ala Arg Val Arg Asn Phe Thr Asn Phe Gly Val 
on 410 415 

ZD Phe Val Glu He Glu Glu Gly Val Asp Gly Lsu He His He Ser Asp 
420 425 430 

Leu Ser Trp Thr Lys Lys H- Lys His Pro Ser Glu Phe Thr Glu Val 

435 445 
Gly Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn Arg 

4S0 455 460 

Arg Leu Ser Leu Gly His Lys Gin Lou Glu Glu Asn Pro Trp Asp Val 

470 475 
Phe Glu Thr Val Pho Thr Val Gly Ser He His Glu Gly Thr Val He 
«c 485 490 495 

Glu Val Met Asp Lys Gly Ala Val Val Ser Leu Pro Tyr Gly Val Glu 

500 505 510 

Gly Phe Ala Thr Pro Lys His Met Val Lys Glu Asp Gly Ser Gin Ala 

515 520 525 

Val Leu Glu Glu Lys Leu Pro Phe Lys Val He Glu Phe Asn Lys Asp 
'^0 530 535 54Q 

Ala Lys Arg He He Val Ser His Ser Arg Val Phe Glu Asp Glu Gin 
545 550 555 560 

Lys Met Ala Glu Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu Ala Lys 
AK 570 573 

Ala Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro Ala Gin Ala 

580 585 590 

Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu Lys 

595 600 605 

Glu Lys Leu Ssr Glu Asn 

50 610 



55 



60 



65 



70 



75 



(2) INFOW'IATIOJ FOR SEQ ID NO: 365 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 24 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

I XX.) FEATURE: 

(A) NAME/kEY: mlsc feature 
{B) LOCATION 1. , .243 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 365 

Ser Thr He Met Lys Lys Ala He Leu Sec Gly Ala Ala Leu Leu Leu 



1 5 



10 15 



Gly Leu Cys Ala Asn Ala Gin Asn Val Gin Leu His Tyr Asp Phe Glv 
- 20 25 30 
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His S*r lie Tyr A^p Glu Leu Asp Gly Arg Pro Lys Leu Thr Thr Thr 

Val Glu Asn Phe Thr Pro Asp Lys Trp Gly Ser Thr Phe Phe Phe He 

Asp Met Asp Tyr Thr Gly Lys Gly He Gin Se r Ala Tyr Trp Glu Zle 

Ser Arg Glu Leu Lys Phe Trp Gin Ala Pro vfl Ser He His Leu Glu 

90 95 

Tyr Asn Gly Gly Leu Ser Thr Ser Phe Thr Phe Gly His A,p Ala Leu 

105 1 1 n 

He Gly Ala Thr Tyr Thr Tyr Asn Asn Pro Ser Phe Thr Arg Gly Phe 

11= 120 125 

Thr lie Thr Pro Met Tyr Ly, „i<, As,p Phe Hi, Thr 

Tyr Gin He Thr Gly Tl.r Tr| Tyr Met Hl^ Phe J.eu Asp Gly Leu Leu 

155 1 fiO 

Thr Phe Asn Gly Phe Leu Asp Leu Trp Gly Phe Pro Gin Glu Asn Pro 
1^5 

He Gly Gly Pro Val Leu Lys Glu Gly Asp Lys Phe Val Phe Leu Ser 

^° las 

Glu Pro Gin Phe Trp He Asn Leu A.,n Arg He Lys Gly He Asp Lys 

200 205 

Asp Phe Asn Leu Ser He Gly Thr Glu Met Glu He Ser Arg Asn Phe 



25 Ala Arg Met Asp Lys Phe Ser Cys He Pro Thr Leu Ala Val Lys Trp 



225 230 
Thr Fh- Asn 



235 240 



30 {2} INfORMATiag FOR SBQ ID NO:366 



35 



50 



55 



65 



75 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 
HYPOTHETICAL: YES 

40 

tvi) ORIGINAL SOURCE: 

{A) ORGANISM: Porphyromonas glnglvalis 

i i-^ ) FEATURE : 

(A) NA^iE/KEY: misc featur* 

(B) LOCATION 1...235 

(Xi) SEQUENCE DESCRIPTION: SEQ ID MO: 366 

Met Lys Thr He Ser Lys Asn His Ala Ala Arg He Cys Ala Ala He 

Ala Leu Phe Ala Val Cys Asn Gly Arg He Ala Ala Gin Asp Phe Leu 

Tyr Glu He Gly Gly Gly Phe Gly Ala Gin Tyr Phe G^y Asp Ala 

Asn Arg Gly L*u Phe Gly Ser Ser Gly Val Gly Leu gL Leu Val Gly 

55 5Q 
Arg Tyr Asn Tyr Asn Phe Arg Trp Ala Phe Ser Thr Het Leu Asp Trp 

60 Arg Thr Leu Arg Gly Asp Thr Asp Lys s«r Gly Asn Val Phe Pro Asp 

90 

Phe Ala Gin Ala Asp Phe Lys Val Gly Leu Thr Gin Leu His Val Arg 



ser Glu Phe As'n Phe Leu Pro Tyr Ser Asp Gly Tyr Lys ^yr Leu Gly 

120 125 
Val Ala Ala Gly Lau 

i35 



■'■^^ 125 

Thr Ala Arg I*u Ser Pro Tyr Val Ala Ala Gly Leu Ser Leu Gly Phe 

Ala ser Gly Ala Lys Gly Ser Ala Phe Ala Pro G^y He Thr Ala Gly 

70 Met Gly Val Lys Tyr Lys Leu Lys Pro Arg He Asn Val Gly He g" 

Tyr Ser Phe Thr Gly I.eu Leu Thr Asp M= Leu Asp Ala Leu ^hr Asp 

Lys Ser Val Trp Leu Glu Asp Pro Tyr Lys He Asn Asp Sor Trp Val 

•"^ 200 - — -. 
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Lys Asri Lys Asp Ala Thr Gly Ala Leu Val Lsu Arg He Thr Tyr Asp 

210 215 220 

Ph!^ Gly Leu Arg Lys Thr Phe Cys A^n Lys Gin 
225 230 235 



10 



40 



(2) IIIFOW-tATiaJ FOR SEQ ID NO: 367 

(i) SEQIIENCE CHARACTERISTICS: 

(A) LENGTH: 436 amino acids 
IB) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

15 t i i 1 ) H Y POTHETICAL : YES 

(vi) ORIGTIIAL SOURCE: 

(A) ORGANISM! Porphyrojnonas gingiva lis 

20 (Ix) FEATURE: 

(A) NAIiE/KEY: laisc: feature 

(B) LOCATION 1. . .43 6 

2g SRQUEHCE DESCRIPTIOM: SEQ ID HO: 367 

A^n He Met Tyr Lys Asp Tyr Lys Gly Leu Tyr Ala Ser Leu Arg Trp 
^ ^ 10 15 

Tyr Ala Leu He He Gly Leu Leu Phe Ala Ala Asp Gly He Gin Ala 
on 25 30 

Gin Asn Asn Asn Phe Thr Glu Ser Pro Tyr Thr Arg Phe Gly Leu Glv 

35 40 45 

Arg Leu Gly Glu Arg Thr Thr He Ser Gly His Ser Met Gly Gly Leu 

55 60 
Gly Val Gly Leu Arg Gin Gly Thr Tyr Val Asn Ala Val Asn Pro Ala 
65 70 75 80 

Ser Tyr Ser Ala Val Asp Ser Met Thr Phe He Phe Asp Phe Gly Ala 

flS 90 95 

Ser Thr Gly He Thr Trp Tyr Ala Giu Asn Gly Lys Lys Asp Asn Arg 

iOO 105 
Lys iret Gly Asn He Glu Tyr Phe Ala Met Leu Phe Pro He Ser Lvs 

120 125 
Ser He Ala Met Ser Ala Gly Val Leu Pro Tyr Ser Ala Ser Gly Tyr 

45 Va2 ^^"^ val Gin Tyr Thr 

155 160 
Arg L/f Tyr Leu Gly Thr Gly Asn Leu Asn Asp Leu Tyr Val Gly H- 

165 170 175 

Gly Ala Thr Pro Phe Lys Asn Phe Ser He Gly Ala Asn Ala Ser Ser 

cn ^ *^ 190 

Leu Phe Gly Arg Phe Thr His Ser Arg Gin Val He Phe Ser Thr Glu 

1^- 200 205 

Ala Pro Tyr Asn Pro Val His Leu Ser Thr Leu Tyr Leu Lys Ala Ala 
215 220 

55 ^X^^ ^y' Ser Asp Arg 

230 235 240 

ser T.eu Val He Gly Ala Val Tyr Ser Pro Arg Val Lys Met His Ser 

245 250 255 

Glu Leu Thr Gin He Lys Asn Gin Val Gin Asn Gly Val Val Val Glu 
en 265 270 

eO Ser Glu Thr Gin Glu Tyr He Lys Gly Met Asp Tyr Tyr Thr Leu Pro 
275 2B0 2BS 

His Thr Leu Gly He Gly Phe Ser Tyr Glu Lys Lys Asp Lys Leu Leu 
295 300 

65 305 '"^^ "''^ "^"^ fio '^^'^ ^" ^^"^ ''''^ ^^'^ ""^^ "-^^ "^^^ 

hyz Ser Asp Cys Lys Phe Gin Asp Arg He Arg Val Ser Leu Gly Gly 

325 330 
Glu He He Pro Asp He A.-n Ala Val Gly Met Trp Pro Lys Val Arg 
TCI -^^^ 345 350 

/U Tyr Arg Phe Gly Leu His Gly Glu Asn Ser Tyr Leu Lys Val Pro Thr 
355 360 365 

Lys Gly Gly Val Tyr Gin Gly Tyr His He Val Gly Ala Val Phe Gly 
3^0 375 

75 \il ^^"^ ^""^ ^^"3 Val Asn Val Ser Leu 

39<5 395 400 
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(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

{A> ORGAHISJl: Porphyroinonas gingiva lis 

<ix) FEATURE: 

(A) NAME/KEY: miec_feature 

(B) LOCATIOI^ 1. . . 945 

(xi) SEQUEMCE DESCRIPTION: SEQ ID NO: 368 
Asn Thr He Ser Glu Asn Gly Asn Asp Ser Thr Tyr Phe Ser Phe Gin 



1 5 



10 15 



35 so 



Thr Phe Leu Met Arg Ser He Tyr Gin Leu Leu Leu Ser He Leu Leu 

20 25 30 

Ala Ser Leu Gly Phe Val Gly Leu Glu Ala Gin Gin Ala Gly Val Ala 

35 40 45 

Gly Arg Val Leu Asp Glu Glu Gly Asn Pro list Ho Gin Ala Asn Val 



55 60 



Cln Leu Val Gin Ser Thr Gly Gin Val Ala Val Ala Ala Gly Ala Thr 

70 75 QQ 

Asn Glu Lys Gly Leu Phe Ser Leu Lys Thr Ser Gin Glu Gly Asp Tyr 

85 90 95 

He Leu Arg Val Ser Tyr Val Gly Tyr Thr Thr His Asp Glu Lys He 

100 105 110 

Ser Leu Arg Asn Gly Gin Thr He Thr Leu Lys Asp He Ser Met Asn 

115 120 125 

Glu Asp Ala Arg Leu Leu Gin Ser Val Thr Val Gin Ala Lys Ala Ala 

130 135 j^^Q 

Glu Val Val Val Arq Asn Asp Thx Leu Glu Phe Asn Ala Gly S-r Tvr 
l^"^ 150 155 ' 

Thr Val Ala Gin Gly Ala Ser He Glu Glu Leu He Lys Lys Leu Pro 

cn "^^^ I'^o 170 

Gly Ala Glu He Gly Ser Asp Gly Lys He Thr He Asn Gly Lys Asp 

ISO 185 190 

He Ser Lys He Leu Val Asp Gly Lys Glu Phe Phe Ser Lys Asp Pro 
195 200 205 

^^"^ Asp Met Val Asn Lys Val Gin 

210 215 220 

Val Leu Asn Lys Leu Ser Glu Leu Ser Arg Met Ser Gly Phe Asp Asp 
225 230 235 240 

Gly Glu Glu Glu Thr Val He Asn Leu Thr Val Lys Pro Glu Lys Lys 

245 250 255 

Lys Gly Leu Phe Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg 

2^0 265 27 0 

Tyr Met Ala Gly Gly Asn Val Asn Arg Phe Asp Gly Asn Lys Gin Trp 

27 5 280 295 

Thr Leu He Gly Ser Ala Asn Asn Thr Asn Asn Met Gly Phe Ser Glu 

2S0 295 300 

net Asp Ser Glu Net Gly Ser Met Thr Phe Phe Ser Pro Gin Gly Gly 
310 315 320 

Gly Arg Arg Gly Phe Gly Asn Ser Gly Gly Val Thr Ser Ser Ser Met 
in r ■'^^ 330 335 

/U Leu Gly Gly Asn Phe Ser Val Glu Phe Ser Ser Ala Leu Asn Thr Giv 
. 345 350 

Gly Asp Ala Arg Tyr Gly Tyr Asn Asp Lys Ala He Glu Thr Thr Lys 
355 360 365 

75 ^''^ Xnn ^^"^ '^'^ ^^'^ Thr Tyr Met Asp Glu Asn 

370 375 380 
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Glu Tyr Asp Arg Leu He Pro Lys Glu Gly Met He Lys Glu Asn Ala 

405 410 415 

Leu Lys Leu Thr Phe Gly Leu Thr Phe Asn Glu Ser Trp Phe Lys Lys 
^ 420 425 430 

D Leu Lys Leu Asn 
435 

(2) IMFORI'IATION FOR SEQ ID HO: 368 

^0 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 94 5 amino acids 
(BJ TYPE: amino acid 
<D) TOPOLOGY: linear 
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lie L,u Glu Ar, Se. Ph« Ser Hi, A,n Gly Gin Ala Ar« Phe Arg Her 
Gin Trp Ly« Pro s„ Glu Arg Thr Glu Val v!l Phe Glu Pro Asp Leu 
5 s=r He ser Ly, He Asp Gly Phe Ph. A^p Thr Tyr Glu ^Jr Ly. 

Asp Ala Thr Gly He Ser He Asn ^ys Gly Sor He His Gin Thr Thr 

10 «=P Ser His I.y, 

Leu Asn Asp Glu Gly Arg Thr He Ser Ala s=r Ser Gly Gly Leu 

Thr Asp Glu ASP Gly Asp Gly He Tyr Gin ill val Leu Gin Ser Vaf 

•1C 4o5 490 jor 

15 Glu Thr Asn Gin Lvs Gin Phe Aon jy^r^ <r n ^ 

T.eu Arg Leu Ser Tyr Val Glu Pro Leu Gly Lys Asn Tyr pje Ala Gin 



Ala He Leu Asn Arr, Arg Phe sl^ Arg Arg Asn Ser Asp Arg Glu Val 

^35 5«I0 
Tyr Arg Leu Gly Asp Asp Gly Gin Tyr Ser He Leu Asp Ser Gin Tyr 
r^" SS5 

25 Leu Lys Lys He Al" Lys Thr Trp Asp Thr Val Gly Phe A^n Val 



Gly Leu Ser Tyr Ser Asn Glu Phe Thr Gin Tyr Arg He Gly Leu ^'n 



585 



Asp Pro Asn Arg Thr Val Ser Tyr Arg Ser Val Ala Gly Vai Glu Gin 

600 



Asp Lys Leu Ala Phe Asn Arg Va"! Asn Leu Ser Pro He" Lou Arg He 

olo 620 
As„ Tyr Lys Pro Ser Arg Thr Thr Asn Leu Arg Val Asp Tyr Arg Gly 

Arg Thr Thr Gin Pro Ser He Asn Gin He HI Pro Val Gin Asp He 

35 Thr Asn Pro r.eu Phe Val Thr Glu Gly "n Pro Gly Leu Lys Pro Ser 

Tyr ser Asn Asn Val Met Ala Met Phe Ser Asp Phe Asp ^a Lys Ser 



Gin Arg Ala Phe Asn He Val Pho Phe Gly Asn Tyr Thr Phe Asp Asp 

o95 700 
He val Pro Asn Thr His Tyr Asp Pro Ser Thr Gly Ho Arg Thr Thr 

Arg Tyr Glu Asn Ala Ser Gly Thr Trp Gin III Asn Leu His Gly Thr 

45 Leu ser Leu Pro Leu Lys Asn Arg Ala Phe Ser Phe Arg Met Ser Leu 



Phe Asn Arg Leu Ala Glu Gly Gin Ser Phe He Asn Asp I^p Lys Asn 

'60 Tfi"^ 
Lys Ala Leu ser Ph. Arg Thr Arg Glu Arg Leu Thr Leu Thr Tyr Arg 



750 

Ser Phe lie Aan Asp 

■'SO 

Arg Glu Arg Leu Thr 
Asn Asn Trp He Asp Thr sir He Gly Gly Asn III Gly Phe Tyr Met 
Ala Asn Asn Ser Leu Ser Gly Gin Lys Asp Ser Axg Thr Tyr Asp Phe 
55 Gly Gly Asn Tyr Gin Val Ala Leu Thr Leu Pro Tyr Gly Phe Ar| He 
ASP ser Asp Val Glu Tyr Asn Thr fin Ser Gly Tyr Ser Gly Gly Phe 



60 



ser Leu Asp Glu Trp Leu Trp is; Ala Ser Leu Ser ^yr Ser Phe Leu 



855 



Arg Asp Lys Ala Gly Thr HZ Arg Val Asn Gly ?yr Asp He Leu Gly 



Gin Arg Ser Ser He S.l Arg Ser Ala ser 'la He Asn Ho Glu clu 

ce ao3 BOO A 

65 ser net Ser Asn Thr. XI. Gly Arg Tyr Val Me. Val Asp P.e HI Ty. 



Arg Phe A.n Ala Phe Ser Gly Gly Gly Ser Arg Ser A.p Hll Gin Arg 

70 Tzo ''^^ til <^^y oly Arg Arg Pro 

Scr 
945 

(2) INFORMATION FOR SEQ ID NO: 3 69 



PCT/AU98/01023 



SUBSTITLniE SHEET (Rule 26) (RO/AU) 



wo 99/29870 PCT/AU98/01023 



300/ 490 



ti) SEQUENCE CHARACTERISTICS: 

tA) IXNGTH: 412 amino acids 
(By TYPE: amino acid 
(Oy TOPOLOGY: linear 

(11) MOLECULE TYPE; protein 

tiii) HYPOTHETICAL: YES 

(vl> ORIGIfJAL SOURCE: 

(A) ORGAhdSht: Po rphyromonas gingivalls 

(ixj FEATURE: 

(A) NAftE/KEY: misc feature 

(B) LOCATIW 1 . . .412 



(xi} SEWEMCE DESCRIPTION: SEQ ID HO: 369 



Gly 


Glu 


Tyr 


Pro 


Ala 


Asn 


Ser 


Asn 


Asp 


Lys 


Lyc 


Glu 


Met 


Val 


Met 


Lys 


1 








5 










10 










15 




Leu 


lie 


Lys 


Arg 


Ser 


Leu 


Leu 


Leu 


Leu 


Gly Ala 


Vai 


Leu 


Leu 


He 


Thr 








20 










25 










30 






Leu 


Pro 


Ala 


Tyr 


Ser 


Gin 


Asn 


Asp 


Asp 


He 


Phe 


Glu 


Ayp Asp 


He 


Tyr 






35 










40 










45 








Thr 


Ser 


Arg 


Lys 


Glu 


He 


Arg 


Lys 


Gin 


Asn 


Gin 


Val 


Lys 


Asp 


Trp 


Gin 




50 










S5 










60 










Asn 


Gin 


Glu 


Asp 


Gly 


Tyr Gly 


Asp 


Asp 


Thr 


Glu 


Tyr 


Thr 


Val 


Ala 


Sec 


65 










70 










75 










80 


Asp 


Arg 


Asp 


He 


Asp 


Ala 


Tyr 


Asn 


Arg 


Arg 


Asp 


Gly 


Gin 


Ser 


Tyr 


Asp 










85 










90 










95 




Gly 


Lys 


Lys 


Leu 


Ser 


Lys 


Asp 


Lys 


Lys 


Arg 


Asp 


Ser 


Thr 


Arg 


Ser 


Ser 






100 










105 










110 






Vai 


Pro 


Gly 


Axg 


Tyr 


Ser 


Arg 


Arg 


Leu 


Ala 


Arg 


Phe 


Tyr 


Lys 


Pro 


Asn 






lis 










120 










125 








Thr 


He 


Val 


He 


Ser 


Gly Ala 


Asp 


Asn 


Val 


Tyr 


Val 


Thr 


Asp Asp 


Gly 




130 










135 










140 










Giu 


Tyr 


Phe 


Val 


Tyr 


Gly Asp 


Glu 


Tyr 


Tyr Asp 


Asp 


Ala 


Ser 


Ser 


Val 


145 










ISO 










155 










160 


Asn 


He 


Tyr 


He 


Asn 


Ser 


Pro 


Trp 


Cys 


Asp 


Pro 


Phe 


Pro 


Tyr 


Thr 


Ser 










165 










170 










175 




Trp 


Tyr 


Pro 


Ser 


Phe 


Ser 


Gly 


Trp 


Tyr 


Asn 


Tyr 


Thr 


Trp Asn Tyr 


Pro 








IBO 










185 










190 






Trp 


Phe Tyr 


Tyr 


Gly 


Ser 


His 


He 


Gly 


Trp Gly 


Gly 


Tyr 


Tyr 


Pro 


Gly 






195 










200 










205 








Tyr 


Asn 


Trp 


Tyr 


Trp 


Ser 


Tyc 


Tyr 


Tyr 


Asp 


Pro 


Phe 


Tyr Asn 


Pro 


Tyr 




210 










215 










220 










Gly 


He 


Gly 


Met 


Gly 


Trp 


Gly 


Tyr 


Pro 


Tyr 


Gly 


Trp 


Gly 


Ser 


Tyr 


Tyr 


225 










230 










235 










240 


Gly 


Trp 


Gly 


Gly 


Tyr 


Pro 


Gly 


Val 


He 


His 


His 


Tyr 


His 


His 


Tyr 


Pro 










245 










250 










255 




Lys Lys 


Thr 


Tyr 


Ser 


Asn 


Gly 


Gin 


His 


Ser 


Gly 


Ala 


Tyr 


Tyr 


Ser 


Tyr 








2€0 










265 










270 






Gly Arg 


Pro 


Asn 


Arg 


He 


Lys 


Gly 


Gly 


Thr 


Ser 


Gly 


Ala 


Lys 


Leu Gly 






27 S 










280 










285 








Thr 


Gly 


Arg 


Tyr 


Asp 


Arg 


He 


Gin 


Asn 


Ser 


Ser 


Ser 


Gin 


Lys Asn 


Ly5 




290 










2 95 










300 










Phe 


Gly Leu 


Gin 


Ser 


Asn 


Lys 


Pro 


Asn 


Asn 


Asn 


L<2U 


Gin 


Asn 


Val 


Lys 


305 










310 










315 










320 


Ser 


Gly Arg 


Thr 


Gly 


Arg 


Ala 


Asn 


Arq 


Asp 


Arg 


Asn 


He 


Glu 


Thr 


Val 










325 










330 










335 




Thr 


Pro 


Asn 


Asn 


Gly 


Gin 


Lys 


Gin 


Aen 


Arg 


Pro 


Val 


Phe 


Gin 


Gin 


Asn 








340 










345 










350 






Gin 


Ser 


Gly 


Asn 


Asp 


Arg 


Pro 


Thr 


Gly 


Arg 


Asn 


He 


Arg 


Ser 


Glu 


Arg 






355 










360 










365 








Gin 


Gly Glu 


Asn 


Asn 


Asp Arg 


Thr 


Phe 


Ser 


Thr 


Pro 


Ser Arg 


Ser 


Asn 




3/0 










375 










380 










Ser 


Asn 


Gly 


Gly 


Phe 


Ser 


Thr 


Pro 


Ser 


Arg 


Ser 


Ser 


Ser 


Gly 


Ser 


Met 


365 










390 










395 










400 


Ser 


Gly 


Gly 


Gly 


Gly 


Axg 


Ser 


Gly 


Arg 


Gly Arg 


Asn 


















405 










410 














(2! 


INFORMATION 


FOR 


SEQ ID NO: 370 

















(1» SEC3UENCE CHARACTERISTICS: 

(A) LENGTH: 601 amino acids 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



»r=ii -tr™* ipfii „S -.*i= -«t«ii» tP=5) .tF«i) 

lUi o -iL ^o: ^oi o m u o iir. o lui lui 
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301/490 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

b 

(ill) HYPOTHETICAL; YES 

(vi) ORIGIIJAL SOURCE: 

ORGANISM: Porphyronvonas gingivaiis 
(ix) FEATURE: 

(A) NA/1E/KEY: misc feature 

(B) I,OCATIO{J 1 . . .601 

SBQUEMCE DESCRTPTIOM: SEQ ID IJO:370 
Ser A-.n Scr Ser Ser Hi^ Lys Trp Leu lie Tyr Tyr His He Glu Ly: 



2^ Thr Ly. Ser II. Met He Arg Lys Leu iL Leu Leu Le. Ala ^eu Met 
Ala 

35 4 0 - ;r 



Pro Vol Al. llr V.l Aia PI.e Ala "l Pro Thr A^p S„ Thr Glu Ser 



Ly, A^p Asn Arg He Leu Thr Ser H«t Gin Ser S,r Ser Leu Asn Ary 
Z& A^p Asp Ala Pro A^p Lys Trp Gin Pro Met His Ala Asn Phe Ser Tl^ 
Gin Ser A.p H.t L=,. Leu Ser Thr Al^ Gin Lys Ser Lys A.n Thr ?rp 

30 too III '^'^ '^'^ -rvr "u Glu 

Phe Gly Al| Arg Phe Glu A.p Leu ^yr Ly3 Pro Leu Pro G^y Hi« Glu 

120 



Pro Glu Met Gly Arg Gly Val Hi. Met Tyr Val Lyl Gly Ser Tyr 



105 

Tyr Lys Pro Leu 
His Met Tyr Val 

35 His Al. Glu Leu Thr mH Gly A.p p^,, Tyr Hp Gin Phe Gly Ser 



40 



Gly Met V.X Phe Ar| Thr Tyr Glu Glu Ar, A.' Leu Gly He Asp 

170 nc 
Ala Val Arg Gly Gly Arc, He Val Leu Thr Pro Phe Asp Gly Val Arg 

val ,,y. Cly xie Ala Gly Gin Gin "| A.n Tyr Phe A^p jlrg Thr Gly 

200 20S 
by, val Phe A=n Ser Gly Ar, Gly Tyr Leu Leu Gly ser Asp Leu Glu 
, " Z15 220 

45 Leu Asn val Glu Arg Trp Ser Ser Ala ,let Arg A.p A.n Asp Tyr Hi. 

Leu Ala He Gly Gly Ser Phe Val Ser Ly. Hi' Olu Ala Asp Glu ^p 

He Phe Val Gly Val Gly Glu A«p Arg Ly^ Arg Leu Asn Leu III Leu 

A^n Val Pro He Het Gly Leu Arg Thr A^n Phe Gin Ly, cly Gly Leu 

Ala Leu Tyr Ala Glu Tyr Gly T^r Lys Tyr Asn Asp Pro Ser Ala Asp 

Gly Gin Ala 

315 



55 As„ Asp Tyr He Tyr His Asp Gly Gin Ala Ala Leu Leu Ser Ala Ser 



60 



Tyr ser Lys Lys Gly Met Ser He Leu Leu Gin Ala Lys Arg Cys Glu 
As„ Phe Ala Phe Arg Ser L^. Arg Ser ^a Gin Leu Thr Pro Leu Met 
He Asn Tyr Met Pro Ala Phe Thr Gl'n Ala His Thr Tyr Thr Leu Ala 
Ala lie Tyr Pro Tyr Ala Thr Gi° Pro Gin Gly Glu Ala Phe Gin 
65 Gly Glu Leu Arg Tyr Asn Phe Ala Arg Arg Thr Ma" Leu Gly Gly Arg 



Tyr Gly Thr Gly Leu ll^ He Asn Val Ser fus Val Arg Gly Leu Asp 



410 

70 "-''^ "^^^ =1" 1"" Gil' Thr Glv 

Tyr Thr Va, Ser Phe Phe Gly Met Gl'y Asp Leu Tyr Tyr Ser Asp He 
ASP val Glu He Thr Lys Lys Val Ser Pro Gly Phe til P.e Thr Leu 

75 Thr Tyr Leu Asn Gin He Tyr Asn Asn Lys Val Leu His Gly Ala Ala 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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302/490 



465 








470 










475 








400 


GJy 


Glu Lys 


Pro 


Glu 
465 


Lys 


lie 


Tyr 


Ala 


Asn 
4 90 


He 


Phe 


Val 


Tyr Asp 
4 95 


Gly 


Lys 


Tyr Lys 


Leu 


Ser 


Asn 


Lys 


Val 


Ala 


Leu 


Arg 


Thr 


Glu 


Leu Gin 


Tyr 






500 










505 










51 0 


LfiU 


Ills Thr 


Lys 


Gin 


Asp 


Gin 


Gly 


Asp Trp 


He 


Tyr Gly Met Ala 


Glu 




515 










520 










525 






Lsu 


Ser He 


Leu 


Pro 


Ser 


Leu 


Met 


Lttu 


Ser 


Leu 


S-r 


Glu 


Gin Tyr 


Asn 




530 








535 










540 






11- 


C.lY Glu 


Thr 


Lys 


Lys 


His 


Tyr 


Val 


Wet 


Gly 


S«r 


Val 


Thr Tyr 


Thr 


Si 5 








550 








555 






560 


His 


Gly Ala 


His 


Arg 
565 


Val 


Ala 


Phe 


Ser 


Ala 
570 


Gly 


Lys 


Thr 


Arg Ala 
57 5 


Gly 


Itet 


Asn Cys 


Ser 


Gly Gly 


Val 


Cys 


Arg 


Val 


Val 


Pro 


Glu 


Thr Gin 


Gly 






580 










ses 










590 


Ph- 


Tyr Leu 
595 


Sec 


Tyr 


Ser 


Thr 


Asn 
600 


Leu 















12) INFORMATIOr^f rOR SEQ ID NO; 371 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 252 amino acids 
ID) TYPf: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: prot-ln 

Uil) HYPOTHETICAL: YES 



(vi) ORIGIHAL SOURCE: 

I A) ORGAMISM: Porphyromonas ginglvalis 

tix) FEATURE: 

iA) HAJ4E/KEY: mi£C_f eature 
(B) LOCATION 1. . .252 



(xi) SEQUENCE OESCRIPTIOW : SEQ ID MO: 371 



Arg 


Gly 


Ser 


Sec 


Ser 


Gly 


He 


Ser 


Ala 


Arq 


Gly Arg 


Asp Het 


Arg 


Ser 


1 








5 










10 










15 




Leu 


Phe 


Leu 


Ser 


Ala 


Leu 


Arq 


Ser 


Ser 


Ser 


Leu 


His 


Gly 


Ser 


Glu 


Arg 








20 










25 








30 






Arg 


Ser 


Arg 


He 


Ser 


Ser 


Ser 


Val 


Val 


Met 


Ser 


He 


Arg 


Gin 


Lys 


He 






35 










40 










45 






Arq 


Leu 


Phe 


His 


Leu 


Ser 


Val 


cys 


Ala 


Gin 


Thr 


His 


Aep 


His 


Leu 


He 




50 










55 








€0 








Glu 


He 


His 


Leu 


Val 


cys 


He 


Glu 


PhG 


Gly 


Ala 


He 


Asp 


Thr 


Asp 


Glu 


65 










70 










75 








80 


Phe 


Arg 


Leu 


Ser 


Scr 
85 


His 


Ala 


Tyr 


Thr 


Thr 
90 


Ser 


Pro 


Thr 


His 


Thr 
95 


Gly 


Ala 


He 


His 


His 
100 


Asn 


Cys 


He 


Glu 


Arq 
105 


Ser 


Tyr 


Gly 


Arg 


Tyr 
110 


Leu 


Val 


Thr 


Phe 


Gly 


Gin 


Glu 


Arq 


Asn 


Glu 


Leu 


His 


His 


His 


Ser 


Arq 


Pro 


Asp 






115 










120 










125 






Arg 


A;?n 


Ala 


Glu 


Val 


Tyr 


Arg 


Phe 


Pro 


Phe 


Asp Asn 


Ala 


Phe 


His 


Ser 




130 










135 










140 










Jle 


Arg 


Tyr 


Glu 


Ala 


Phe 


Arg 


Pro 


He 


Arg 


Pro 


He 


Val 


Cys 


His 


Asp 


145 










150 








155 










160 


A^p 


His 


Phe 


He 


Ala 
165 


He 


Gly 


Ser 


His 


Leu 
170 


Phe 


Phe 


Lys 


Asp 


Asn 
175 


Gin 


He 


Phe 


Ser 


Ser 
180 


Gly 


Ser 


Gin 


Tyr 


Asp 
185 


Asn 


Tyr 


Thr 


Val 


Ala 
190 


Cv^ 


Phe 


Val 


Glu 


Ser 
195 


Leu 


His 


Asp 


Arg 


Glu 
200 


Gin 


Arg 


Ser 


His 


Thr 

205 


Hi!? 


Thr 


Ala 


Ser 


Gly 


Thr 


Asn 


His 


Cys 


Ala 


Asp 


Leu 


Phe 


Asp Met 


Arg Thr 


Leu 


Ser 




210 










215 










220 










Gin 


Arg 


Thr 


Tyr 


His 


He 


Arg 


Asp 


He 


Val 


Ala A^p 


Phe 


Glu 


Phe 


Gly 


225 










230 










235 










240 


Gin 


Phe 


Leu 


Gly 


Arg 
245 


Phe 


Ala 


His 


Gly 


Leu 
250 


Asn 


His- 











12) 1NFOR14AT10N FOR SEQ ID HO: 372 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 90 amino acidff 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



E -43 «iL rc:: -oi « ^a^^ o o -iL-i! n J 



10 
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303/490 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

«vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

Ux) FEATURE: 

(A) MAIIE/KEY: mlsc_feature 

(B) LOCATIOH 1, . .790 



60 



15 (xl) SEQUENCE DESCRIPTIOM: SEQ ID MO:372 

Leu Leu Phe Ser Ser Pro Leu Pro Arq Met A^p Arg Pro Lys Pro Ser 

Tyr Tie Val Arg He Ala Ala He Leu Cys Leu The Val Giy Arg Pro 

20 25 30 

Leu Phe Ala Gin Ser Tyr Val A^p Tyr Vai Asp Pro L«u he Gly Thr 

Leu ser Ser Phe Glu Lea Ser Ala Gly Asn Thr Tyr Pro Val He Gly 

55 go 
Leu Pro Trp Gly »et A^n Ser Trp Thr Pro Met Thr Gly VaJ Pro Gly 

Asp Gly Trp Gin Tyr Thr Tyr S=r Ala His Zy.« He Arg Gly Phe Ly.^ 

Gin Thr His Gin Pro Ser Pro Trp He Asn Asp Tyr Gly Gin Ihe Ser 

105 

Leu Leu Pro Leu Thr Ala Pro Gin Lys Pro Ser Ser Asn Asp Ser He 

^■^^ 120 
Ala Leu Thr Lys Tcp Cys Lys Gin Lsu Phe Ser Asp Glu Gin Thr Ser 

Trp Phe ser His Lys Ala Glu Thr Ala Thr Pre Tyr Tyr Tyr Ser Val 
^ 155 1 #;n 

Tyr Leu Ala Asp Tyr A^p Thr Arg Val Glu Met Ala Pro Thr Glu A^q 

170 nc 
Ala A] a He Phe Arg He Arg Tyr S=r Gly Asn Thr Glu Ser Gly Ser 

185 190 
Gly Arg Trp Leu Arg Leu Asp Ala Phe Thr Gly Gly Ser Glu He Ser 

200 20S 
He val Asp Pro His Thr Val Val Gly H= Ser Arg Lys Asn Ser Gly 
AC 220 

45 Gly val Pro Ala Asn Phe Ala Q-s Tyr Phe He Leu Gin Ser Asp Thr 

230 235 o/ift 

Pro >ret. Ala Asp Val Leu Leu Glu Thr Asp Thr Gly Lys Ser Asp lju 

250 255 

50 '^^ ^■''^ ser Gin Glu Val Thr 

. ^o*- ■ 265 77n 

val Arg Val Ala Ser Ser Phe He Ser Val Glu Gin Ala Glu Arg Asn 

280 285 
Leu Ala Glu Val Lys Gly Gin Ser Phe Asp Arg He Arg Leu Ala Gly 
ec 295 300 

03 Arg Glu Ala Trp Asn Lys Val Leu Gly Arg He His Val Glu Gly Gly 
Thr Lys Asp Glu Arg Thr Thr Phe Tyr Ser Wa Leu Tyr Arg Cys Leu 



20 



25 



30 



35 



40 



Leu Phe Pro Arg Phe Tyr Glu Glu ^p Ala Ser Gly Asn Phe Val 

345 350 



His Tyr Ser Tro Tyr Asn Gly Glu Val Leu Pro Gly Tyr l" Tyr Thr 

360 

Asp Thr Gly Phe Trp Asp Thr Phe Arg Ala Leu Phe Pro Leu Leu Asn 

D(- J75 380 

65 Leu Leu Tyr Pro Asp Glu Asn He Lys He Gin Glu Gly Leu Leu Asn 

jyu 39S 



val Tyr Arg Glu Ser Gly Phe Phe Pro Glu Ala Ser Pro Gly ^is 

Arg ASP cys Met He Gly Asn Asn Ser Wa Ser Val Leu Ala Asp Ala 

Tyr Leu Lys Gly Val Arg Val Glu a" Thr Arg Thr Leu Met Asn Gly 

Leu LOU His Ala Thr Lys Ala Val Hi, Pro Lys He Ser Ser Thr Gly 
'7K 460 

75 Arg Lys Gly Trp Glu Trp Ty. Asn Ser Leu Gly Tyr Val Pro Ala Asp 



70 



SUBS i ll U l'H SHEET (Rule 26) (RO/AU) 
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465 470 475 480 

Ala Gly lie Asp Glu Ser Ala Ala Arg Thr Leu Glu Tyr Ala Tyr Asn 

48S 490 -195 

Asp Trp CyB lie Leu Arq Leu Gly Arg Thr Leu Gly Trp A5p Arg Ala 
5 500 505 510 

Ala Leu Asp Thr Leu Ala His Arg Ser Met Asn Tyr Arg His Leu Phe 

SIS 520 S25 

Asp Pro Glu Thr Lys Leu Met. Arg Gly Arg Asn Gin Asp Gly Ser Phe 
530 535 540 

10 Arg Thr Pro Phe Ser Pro Phe Lys Trp Gly Asp Val Phe Thr Glu Gly 
545 550 555 560 

Asn Ala Trp His Tyr Thr Trp Ser Val Phe His A^rp Val Gin Gly Leu 

565 570 575 

Tie Asp Leu Met Gly Gly Asp Arg Pro Phe Val Ser Met Leu Asp Ser 
15 580 585 S90 

Val Phe Acn Thr Pro Pro Met Phe Asp Glu Ser Tyr Tyr Gly Phe Val 

595 600 605 

He His Glu lie Arg Glu Het Gin He Ala Asp Met Gly Asn Tyr Ala 
610 615 620 

20 HiK Gly Asn Gin Pro He Gin His Met He Tyr Leu Tyr Asn His Ala 
625 630 635 64 0 

Gly His Pro Trp Lys Ala Gin Glu Arg Leu Arg Glu VaJ Het Gly Arg 

645 650 655 

Leu Tyr Arg Pro Thr Pro Asp Gly Tyr Cys Gly Asp Glu Asp Asn Gly 
25 660 ■ 665 670 

Gin Thr Set Ala Trp Tyr Val Phe Ser Ala Leu Gly Phe Tyr Pro Val 

675 680 685 

Thr Pro Ala Thr Asp Gin Tyr Val Leu Gly Ser Pro He Phe Ser Lys 
690 695 700 

30 Val He Leu Ser Phe Pro Asp Gly His Lys Thr Val Leu His Ala Pro 
705 710 715 720 

Ala A£n Ser Ala Asp Thr Pro Tyr He Arg Ser He Ser Val Glu Gly 

725 730 735 

Lys Glu Trp Ser Cys Asn Tyr Leu Thr His Glu Gin Leu Arg Ser Ser 
35 740 745 750 

Ala Sor He Gin Trp Met Met Asp Thr Lys Pro Asn Tyr Asn Arg Gly 

755 760 76S 

Met. Lys Glu Ser Asp Arg Pro Tyr Ser Pht* Ser Thr Glu Gin Gin Arg 
770 775 780 

40 Arg Ala Asn His Ser Asn 
785 790 

(?.) INFOWIATION FOR SEQ ID NO: 373 

45 {i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 
{0} TOPOLOGY: linear 

50 (ill MOLECULE TYPE: protein 

(tli) HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 
55 (Al ORGANISM: Porphyromonas glncjivalis 



60 



65 



70 



75 



Ux) FEATURE: 

(A) NAUE/KEY: misc feature 
fB) LOCATIOi; 1 , . .286 

i'Ai) SEQUENCE DESCRIPTION: SEQ ID NO:373 



He 


Cys 


Gly 


Ser 


Lys 


Het 


Asn 


Leu 


Ser 


Gly 


I.C;\I 


Gin 


Ser 


Phe 


Thr 


Met 


1 








5 










10 










15 




Met 


Lys 


Ser 


Itet 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 








20 










25 










30 






He 


Thr 


Ala 


Leu 


Gly 


cys 


Ser 


Asn 


Asn 


Lys 


Alo 


Ala 


Glu 


Ser 


LyjT 


Ser 






35 










40 








45 








Val 


Ser 


Ph^ 


Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


He 


Pro 


Leu 


Arg 


Ala 




50 










55 










60 










Asp He 


Asp 


Thr 


Pro 


Ser 


Leu 


His 


Val 


Met 


He 


Ser 


Tyr 


Val 


Tyr 


Pro 


65 










70 










75 










80 


Ser 


Gly 


Asp 


Asp 


Met 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Phe 


Gly 










85 










90 










95 




Asp 


Ser 


l.eu 


Met 


Asp 


Ser 


Ser 


Sor 


Pro 


Glu 


Asn 


Ala 


Met 


Giu 


Gly Tyr 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



30 



40 



45 



50 



»P^b -tf^ii TP™ ""-"b J»»ti rf'nii ""h h^Ii t»«»i 

!ui ■-s Z3i ii-ii -iL ic; o. io u, o C3i iTj;; o iuj n-ii 
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305/ 490 



100 los 



Ala Gin M« Leu Gly Glu A.p Tyr Arg S« Asn A«n Al. Ala A^n 

g Leu Gin Gly Leu Pro Ser A.p du r.eu A,p Ty. II. Jy^ ^lu 
Asn Thr He Ala Tyr Cy^ A^p Tht Gly Lbu lie Ser Thr Arg He Asn 
T>.r Tyr Thr Tyr Glu Gly Gly Ala Mis Thr Glu Asn Thr Val Arq Phe 
10 Ala Asn He Leu Arg Thr Thr Gly Lys vll Leu Glu Glu Arg ^Ip He 
Phe Ly, lie A,p Tyr Ala Glu Arg Ht Ser Ala Leu He n'e Gly Gin 

200 

15 zlo ■^''^ '^-P Ala He 

Gly Phe Phe Asn Ala Glu Glu llo Gin Pro Asn Gly Asn Phe Met He 
230 235 ojn 

A^p A^p Ly3 Gly Leu Thr Tyr Cys Phe A|n Glu Tyr Gin He Ala 111 

20 Tyr Ala Arg Gly Ala Val Tyr Val Arg "u Gly Tyr Asp Val "u Ala 

265 

Pro I.CU Leu Arq Asp Asp Ser Pro Leu Lys Arg Tyr Leu Pro 
275 280 285 

25 (7.) ItJFORItATIOH FOR SEQ ID NO: 37 4 



fi) SEQL'ENCF: CHARACTERISTICS: 

(A) LENGTH: 37 8 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: YES 

35 

(Vi) ORIGINAL SOURCE: 

(A) ORGAI^ISM: Porphyromonaa glnqivalis 
lix) FEATURE: 



(A) NAME/KEY: misc feature 

(B) LOCATIOH 1, . .37 8 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:374 

Gly lie He Ser Val Leu Ser His Val Val Asp Arg Pro Gin Trp Gly 
^ S 10 ^ 



Ala ser Pro Glu Ala Ala Gly Thr His Ser Val Tyr Ser He ^eu His 



Pro ser Ala Gly He He Arg He Arg Ser Het Gly He He Ser Ala 



40 



Cys Arg Ho Ala He Leu Ala aly Lys Hi. Pro Arg Ser Gly Leu Ser 

» « , __ 60 



55 Lys Lys Arg Lys Leu Gin Glu Lys Asn Val Phe Leu Gin He Arg ^u 



Arg Gin Ser Phe Asn Asn Leu He Pro l« Leu Pro Phe Arg He Asp 
Asn Thr Lys Lys He Thr Glu Met iys Lys Thr Thr Leu TUc Gly Ser 



Arg Ala Asn Val Gly He Leu Ser Tyr Asn Pro Glu Asn Thr Pro Glu 
Lys 
Phs 

"0 -2 

He cys Ala Leu Leu Lou Phe 'du Gly Leu Ser Ala ^^n Ala Gin Ser 
Lys LOU r.y.. He Lys Ser He Glu Ala Ala Thr Thr Phe Ser Ser Ala 
65 Thr Ala Gly Asn Gly Phe Gly Gly Asn He Phe Gly Met Asp Met Ser 
He Arg Het Arg Val His His Ser He f^u Pro Glu Gly Leu ^Ip Phe 
70 "'^ ^''^ Hi-- Trp Gl„ Glu il'a" Gly Ser 

Pro Lys Leu Met Tyr Thr Asn Val Pro Ser He He "y He Val G.u 
Lys Val He Val Phe Glu Asp Ala Glu Asp Phe III Asp Lys Lys Ala 
75 Leu Gly Arg Phe Leu He Ser Leu Gly He Ser Tyr Thr I.ys „ls L^u 
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WO 99/29870 



PCT/AU98/01023 



306/ 490 



245 



250 



255 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Gly 


Ma 


Tyr 


Trp 
260 


Gly 


Trp 


Thr 


Asn 


Asp 
265 


Ala 


His 


He 


Leu Phe Ser 
270 


Pro 


lie 


Pro 


Lys 
275 


Ser 


Lys 


Val 


His 


Tyr 
280 


Asp 


Thr 


Tyr 


Thr 


Arg Ala Gly 
295 


Ser 


Asp 


L«>u 


Val 


Leu 


Gin 


Ser 


Glu 


Asp 


Val 


Ma 


Thr 


Val 


Ser Asn Gly Phe 




290 










2SS 








300 






Ser 


Pro 


Cily 


lie 


Gly 


Leu 


Lys 


Ser 


Ser 


lie 


Trp 


Trp 


Lys Met Pro 


lie 


305 










310 










315 






320 


Lys 


Ser 


Lys 


Tyr 


Asp 
325 


Phe 


Arg 


Leu 


Gly 


Phfi 
330 


Ser 


Leu 


Gly Tyr Glu 
335 


Tyr 


Leu 


Asn 


Leu 


Leu 
340 


Tyr 


Pro 


Tyr 


Arg 


Asn 
345 


Phtt 


Lys 


Leu 


Asp Gly Asn 
350 


Lys 


Pro 


Leu 


Ser 
355 


Ala 


Leu 


Ser 


Pro 


Arg 
360 


Met 


Asn 


His 


lie 


Gly His Val 
365 


Gly 


Phe 


Asn 
37 0 


Phe 


Thr 


Val 


Gly 


Leu 
37 5 


Trp 


Thr 


Asn 











(2) INFOPJIATICX^ TOR StIQ ID MO: 37 5 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEIJGTH: 1269 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE; protein 

fill) HYPOTHETICAL: YES 

(vU ORIGIHAL SOURCE: 

(A) ORGAHISM; Porphyromonas gingivalis 

( ix ] FEATURE : 

(A) NAHE/KEY: misc feature 

(B) LOCATION 1. . .1269 



(xi) SEQUENCE DESCRIPTIOIJ : SEQ ID NO:375 



Val 


Lys 


Cys 


Net 


Gly 


Lys 


Tyr 


Lys 


Arg 


Ma 


Lys 


Tyr 


Arg 


Tyr 


Trp Leu 


1 








5 










10 










15 


Phe 


Pro 


Phe 


Cys 


Ser 


Asp 


Tyr 


Tyr 


Thr 


Phe 


Glu 


Gly 


Val 


Thr 


Phe Leu 








20 










25 










30 




Cys 


Ala 


Scr 


Asp 


Asp 


Met 


Thr 


Thr 


L^'s 


Lys 


Pro 


Gin 


Ala 


He 


Leu Asp 






35 










40 










45 






Leu 


Glu 


Lys 


Ala 


Tyr 


Asn 


He 


Glu 


He 


Pro 


Asp 


Leu 


Ser 


Ser 


Gin Glu 




50 










55 








60 








Gly 


lie 


Ser 


Trp 


Ser 


Val 


Asn 


Arg 


Tyr 


Phe 


Lys 


Gin 


Asp 


Ser 


Ser Gly 


65 










70 










75 








80 


Ala 


Val 


Val 


Glu 


Leu 


Cys 


Leu 


Arg 


Glu 


Cys 


Gin 


He 


Glu 


Ser 


Met Thr 










B5 










90 










95 


Trp 


Leu 


He 


Asp 


Phe 


Pro 


Ala 


Leu 


Lys 


Lys 


Leu 


Asp 


Leu 


Ser 


Tyr Asn 








100 










105 










110 




Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser Leu Thr Lys 






lis 










120 










125 






Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly Leu Asp Ser 




130 










135 










140 








Leu 


The 


Ser 


Leu 


Thr 


Lys 


Leu 


Ser 


Leu 


Ser 


Asp 


Asn 


Gin 


He 


Ser Lys 


145 










150 










155 








160 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


Leu Leu 










165 










170 










175 


Asp 


Ash 


Gin 


He 


Scr 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser Leu 








180 










185 










190 




Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 


G] y Leu 






195 










200 










205 






Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly Asn 


Gin He 




210 










215 










220 








Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu Arg 


225 










230 










235 








240 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu Arg 


Leu Thr 










245 










250 










255 


Ssr 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Sec 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu Glu 








260 










265 










270 




Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly Asn 






275 










290 










285 




Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Scr 


Ser 


Leu 


Thr Lye 
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WO 99/29870 PCT/AU98/01023 

307/490 

290 295 3GO 

Leu Arg Leu Arg Ser Asn Gin He Ser Lya Leu Glu Gly Leu Glu Arg 
305 310 315 320 

L&u Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He Ser Lys 
5 325 330 335 

Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu 

340 3<5 350 

Asp Asn Gin He Arg Lys Leu Glu Giy Leu Glu Arg Leu Thr Ser Leu 
355 360 365 

10 Thr Lys Leu Axg Leu Arg Ser Asn Gin He Ser Lys Leu Glu Gly Leu 
370 375 360 

Asp Ser Leu Thr Ser Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin He 
385 390 395 400 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr 
15 405 410 415 

Leu Leu A^rp Asn Gin He Arg Lys Leu Glu Gly Leu Asp Gly Leu Ala 

420 425 430 

Ser Leu Thr Arg Leu Ser Leu Arg Arg Asn Gin He Ser Lys Leu Glu 
435 440 445 

20 Giy Leu Asp Arg Leu Lys Val Leu Arg Lys Leu Asp Val Ser Gly Asn 
450 455 460 

Asp He Gin Ser He Asp Asp He Lys Leu Leu Ala Pro He Leu Glu 
465 470 475 480 

Glri Thr Leu Glu Lys Leu Arg He His Asp Asn Pro Phe Val Ala Ser 
25 4 85 4 90 495 

Ser Giy Leu He Leu Ser Pro Vyr Asp Asn His Leu Pro Glu He Lys 

500 505 510 

Ala Leu Leu Glu Lys Glu Lys Glu Lys Gin Lys Lys Thr Ser Val Glu 
515 520 525 

OU Tyr His Pro Phe Cys Lye Val Met Leu Leu Gly Asn His Ser Ser Gly 
530 535 S40 

Lye Thr Thr Phe Leu Ser Gin Tyr Asp Thr Asn Tyr Thr Tyr Gin Lys 
545 550 555 

Asn Thr His Val Leu Ser He His Arq Ser Asn Asn Pro Asn Ala He 

565 570 575 

Phe Tyr Asp Phe Gly Gly Gin Asp Tyr Tyr His Gly He Tyr Gin Ala 

580 585 590 

Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Trp Asp Ala Lys Lys 

S95 600 605 

Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr Leu Asn 

610 615 620 

Phe Asn Arg Pro Tyr Trp Leu Gly Gin He Ala Tyr Ala Cys Asn Arg 
625 630 635 640 

Cys Met Sec Val Gly Gly Asn Pro Asp Gly Lys Asp Thr Pro Gin Thr 

645 650 655 

Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr Gly Ala 

660 665 670 

Lys Gin Gin Thr Leu Gly Cys Ala Ala Glu Asn Gly Val Leu Glu Glu 

675 680 685 

He Tyr Val Ser Leu Glu Pco Lys Ala Asn Ser Ali Val Hie Ala Leu 

690 695 700 

Asn Tyr Leu Asn Glu Arg Val Arg Glu Val Val Ala Ser Arg Ser Lys 
705 710 715 720 

Ser He Gin He Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro 

"725 730 735 

Thr He Ala Gly Asp Asn Lys His He Pro He Ser Leu Glu Ala Leu 

740 745 750 

Ala Ala Glu Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr H» 

''SS 760 765 

Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ser Leu Arg Gly Glu Val 

770 775 780 

Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp Leu Asp 
''SS 790 795 800 

Pro Ala Ala Phe Val Gin Met He His Gly Glu He Leu Gin Lys Asp 

805 810 815 

Asn He Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys Lys Leu 

320 825 830 

His Asn Leu Ser Ser Gly Ser He Phe Glu Glu Asp Gly Gin Asn Gly 

835 840 845 

Asn Met He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp 

850 855 860 

Lys Asp Cys Tyr Val He Pro Gly Tyr Leu Pro Leu His Ser Asp Asp 
865 870 875 880 

Glu Ala Tyr Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asm Phe Val 
885 890 895 



SXJBSnrUTE sheet (RuIc 26) (RO/AXJ) 



10 



20 



30 



50 



55 



60 



65 



WO 99/29870 PCT/AU98A)1023 

308/490 

Leu Lys Phe Glu Arg Phe tie Pro Phe Gly Leu lie Asn Gin lie He 

90S Qi o 

Ala Tvr Tyr Gly Arg Glu Glu Gly Ala Leu Lys Axg Tyr Trp Arg A«p 

^■"■^ 920 
Gin Val He Phe Thr Ala Gly Arg Glu Met Asp Arg Gin Thr L=u Glu 
935 

Gin Glu Glu Glu Lys Glu Gly Leu Pro Lys Thr Asn Ala Glu Asp Tyr 

Gin He Trp He Lys Leu Asp Phc Thr A^p Leu Ala He Ser Val fhe 
965 



70 



955 9€0 
Val 

He Lys Glu Gin Arg Lys Thr Ser Ala Lys Asp MeU Gin Arg Lyf Glu 



?80 ^ 985 990 

lu Asp Met Tyr Trp Asn 
)00 101 

--^ -ys Asp Thr Glu Gin Thr 
1010 _ 1015 1020 



sa^ «90 

Ala Thr He Leu Ser Asp Met Lcu^Asp Met Tyr Trp Asn Asn He Pro 

Glu 

He^Arg Glu Thr Asn Arg Lyl^iiys Arg Pro He Gln"^p Leu Tyr Leu 



It: 1000 1005 

15 Pro Arg^Glu Gin He Gly Asp Lys Asp Thr Glu Gin Thr Arg Ser Thr 



1030 1035 in^n 

ser Cys Ala Gin Ala Asp Lys Asp Leu Thr Glu Ser His Tyr He Mis 



10^5 1050 " 1055 

i'^L^^ ^""^ '^^'^ "^^^ Ala Arg He Ala Ala 

l^^O 1065 1070 



Tyr Pro Lys A^n Gly Val He A^p Lys Glu Arg Val Arg Glu Val 

1075 1080 1085 

Inlo"""^ ^^'^ ^-1 A.n Leu Ala Thr 

1095 ^ J 00 

Ala__Ly-s Gin lie Phe lle^Ser Tyr Ser Ly. Glu Ajp Gin Thr Glu L,u 

In lis 
1135 



Ills 1120 

Glu lie Arg iys-Arg He Val Glu A^a'Lp Cys He He Ala^Leu He 
xi»=> IISO 



Glu Thr cys Leu Gln^Phe Pho Lys Pro Leu Glu Lys Asn Gly Gin He 

Glu He Tyr Tyr Asp Lys Leu Thr Lys Sh"llu Thr Pro He Hl"pro 

114S iiso 



oc iiso lies 

ser Gin Arg Tyr Leu Ala Thr Asp Tyr He Leu Aap His Glu Leu Pro 

1175 HBO 
Val^Phe Arg Glu Tyr Asn^Ly, Thr He Val Pro He Leu He Lys Pro 

cys Thr Phe Glu Asp Asp Glu Phe Leu Arg Gli"ys Tyr Phe Ala Gln"°° 

1205 1210 15161 

Lys Ala Gin lie He Asn Leu Gly Lys Glu Gly Lys Thr He Lys Ala 

122S 123b 
Tyr A^p Ser He Thr Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala 
AC 1Z35 1240 1245 

""^^ 12SD^^ ^^'^ ""^^ 1'^" Arg He Thr Lys Gin Glu 

1255 1260 

Val Asn Thr Asp Glu 
1265 



(2) INFOfU-tATION FOR SEQ ID HO: 37 6 

(1) SEQUEWCE CHARACTERISTICS: 

(A) LEl^GTH: 231 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL; YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingivalis 
tlx) FEATURE: 

(A) NAIIE/KEY: mlsc feature 

( B ) LOCATIOJJ 1 . . . 231 

fxl) SEQUEi;CE DESCRIPTION: SEQ ID NO: 37 6 



Lys Phe Met Met Lys Lys Ala Phe Val Phe Val Leu Leu Val Cys Leu 

Phe ser Ser Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg 

75 ''''^ "'^'^ ""^^ •^^^ V^l Ser Leu Asn Leu Gly Val 

40 45 
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309/490 



15 



25 



30 



60 



65 



70 



Plo Pro Val Ser Thr Gl 



" Val Trp Gly Met Thr His Asp Ala Asn Gly 
SO 5S 60 

Leu Pro Phe Glu He Pro He Ser Phe Ser Arg Phe Asn Ser Gin Glv 
c 75 80 

b Asp ile Ala Thr Thr Tyr Tyr He Ala Asn Ser Glu Ala Thr Leu Asn 
85 90 95 

Glu Trp Cys Asp Tyr Ala His Pro Gly Gly He Val Arg Val Glu Gly 

105 110 
Arg Phe Trp Lys Met Thr Tyr Asn lie Pro Thr Tyr Asn Ala Val Cys 
lU 115 120 125 

Thr Arg lie Thr Phe Glu Asn Gin Glu lie Glu Gly Thr lie Val Leu 

130 135 140 

lie Pro Lys Pro Lys Val Ser Leu Pro His Val Ser Glu Ser Val Pro 
l''^ ISO 155 160 

Cys lie Arg Thr Glu Ala Gly Arg Glu Phe He Leu Cys Glu Glu Asp 

15S 170 175 

Asp Thr Phe Val Ser His Asp Gly Asn Glu Val Thr Ile Gly Gly Lvs 

IQO 185 190 

Pro Phe Leu Leu Asn Thr Asn Val Lys He Val Gly Asp Val Ser Gin 

195 200 205 

Lys Tyr Ala Vai Gly Val Gly Glu He Arg Phe Leu Gin He Cys Ala 

210 215 220 

Gin Thr Val Ser Gin Gin Lys 
225 • 230 



(2) IHFORllATION FOR SEQ ID NO: 37 7 

(1) SEQUE^^CF: characteristics: 

(A) LEHGTH: 563 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ti) MOLECULE TYPE: protein 

35 (111) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: PorphyroRionas gingivalls 

40 (ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION I... 563 

Ixl) SEQUENCE DESCRIPTION: SEQ ID NO: 377 

Ser Leu He Asn Asn Ser Arg Asp Trp Arg Ala Pro Val Arg Leu Ser 
^ S 10 15 

Leu Lys l.ys Lys Thr Lys Thr Met Lys Thr Lys Val Leu Arg Lys Phe 
cn ^0 25 30 

50 Val Val Ala Ala Phe Ala Val Ala Thr Leu Cys Pro Leu Ala Gin Ala 
3S 40 45 

Gin Thr Hct Gly Gly Asp Asp Val Lys Val Val Gin Tyr Mn Gin Glu 
SO 55 gQ 

'^^'^ "-^^ V«l Asp Asn Gly Trp Ile Tyr 

65 70 75 

Val Met Thr His Ser Gly Tyr Asp Thr Gly Asn Ser Asn Val Lys He 

85 90 95 

Phe Arg Ser Lys Asp Gin (Sly Ala Thr Tyr Gin Lys Leu Arg Asp Trp 
. » i05 110 

Asp Pro Ser Asp Asp Tyr Gin Phe Gin Asp Phe Asp lie Val Val Thr 

lis 120 125 

Gly Lys Asn Glu Ser Asp He Lys lie Trp Ser Val Glu Leu H-t Asn 

135 140 
Lys Pro Gly Gly Tyr Lys Ser Arg Val Ala Val Phe Ser Arg Asp Ala 
150 155 j^gQ 

Asn Ala Gin Asn Ala Lys Leu Val Tyr Lys Glu Asp Phe Ser Asn Val 

1€5 170 
Gin Leu Tyr Asp Val Asp Ile Ala Ser Asn Tyr Arg Ser Pro Ser Ser 

190 185 190 

Leu Asn Asn Gly Gly Asn Pro Phe Ala Leu Ala Phe Ala Tyr Thr Gly 

19S 200 205 

Phe Asn Asn Thr His Lys Ile Ser Phe Val Asp Tyr Val Phe Ser Leu 

210 215 220 

ooc Le" 1-*" Phe Ser Gin Asp Glv 

^25 230 235 2-10 
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10 



15 



20 



25 



Tyr Thr Ser Ser Ala Asn Ala Leu Tyr Thr Pro Gin Val Asp lie Asn 

440 445 

TSo ^^'^ ^^"^ ^-"^ "^^P Glu Tyr Leu Pro Glv 

4S5 460 

30 465 ^^^^ Inl '^''P '^''^ '^^^ ^^i' 

Glu Asp He Val Met Gin Glu Gly Ser Met iy^ Leu Tyr Pro Asn Pro 

Ala Gin Glu Tyr Ala Val He Ser Leu Pro Thr Ala Ala A^n Cys Lys 
or 505 siO 

d5 Ala Val Val Tyr Asp Met Gin Gly Arg Val Val Ala Glu Ala Ser Phe 
515 520 

^^"^ 530 ^^"^ '^^'^ '^^^ "^^ "T^^ 

40 ISf^ ^^"^ '^^g Val Glu Lys Leu 

550 555 cf-n 

lie Val Glu 

(2) INFORJ-IATIOrJ FOR SEQ ID MO: 378 



50 



60 



(i) SEQUENCE CHAKACTERISTI CS: 

(A) LENGTH: 786 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 



(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

55 (vi) ORIGIHAL SOURCE: 

(A> ORGANISM; Porphyrotnonas gingivalis 

(ix) FEATURE: 

(A) NAHE/KEY: misc feature 

(B) LOCATION 1. . .786 



(xi) SEQUENCE DESCRIPTia-f: SEQ ID NO:37B 

lie Val Tyr Leu Cye His Cys Met Asn His Arg Arg Ser Lys Thr Met 

^0 15 

Leu Thr II- Arg Asn Phe L*;u Leu Phe Cys Cys Leu Ser Leu lie Ala 

25 30 

Phe Ala Ala Asp Ala Gin Ser Ser Val Ser Ser Gly Arg Arg Leu Thr 

70 Glu Tyr Val Asn Pro Phe He G^y Thr Ala Asn Tyr G^y Thr Thr Asn 

55 60 



Pro Gly Ala Val Leu Pro Asn Gly Leu Met Ser Val Thr Pro Phe Asn 



75 



65 70 
Th 

85 90 " ' 95 



Val Ser Gly Ser Thr Glu Asn Arg Phe fKsp Lys A.-p Ser Arg Trp Trp 



WO 99/29870 PCT/AU98/01023 

310/490 

Glu Lys Lys lie Asp Lys Val Asp Leu Ser Leu Gly Ser Thr Ser Glu 

245 250 255 

Ser Met Gly His Asn Ala Trp Pro Leu Met Gly Val Val Phe Glu Met 

265 270 
Asn Lys Gin Gly Gly Lys Ser Asp He Gly Phe Leu Ser Asn Phe Val 
2^5 230 285 

^""^ 295 

Ser Asp Met Sor Phe Ser Pro Lys He Gin Met Leu Leu Asp Glu Asp 
^^-^ 310 315 

Asn Asn Thr He Asn Gly Glu Ser Cys His Asn Phe Met He Thr Tyr 

325 330 335 

ser Asp Tyr Asp Ser Glu Tyr Ser Asp Trp Asp He Arg Tyr Val Tyr 

345 

Pro Lys Lys Ser Phe Lys Tyr Glu Lys Gly Lys Thr Pro Thr Met A.-p 

360 365 
Asp Leu Val Glu Ala Phe Leu Thr Ala Ser Tyr Gin Ser Glu Thr Asn 

375 380 
Ser Gly Leu Gly Tyr Asp Lys Asn Ala Asn His Tyr Leu He Thr T-r 

390 305 
Ala Lys Lys Glu Glu Asn Gly Thr Asn Thr Leu Lys Tyr Arg Trp Ala 

405 410 
Asn Tyr Asp Lys He Hie Asn Lys Asp Leu Trp Ser Asp Thr Ph- Thr 
420 425 430 



SUBSTmrUE sheet (Rule 26) (RO/AU) 



ti^'ii ti"* IP™' «?i "■'=^1 ^I=rn .rtf iT=^i -r)^ iP^ii -n^-i, jji^ii 

■iLJi "Hji Oi ii::: €^ Oi m u cai ^ cji jlji 



WO 99/29870 PCT/AU98/01023 

311/490 

Ser Ala Pro Tyr Ser Ala A^p Asn Ser Tyr uys lie Gly Phe Ser His 

100 105 110 

Val Asn Leu Ser Gly Val Gly Cys Pro Glu Leu Ser Gly He Leu L-u 
115 120 125 

5 Met Ala Thr Ser Gly Thr Phe Asp Pro Asp Tyr Cys Cys Tyr Gly Ser 
130 135 140 

Ser Leu Ser Arg Glu Tyr Ala Arg Pro Gly Glu Tyr Lys Ala Val Leu 
14S 150 155 1 60 

Asp Lys Tyr Gly He Asp Ala Ala Val Thr Val Thr Glu Arg Thr Ala 
10 165 170 175 

Leu Thr Glu Phe Ala Phe Pro Glu Gly Glu Gly His He Leu Leu Asn 

180 185 190 

Leu Gly Gin Ala Leu Ser Asn Glu Ser Gly Ala Ser Val Arg Phe Leu 

200 205 
15 Asn Asp Ser Thr Val Val Gly Ser Arg Leu Met Gly Thr Phe Cys Tyr 
210 215 220 

Asn Pro Gin Ala Val Phe Arg Gin Tyr Phe Val Leu Gin Val Ser Arg 
225 230 235 240 

Arg Pro He Ser Ala Gly Tyr Trp Lys Lys Gin Pro Pro Met Thr Val 

245 250 255 

Glu Ala Gin Trp Asp Ser Thr Ala Gly Lys Tyr Lys Gin Tyr Asp Gly 

260 265 270 

Tyr Lys Arg Glu Met Ser Gly Asp Asp He Gly Val Arg Phe Ser Phe 
275 280 285 

ZD Asn Cys Asp Gin Gly Glu Lys He Tyr Val Arg Ser Ala Val Ser Ph«. 
290 295 300 

Val Ser Glu Ala Asn Ala Leu Tyr Asn Leu Glu Ala Glu Gin Glu Glu 
305 310 315 320 

Val Phe Lys Ser Val Gly Gly Asn Pro Ala Lys Ala Phe Ser Ala He 
30 325 330 335 

Arg Ser Arg Ala He Glu Arg Trp Glu Glu Ala Lou Gly Thr Val Glu 

340 345 350 

Val Glu Gly Gly Thr Pro Asp Glu Lys Thr He Phe Tyr Thr Ala L'^u 
__ 3SS 360 365 

Jo Tyr His Leu Leu He His Pro Asn He Leu Gin Asp Ala Asn Gly Glu 
370 375 380 

Tyr Pro Met Met Gly Ser Gly Lys Thr Gly Aen Thr Ala His Asp Arg 
305 390 395 4OO 

Tyr Thr Val Phe Ser Leu Trp Asp Thr Tyr Arg Asn Val His Pro Leu 
4U 405 410 415 

Leu Cys Leu Leu Tyr Pro Glu Lys Gin Leu Asp Met Val Arg Thr Leu 

420 425 430 

He Asp Met Tyr Arg Glu Ser Gly Trp Leu Pro Arg Trp Glu Leu Tyr 
4 35 440 445 

45 Gly Gin Glu Thr Leu Thr Met Glu Gly Asp Pro Ser Leu He Val He 
450 455 460 

Asn Asp Thr Trp Gin Arg Gly Leu Arg Ala Phe Asp Thr Ala Thr Ala 
465 470 475 480 

Tyr Glu Ala Net Lys Lys Asn Ala Ser Ser Ala Gly Ala Thr His Pro 

485 490 495 

He Arg Pro Asp Asn Asp Asp Tyr Leu Thr Leu Gly Phe Val Pro Leu 

500 505 510 

Arg Glu Gin Tyr Asp Asn Ser Val Ser His Ala Leu Glu Tyr Tyr Leu 
S15 520 525 

OD Ala Asp Trp Asn Leu Ser Arg Phe Ala His Ala Leu Gly His Lys Glu 
530 535 540 

Asp Ala Ala Leu Phe Gly Lys Arg Ser Leu Gly Tyr Arg Hlc Tyr T-yr 
545 550 555 5^0 

Asn Lys Glu Tyr Gly Met Leu Cys Pro Leu Leu Pro Asp Gly Ser Phe 
60 565 570 575 

Leu Thr Pro Phe Asp Pro Lys Gin Gly Glu Asn Phe Glu Pro Asn Pro 

580 585 590 

Gly Phe His Glu Gly Ser Ala Tyr Asn Tyr Ala Phe Phe Val Pro His 
S95 600 60S 

DD Asp He Gin Gly Leu Ala Arg Leu Het Gly Gly Ala Lys Val Ph- Ser 
610 615 620 

Glu Arg Leu Gin Lys Val Phe Asp Glu Gly Tyr Tyr Asp Pro Thr Asn 
S25 630 635 640 

Glu Pro Asp He Ala Tyr Pro Tyr Leu Phe Ser Tyr Phe Pro Lys Glu 
70 645 650 ^ 655 

Ala Trp Arg Thr Gin Lys Leu Thr Arg Glu Leu He Asp Lys His Ph- 

660 665 670 

Cys Asn Ala Pro Asn Gly Leu Pro Gly Asn Asp Asp Ala Gly Thr Met 

675 680 685 

Ser Ala Trp Leu Val Tyr Ser Het Leu Gly Phe Tyr Pro Asp Cys Pro 



75 
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690 



€95 



7 00 



Gly Ser 


Pro 


Thr Tyr 


Thr 


Leu 


Thr 


Ssr 


Pro 


Val 


Phe 


Pro 


Arg 


Val 


Arg 


705 










TiO 










715 










720 


He 


Arg 


Leu 


Asn 


Pro 


Gin 


Tyr 


lyr 


Pro 


Gin 


Gly 


Glu 


Leu 


He 


He 


Thr 










725 










7 30 










735 




Thr 


Asn 


Thr 


Glu 


Asn 


Gin 


Pro 


Thr 


Asp 


Sec 


He 


Tyr 


lie 


His 


Thr 


Val 








740 










745 










750 






Ser 


Leu 


Gly Aen 


Lys 


Thr 


Leu 


Pro 


His 


Gly Thr Arg 


His 


He 


Sec 


His 






755 










760 










765 








Ala 


A5p 


Leu 


Val 


Arg 


Cys 


Gly His 


Leu 


Arg 


Tyr 


Glu 


Leu 


Ser 


Asn 


Arg 




770 










775 










780 










Pro 

































(2) INFORHATION FOR SZQ ID NO: 37 9 

U) SEQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 814 amino acids 
{BV TYPE: amino acid 
(D) TOPOLOGY: linear 

(iil HOLECt/LE TYPE: protein 

(Hi) HYPOTHETICAL; YES 

( vi ) ORIGINAL SOURCE ; 

(A} ORGANISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) HWIE/KEY: Jnisc feature 

(B) LOCATIon 1. . .814 



(Xi> SEQUENCE DESCRIPTIOM: SEQ ID HO: 379 



Pro 


Arg 


Phe 


His 


Pro 


Gly 


Gly 


Thr 


His 


Ala 


Gin 


Leu 


Cys 


Arg 


Aen 


Gly 


1 








5 










10 










IS 




Arg 


Arg 


Asn 


Gin 


Lys 


Ser 


Asn 


Gin 


Ser 


Ser 


Cys 


Phe 


Gly 


Gly Gly 


Thr 








20 










25 










30 






Ala 


Pro 


Arg 


Phe 


Phe 


Ha 


Met 


Cys 


Lys 


He 


Arg 


Phe 


Ser 


Leu 


Leu 


Gin 






35 










40 










45 








Ala 


Leu 


Val 


Val 


Cys 


Leu 


Leu 


Phe 


Thr 


Ser 


Phe 


Ser 


Leu 


Gin 


Ala 


Gin 




50 










55 










60 










Glu 


Glu 


Gly 


He 


Trp 


Asn 


Thr 


Leu 


Leu 


Ala 


He 


His 


Lys 


Thr 


Glu 


Lys 


65 










70 










75 










80 


Ala 


Val 


Glu 


Thr 


Pro 


Lys 


Lys 


Val 


Phe 


Ala 


Val 


Ala 


Aen 


Gly Val 


Leu 










85 










90 










95 




Tyr 


Ser 


Val 


Gly 


Lys 


Glu 


Ala 


Pro 


His 


Glu 


Ala 


Lys 


He 


Phe 


Asp 


Arg 








100 










105 










110 






He 


Ser 


Gly 


Leu 


Ser 


Asp 


Thr 


Ser 


Val 


Ser 


Ser 


He 


Ala 


Tyr 


Ser 


Glu 






115 










120 










125 








Gin 


Leu 


Lys 


Ser 


Leu 


Val 


He 


Tyr 


Tyr 


Ala 


Ser 


Gly 


Asn 


He Asp 


lis 




130 










135 










140 










Leu 


Asp 


Glu 


Ala 


Gly 


Arg 


Val 


Thr 


Asn 


Val 


Pro 


Ala 


Leu 


Lys 


Asp 


Asn 


145 










ISO 










155 










160 


He 


Asp 


Leu 


He 


Asp 


Lys 


Thr 


Leu 


Asn 


Arg 


Leu 


Leu 


He 


Val 


Gly 


Arn 










165 










170 










175 




Arg 


Ala 


Tyr 


Leu 


Ala 


Gly 


Gly 


Phe 


Gly 


Leu 


Ser 


Val 


Leu 


Asp 


Val 


Ala 








180 










IQS 










190 






Glu 


Ala 


Arg 


He 


Pro 


Ala 


Thr 


Tyr 


Ala 


Lys 


Gly 


Thr 


Lys 


Val 


The 


Asp 






195 










200 










205 








Val 


Ala 


hys 


Leu 


A£p 


7\sn 


Asp 


Arg 


Leu 


Leu 


Met 


Leu 


Lys 


Glu 


Gly 


Gin 




210 










215 










220 










Leu 


Phe 


He 


Gly 


Lys 


Glu 


Thr 


Asp 


Asn 


Leu 


Gin 


Asp 


Pro 


Ala 


Ala 


Trp 


225 










230 










235 










240 


Thr 


Ala 


Lsu 


Ser 


Leu 


Asn 


Leu 


Pro 


Met 


Gly 


Ser 


Val 


Thr 


Gly Leu 


Giy 










245 










250 










255 




He 


Val 


Gly 


Glu 


Asp 


He 


Cys 


Phe 


Lsu 


Leu 


Ala 


Asp 


Gly 


Arg 


Val 


Tyr 








250 










265 










270 






Val 


Ala 


Ala 


Asn 


Gin 


Ser 


Phe 


Glu 


Pro 


Glu 


Leu 


Leu. 


. Leu 


Ser 


Ser 


Ser 






275 










280 










285 








Ala 


Asp 


Ser 


Arg 


Leu 


Tyr 


Val 


Thr 


Asp 


Arg 


Gly 


Leu 


Phe 


He 


cys 


Ala 




290 










295 










300 










Glu 


Asn 


Arg 


He 


Tyr 


Phe 


He 


Glu 


Lys 


Gly 


Arg 


Lys 


Thr 


Thr 


Gin 


Phe 


305 










310 










315 










320 


Pro 


He 


Ala 


Asp 


Val 


Leu 


Gly 


Val 


Gly 


Ala 


Met 


Asn 


Glu 


Ser 


Asn 


Thr 



785 
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325 






330 










335 






Ala Tyx 




Ala 


Leu 


Gly 


Glu Glu Gly 


Leu 


Ala 


Ser 


Leu 


Leu 


Leu 


Ala 










340 






345 










350 








Glu Glv 


Ser 


Thr 


Ala 


Glu 


Ala Met Pro 


Val 


Ala 


Phe 


Asp 


Gly 


Pro 


Gly 


5 






355 








360 








365 










Asp Asn 


Aep 


Phe 


Tyr 


Glu 


Met Arg Phe 


Ser 


His 


Gly 


Arg 


Leu 


Tyr 


Ala 






370 










375 






380 












Ala 


Ser 


Gly 


Leu 


Trp 


Gly 


Thr Asn Leu 


Met 


Gly 


His 


Ala 


Gly Met 


Val 




3B5 










390 






355 










400 


10 


Lys 


Leu 


Tyr 


Asp 


Gly 


Asn 


Arg Trp Thr 


Asn 


Phe 


Asp 


Lys 


Lys 


Thr 


Val 










4 05 






410 










415 






nin 


Glu 


Gin 


Leu 


Gly 


Gly 


Gly Phe Ser 


Phe 


Asn 


Asp 


Ala 


He Asp 


He 










420 






425 










430 








Ala 


Val 


Ser 


Asn 


Gly 


Asp 


Pro Asp His 


Phe 


Phe 


Val 


Gly 


Thr 


Trp 


Gly 


15 






435 








440 








445 










Asn 


Gly 


Leu 


Ph« 


Glu 


Phe 


Lys Asp Gly 


Lys 


Ala 


He 


A.la 


Arg Tyr 


Scr 






450 










455 






460 












Glv Asn 


Glu 


Thr 


Ala 


He 


Ala Glu Cys 


Asn 


Pro 


Gly 


Asp 


Ala 


Arg 


Val 




4 65 










470 






475 










480 


20 




Ala 


He 


Ala 


Phe 


Asp 


Asn Lys Gly 


Asn 


Leu 


Trp 


Gly 


Thr 


Leu 


Gly 










485 






490 










495 






Ala 


Val 


Gly 


Lys 
500 


Asn 


lift 


Phe Met Tyr 
505 


Asp 


Pro 


Gin 


Ser 


Ser 
510 


Thr 


Trp 




Hl£ 


Ser 


Phe 


Ser 


Tyr 


Pro 


Asp Val Ala 


Asn 


Leu 


Ala 


Ser 


Phe 


Gly Asn 


25 






515 








520 








525 










Met 


lie 


lie 


Leu 


Pro 


TVsn 


Gly Asp Lys 
535 


Trp 


Val 


7\sn 
540 


H e 


Leu 


His 


Arg 




Ser 


Gly 


Gly 


Ser 


Thr 


Arg 


Lys Gly Val 


Leu 


He 


Phe 


Asn 


Asp 


Arg 


Gly 




545 








550 






555 










560 


30 


Thr 


Pro 


Glu 


Thr 


Thr 
565 


Ser 


Asp Asp Ser 


His 
57 0 


Leu 


Tyr 


Val 


Glu 


Gin 
575 


Phe 




Val 


A*»n 


Arg 


Leu Gly 


Ala 


Ala lie Gly 


His 


Lys 


Thr 


He 


Tyr Ala 


Met 










580 






585 










590 








Ala 


Val 


Asp 


His 


Asn 


Gly 


Ser Val Trp 


Met 


Gly 


Ser 


Asp 


He 


Gly 


He 


35 






595 








600 








605 










Phft 


Gly 
610 


Val 


Tyr 


Asn 


Ala 


Ala Gly Val 
615 


Leu 


Ser 


Ser 
620 


Thr 


Ser 


Thr 


Pro 




lie 


Ala 


Val 


Arg 


Pro 


Val 


Gly Gly Glu 


Glu 


Pro 


Asn 


Leu 


Tyr 


Tyr 


Val 




625 










630 






635 










640 


40 


Leu 


Asp 


Lys 


Val 


Thr 


Val 


Thr Asp He 


Val 


Val 


Asp 


Lys 


Lgu 


Asn 


His 










64 5 






650 










655 






Lys Trp 


Val 


Ala 


Thr 


Gin 


Gly Thr Gly 


Leu 


Tyr 


Leu 


Leu 


Ser 


Glu 


Asp 










€60 






665 










670 








Cys 


Ser 


Lys 


He 


Leu 


Ala 


Gin Phe Thr 


Val 


Glu 


Asn 


Ser 


Pro 


Leu 


Leu 


45 




675 








660 








685 










Ser 


Asn 


Asn 


He 


Leu 


Scr 


Leu Ala Leu 


Asn 


Asp Asp 


Asn 


Gly Leu 


Leu 






6?0 










695 






700 












Tyr 


Tie 


Gly 


Thr 


Ala 


Asp 


Gly Leu Met 


Thr 


Phe 


Gin 


Thr 


Gly Thr 


Gly 




705 










710 






715 










720 


50 


Ser Gly 


Ser 


Ala 


Ser 


Glu 


Leu Asp Gly 


Val 


Tyr 


Val 


Tyr 


Pro 


Asn 


Pro 












725 






7 30 










735 






Leu 


Acg 


Pro 


Glu Tyr 


Pro 


Asp Gly Val 


Thr 


He 


Ala 


Gly 


Leu 


Gin 


Ala 








740 






745 










750 








Gly Cys 


Ser 


Val 


Lys 


He 


Thr Asp Thr 


Thr 


Gly 


Arg 


Leu 


Leu Tyr 


Gin 


55 






755 








7 60 








765 










Thr 


Glu 


Ser 


Val 


Thr 


Thr 


Glu Val Lys 


Trp 


Asn 


Ala 


Arg 


Gly Ala Asp 






770 










775 






780 












Gly Asn 


Arg 


Val 


Ala 


Ser 


Gly Val Tyr 


A] a 


Val 


Ala 


Val 


Tyr Asp 


Pro 




785 










790 






795 










800 


60 


Val 


Ser 


Lys 


Lys 


Ser 
80S 


Lys 


Leu He Arg 


Phe 
310 


Ala 


Val 


He 


Arg 







(2) IMFORMATIOW FOR SZQ ID HO: 380 

65 (i) SEQUENCE CHARACTFRISTICS: 

(A) LEIJGTH: 1162 amino acids 

(B) TYPE: amino acid 
•ID) TOPOLOGY: linear 

70 (ii) UOLECULfe TYPE: protein 

UiU HYPOTHETICAL: YES 

(vi) ORIGINAL SOt'RCE: 
75 {A} ORGANISM: Porphyromonas gingiva lis 



PCT/AU98/0I023 



SUBSTITUTE SHEET (Rule 26) <RO/AU) 




wo 99/29870 



PCT/AII98/01023 



314/ 



490 



(ixl FEATURE: 

(A) HAIlE/^Xr: misc featuce 

(B) LOCATION 1...1162 



(xi) SEQUEt/CE DESCRIPTION: SEO ID HO:380 



Ala 


He 


Ser 


Gin 


Met 


Lys 


Arg 


He 


Leu 


Pro 


He 


Val 


Ala 


Phe 


Leu 


Ser 


1 








5 








10 










15 




Leu 


Phe 


Leu 


Ala 


Leu 


Ala 


Leu 


Pro 


Ala 


Lys 


Ala 


Gl n 


Arg 


Ala 


Met 


Gly 








20 










25 








30 






Lys 


Thr 


Ala 


Asp 


Arg 


Ser 


Leu 


Met 


Ala 


Ser 


Gly 


His 


Trp 


Val 


Lys 


He 






35 










40 










45 








Arg 


Val 


Asp 


Ala 


Ser 


Gly 


Val 


Tyr 


Arg 


Leu 


Thr 


Asp 


Glu 


Gin 


Leu 


Arg 




50 










55 










60 










Ala 


Affn 


Gly 


Phe 


Ser 


Asp 


Pro 


Ser 


Lys 


Val 


Gly 


Val 


Phe Gly Tyr Gly 


65 










70 










75 










80 


Gly 


Gly 


Val 


Leu 


Pro 


Glu 


Asp 


Leu 


Ser 


Arg 


He 


Thr 


Thr 


Asp 


Asp 


Leu 










85 










90 










95 




Pro 


Pro 


Val 


Pro 


Val 


Leu 


Arg 


Gin 


Gly 


Asn 


Ala 


Leu 


Tyr 


Phe 


Tyr 


Ala 








100 










105 










110 






Val 


Giy 


Pro 


Val 


Thr 


Trp 


Phe 


Tyr 


Asn 


Pro 


Ala 


Lys 


The 


Thr 


Met 


Glu 






115 










120 










125 








His 


Thr 


Val 


Asn 


Thr 


Tyr 


Ser 


Thr 


His 


Gly 


Tyr 


Tyr 


Phe 


Leu 


Set 


Asp 




130 










135 










140 










Ala 


Ain 


Gly 


Ala 


Pro 


Leu 


Gin 


Met 


Ser 


Gin 


Tyr 


Thr 


Gly Gly 


Gly Ala 


145 










150 










155 










160 


Ser 


Ma 


Glu 


Ala 


Leu 


lie 


Asp 


Tyr 


Tyr 


Asp 


Glu 


Leu 


Met 


Leu 


His 


Glu 










165 










170 










175 




Gin 


Glu 


Leu 


Tyr 


Ser 


Pro 


Lys 


Glu 


Ser 


Gly 


Arg 


Asp 


Leu 


Tyr 


Gly Glu 








IQO 










185 








190 






Ser 


Phe 


Ser 


Ala 


Val 


Asn 


Thr 


Arg 


Thr 


Val 


Lys 


Phe 


Pro 


Leu 


Arg 


Gly 






195 










200 










205 








Asn 


TJir 


Arg 


Scr 


Ser 


Gly 


Glu 


Leu 


Gly 


Thr 


Val 


Phe 


Ser 


Tyr 


lie 


Ala 




210 










215 










220 










Lys 


Ala 


Arg 


Ser 


Ala 


Gly 


Gly 


Gly 


Arg 


Glu 


Met 


Ser 


Leu 


Ser 


Ala 


Asn 


225 










230 










235 










240 


Gly 


lie 


Leu 


11 2 


Phe 


Ser 


Asp 


Pro 


Phe 


Ser 


Met 


Thr 


Ser 


Asn 


Glu 


Val 










245 








250 










255 




Ser 


A5n 


Ser 


Tyr 


Leu 


Ala 


Gly 


Lys 


Lys 


Arg 


Arg 


Leu 


Tyr 


His 


Ser 


Thr 








260 










265 










270 






Pro 


net 


Asn 


Ser 


Leu 


Val 


Asn 


Glu 


Leu 


Arg 


Leu 


Asp 


Ala 


Asn 


Tyr 


Ser 






275 










280 










285 








net 


Thr 


Gly 


Asp 


Ala 


Val 


Asn 


Leu 


Asp 


Phe 


lie 


Glu 


Val 


Ala 


Thr 


Gin 




290 










295 










300 










Asn 


Asp 


Leu 


Arg 


Tyr 


Asp 


Gly 


Ala 


Pro 


Met 


His 


He 


Arg 


Arg 


Phe 


Ser 


305 










310 










315 










320 


Asn 


Leu 


Pro 


Val 


Leu 


Gly 


Gly 


Glu 


Ser 


Cys 


Arg 


Phe 


Val 


He 


Ser 


Glu 










325 










330 










335 




Val 


Pro 


Glu 


Ser 


Leu 


Val 


Val 


Leu 


Gin 


Ala 


Asn 


Ser 


Ser 


Leu 


Thr 


Ala 








34 0 










345 










350 






Ser 


Leu 


Val 


Pro 


Val 


Lys 


Thr 


Val 


Gly 


Asp 


Lys 


Thr 


He 


Glu 


Phe 


Val 






355 










360 






365 








Ala 


Pro 


Pro 


Lys 


Gly 


Gin 


Asp 


Arg 


Arg 


Thr 


He 


Asn 


Thr 


Phe 


Tyr 


Ala 




370 










375 










380 










Val 


Asp 


Leu 


Ser 


Gin 


Ala 


Ser 


Ala 


Pro 


Glu 


He 


Leu 


Gly Ala 


Val 


Pro 


3S5 










390 










3 95 










4 00 


Asn 


Gid 


Asn 


Leu 


His 


Gly 


Glu 


Glu 


He 


Pro 


Asp 


Leu 


He 


He 


Val 


Ser 










405 










410 










415 




Thr 


Gin 


Ala 


Leu 


Leu 


I*eu 


Glu 


Ala 


Asp 


Arg 


Leu 


A] a 


Thr 


Tyr Arg 


Arg 








420 










425 










430 






Glu 


Lys 


Asn 


Gly 


Leu 


Lys 


Val 


Leu 


Val 


Val 


Leu 


Gin 


Glu 


Gin 


Val 


Phe 






435 










440 










445 








Asn 


Glu 


Pile 


Ser 


GJy 


Gly 


Thr 


Pro 


Asp 


Ala 


Thr 


Ala 


Tyr 


Arg 


Leu 


Phe 




450 










4 55 










460 










Ala 


Lys 


Met 


Phe 


Tyr 


Asp 


Arg 


Trp 


Lys 


Ala 


Asn 


Ala 


Pro 


Val 


Gly Glu 


465 










470 










475 










4S0 


Thr 


Phs: 


Pro 


Met 


Gin 


Met 


Leu 


Leu 


Phe 


Gly 


Affp 


Gly 


Ala 


His 


Asp 


Asn 










485 










4 90 








4 95 




Arg 


Lys 


Val 


Ser 


Val 


Ala 


Trp 


Gin 


Lys 


Pro 


Tyr 


Leu 


Gin 


Gin 


Thr 


Glu 








500 










SOS 










510 






Phe 


L*u 


Leu 


Thr 


Phe 


Gin 


Ala 


Val 


Asn 


Ser 


Thr 


Asn 


Val 


Asn 


Ser 


Tyr 






515 










520 










525 






Val 


Thr 


Affp 


Asp 


Tyr 


Phe 


Gly 


Leu 


Leu 


Asp 


Asp Gin 


Pro 


Ala 


Ser 


Val 




530 










535 










540 
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Asn He Gly Trp Arg Asn Tyr Asn Met Ala Val Gly Arg Phe Pro Val 
545 550 555 560 

Arg Thr Pro Ala Glu Ala Arg He Ala Val Asp tys Thr T]e Arg Tyr 
565 570 575 

5 Glu Glu fiiSp Arg Glu Scr Gly Ala Trp Arg He Arg Ala Cys Phe Ala 
580 585 590 

Ala Asp Asn Gly Asp Lys His Ala Thr Glu Thr Ser Arg Lsu lie Asrp 

S95 600 €05 

Thr Val Lys Arg Tyr Ala Pro Ala He Met pro Val Arg Ala i?he Gin 
10 610 615 620 

Asp Val Tyr Pro His Val He Glu Asn GLy L«u His Ser He Pro Gly 
625 630 635 640 

Ala Lyp Lys Lys Met Leu Glu Thr Leu Gin Ser Gly He He Leu Leu 
645 650 655 

15 Asn Tyr Ala Gly His Gly Gly Pro Ala Gly Trp Ser Asp Glu His Leu 
660 665 670 

Leu Thr Leu Asn Asp He His Lys Phe Asn Tyr Lys His Met Pro He 

675 680 685 

Trp He Thr Ala Thr Cys Asp Phe Ala Asn Tyr Asp Ser Gin Thr Thr 
20 690 695 700 

Ser Ala Gly Glu tilu Val Phe Leu His Glu Lys Ser Gly Thr Pro He 
70S 710 715 720 

Met Phe Scr Thr Thr Arg Val Val Tyz Ai*n Thr Gin Asn Glu Lys He 
725 730 735 

25 Asn Gly Ph« ll«t Leu Arg Arg Met Phe Glu Lys Ala Lys Asp Gly Arg 
740 745 750 

Tyr Arg Thr ftct Gly Glu He He Arg Ser Ala Lys Gin Gly Met Leu 

755 760 765 

Ser Thr Val Phe Pro Asp Ser He Asn Gin Leu Ser Phe Phe Leu Wet 
30 770 775 780 

Gly Asp Pro Scr Val Arg Met Asn Leu Pro Thr His Lys Val Gin Leu 
7B5 790 795 800 

Thr Ala He Asn Gly Gin Asp Pro Glu Gly Gin Tyr Gly Thr He Met 
805 810 815 

35 Leu Lys Ser Leu Glu Arg Val Ala Leu Lys Gly Lys Val Thr Asp Glu 
820 825 830 

Lys Gly Thr Phe Asp Glu Thr Phe Ser Gly Lys Val Phe Leu Thr Val 

835 840 845 

Phe Asp Gly Arg Lys Lys Met Thr Ala Leu Glu Glu Glu Gly Asn Asp 
40 850 855 860 

Leu Ser Leu Val Tyr Tyr Asp Tyr Pro Asn Val Met Tyr Ala Gly He 
865 870 875 080 

Ala Glu Val Lys Asp Gly I*eu Phe Glu Thr Ser Phe He Val Pro Lys 
885 690 895 

45 Asp Val Asn Tyr Ser Glu His Glu Gly Arg He Asn Leu Tyr Ala Tyr 
900 905 910 

Asn Glu Scr Thr Lys Ala Glu Ala Met Gly Val Asp Phe Ser He Arg 

915 920 925 

Val Gin Pro Gly He Pro Asp Glu Val Thr Glu Asp Asn Thr Pro Pro 
50 930 935 940 

Glu He He Scr Cys Phe Leu Asn Asp Ser Thr Phe Arg Ser Gly Asp 
945 950 955 960 

Glu Val Asn Pro Thr Pro Leu Phe Met Ala Glu Val Phe Asp Leu Asn 
965 970 975 

55 Gly He Asn He Thr Gly Ser Gly Val Gly His Asp He Thr Leu Cys 
980 985 990 

He Asp Gly Arg Ala Asp Leu Thr Tyr Asn Leu Asn Ala Tyr Phe Thr 

995 1000 1005 

Ser Ser Ala Thr Asp Ala Gly Val Gly Thr He Leu Phe Met He Pro 
60 1010 1015 1020 

Ala Leu Ala Glu Gly Asp His Thr Ala Arg Leu Thr Val Trp Asp He 
1025 1030 1035 1040 

Phe Asn Asn Ala Val His Hii# Asp Phe Ser Phe Arg Val Val Asp Gly 
1045 1050 1055 

65 He Ala Pro Asp Val Ala Asp Val He Leu Phe Pro Asn Pro Val Arg 
1060 1065 1070 

Glu Ser Ala Thr Phe Arg He Phe His Asn Arg Pro Gly Ser Asp Leu 

1075 1O80 1065 

Asn Val Ala Val Glu He Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser 
70 1090 1095 1100 

Leu Pro Val Lys Thr Tyr Sec Ser Scr Tyr Gly Glu Pro He Glu He 
1105 1110 HIS 1120 

Lys Trp Asp Leu Thr Ser Lys Tyr Gly Val Lye He Gly Asn Gly Phe 
1125 1130 1135 

75 Tyr Leu Tyr Arg Cys Val Val Asn Ser Pro Gly Gly Gin Thr Ala Ser 
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1140 1145 
Met. Ala Lys Lys Met He Val Val Gly Gin 
1155 1160 



1150 
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(21 INTOWIATION FOR SEQ ID «0:381 

(i| SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 973 amino actdc 

(B) -nfFE: amino acid 
(D» TOPOLOGY: linear 

(11) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(ix) FEATURE: 

(AJ umc/KZ't: mi5c_feature 
(E) LOCATION 1. . .973 





(xl) 


SEQUENCE DESCRIPTION: SEQ ID NO: 301 










Phe 


Tyr 


Thr 


Gly 


He 


Asp 


eu 


Hi s 


He 


Glu 


S<»r 


Gin 


Met 


Lys 


Lys 


Leu 


1 








5 










1 0 










15 




Phe 


Pro 


Leu 


Leu 


Leu 


eu 


He 




Ser 


He 


Leu 


Val 


Gly Cys 


Gly 


Lys 








20 










25 










30 






Lys 


Glu 


Lys 


Hi? 


Ser 


Val 


Thr 


Glu 


He 


Ala 


rg 


Glu 


Lys 


Lys 


Arg 


He 






35 










40 










45 








Thr 


Ala 


Leu 


Leu 


Tyr 


Glu 


Lys 


Glu 


Leu 


Pro 


Thr 


Asp 


Ser 


Val 


Lys Gin 




50 










55 










60 










Leu 


Tyr 


Glu 


Asn 


Ser 


Val 


Gin 


Asn 


Lys 


Asn 


Leu 


Val 


Gly Gin Met 


Leu 


65 










70 










75 










80 


Phe 


Ala 


He 


Glu 


Val 


Gly 


Lys 


Arg 


Met 


Arg 


Asn 


Met 


Ser 


Gin 


Tyr 


Thr 










85 










go 










95 




Asp 


Ala 


Met 


Leu 


Tyr 


His 


Gin 


Lys 


Gly 


Leu 


Asn 


Ala 


Ala 


Leu 


Asn 


Leu 








100 










105 










110 






Arg 


Afp 


Thr 


He 


Val 


Ala 


Ala 


Gin 


Ala 


Trp 


Asn 


His 


Leu 


Gly Thr Asp 






5 15 










120 








125 








Ser 


Arg Arg 


He 


Gly 


Ala 


Leu 


Ala 


Glu 


Ala 


Ser 


Asp 


Tyr 


His 


Tyr 


Lys 




130 










135 










140 










Ala 


Leu 


Ser 


Leu 


He 


Glu 


Ser 


Phe 


Ser 


Gly 


Asn 


Gin 


Asn Arg 


Pro 


Ala 


14S 










ISO 








155 










160 


He 


Lys 


Ala 


Arg 


Ser 


Ala 


Ala 


Leu 


Asn 


Gly 


He 


Gly 


Asn 


He 


Asn 


Leu 










165 










170 










175 




Glu 


Leu 


Gly 


Tyt 


His 


Asp 


Glu 


Ala 


Glu 


Lys 


Asn 


Phe 


Leu 


Lys 


Ala 


Leu 








IBO 










185 










190 






Gin 


Gly Glu 


Lys 


Glu 


Leu 


Asp 


Ser 


Pro 


Leu 


Gly 


Gin 


Ala 


He 


Asn 


Tyr 






195 










200 








205 








Ala 


Asn 


Leu 


Gly 


Arg 


He 


Tyr 


Arg 


Gin 


Arg 


Lys 


Glu 


Tyr Asp Lys Ala 




210 










215 










220 










Arg 


Thr Tyr 


Phe 


Leu 


Leu 


Ser 


Leu 


Glu 


Gin 


Asn 


Asn 


Met 


Ala 


Glu 


Asn 


225 










230 










235 










240 


Leu 


Met Gly 


He 


Gly 


Leu 


Cys 


Ser 


He 


Asn 


Leu 


Gly 


Glu 


Val 


Asp 


Glu 










2-15 










250 










255 




Glu 


Lys 


Gly 


Asp 


Tyr 


Gin 


Lye 


Ala 


I,eu 


Gin 


Glu 


Tyr 


Ala 


Thr 


Ala 


Tyr 








260 










265 










270 






Lys 


Leu 


Met 


Glu 


Gin 


Leu 


Ser 


Asp 


Arg 


Trp 


His 


Trp 


Leu 


Asn 


Ser 


Cys 






275 










280 










285 








He 


Pro 


Met 


Ala 


Arg 


He 


Asn 


E^U 


Lya 


Gin 


Gly 


Asn 


Glu Arg 


Leu 


Tyr 




290 










295 










300 










Gin 


His 


Phe 


He 


Ser 


Leu 


Ala 


Glu 


Gly 


Thr 


Ala 


Lys 


Glu 


He 


Asn 


Ser 


305 










310 










315 








320 


Thr 


Ser 


His 


Leu 


He 


Glu 


He 


Tyr 


Asn 


Leu 


Gin 


Tyr 


Glu 


Asn 


Leu 


Glu 










325 








330 










335 




Arg 


Ly5 


Lys 


Glu 


Tyr 


Lys 


Gin 


Ala 


Leu 


Glu 


Ala 


Phe 


Cys 


Leu 


Ser 


Lys 








340 










345 










350 






Thr 


Leu 


Ser 


A^p 


Ser 


Met 


Ser 


He 


Ala 


His 


Lys 


Val 


Ser 


Ser 


He 


Gin 






355 










360 








'365 








Glu 


Thr Arg 


Phe 


A^n 


Tyr 


Glu 


Arg 


Asn 


Lys 


Ser 


Gin 


Lye 


Glu 


Leu 


Glu 




370 










375 










380 










Glu 


He 


Gin 


Gin 


Val 


Ser 


Lys 


Ala 


Lys 


Gin 


Glu 


Lys 


Ser 


Lys 


Phe 


He 


385 










390 










395 








4 00 


Leu 


Leu 


Ser 


Thr 


Leu 


Phe 


Ala 


Leu 


Phe 


He 


Ser 


He 


Leu 


Leu 


He 


Ser 
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40S J15 
Val Lf»u Thr Tyr Ala Tyr Arg Gin Gly Lys Lys His Asn Lys Leu II- 

420 425 430 

Lys Glu Thr Asrp Lys Leu Arg Ser Gly Phe Phe Thr Gly He Thr His 

435 440 445 

Glu Phe Arg Thr Pro lie Thr Val lis Gin Gly Leu Asn Glu Lvs Met 

'150 455 460 

Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Arg Thr Glu Leu His Ly;^ 
4V0 475 .180 

11c He Afp Arq Gin Ser Ser His Met I.eti A^n L-u Val Asn Gin Leu 

485 4 49c 

Leu Asp 11 ^? Cys Lys He Arg Ser Gly Val Ser Thr Pro Glu Trp Arg 

500 503 510 

Asn Gly /Ksp He Val Ser Phe Val Gin He Leu He Asp Ser Phe Ala 

•'^1'=' S20 525 

Pro Tyr Ala Gin Ala Gin Asp He Thr Leu Glu Leu Gin Pro Glu S»r 

S30 535 540 

Lys Pro He Val Val Asp Phe Val Pro Ser Tyr Leu Gin Lys He He 
S'iS 550 555 

Ser Asn Leu Leu Ser Asn Ala He Lys Tyr Ser Leu Ala GJ y Gly Arg 

S€5 570 575 

Val Val He Ser Leu Ala Lys Thr Lys Asn Glu Lys Asn Leu He He 

S80 585 590 

Arg Val Ala .Asp Asn Gly He Gly He Asp Lys Thr Asp Gin Ala His 

600 605 
He Pht A.-p He Phe Tyr Arg Gly GLr. Ser Ala Thr Glu Lys His Glv 

eiO 615 620 

Ser Gly Val Gly Leu Ser Phe Thr Asn He Leu Val Glu Asn Leu Arq 
625 630 635 640 

Gly Thr He Lys Val Glu Ser Gin Pro Gly Lys Gly Ser Ala Phe Thr 

650 655 
He Ser He Pro Thr Gin Asn Gin Ser Ser Ser Ala Glu He Leu Pro 

660 665 670 

Trp Lew Pro Ser Ser Asp Asp He VaJ Met Pro Val His He Aia Fro 

675 680 685 

Asp Asp Ser Fro Thr Ser Pro Met Val Ala Ala Leu Asn His Arg Ph- 

690 695 700 

Glu Asp Glu Arg Pro Thr He Leu Leu Val Glu Asp Asn Lvs Aso 11° 
705 710 715 720 

Asn Leu Leu Val Lys Leu Leu Leu Cys Asp Arg Tyr Asn Val Leu Ser 

■^25 730 735 

A]r. Ala Asn Gly Lys Glu Gly He Ala Leu Ala Thr Glu His He Pro 

7^0 745 750 

Asp He He He Thr Asp He Met Met Pro He Met Asp Gly He Glu 

■'5i> 760 765 

net Thr He Arq Met Lys Gin Ser Pro Leu Leu Cys His He Pro Il«» 

770 77i, 780 

Val Ala Leu Thr Ala Lys Ser Thr Glu Gin Aso Arg Leu Glu Gly lie 
790 795 800 

Lys Ser Gly Val Val Ser Tyr Leu Cys Lys Pro Phe Ser Pro Glu Glu 

835 810 815 

Leu Leu Met Arq He Glu Gin Leu Leu Lys Asp Arg Glu Leu Leu Lys 

820 825 830 

Lys Phe Tyr Met Gin Lys Leu Met Leu Asp Arg Lys Pro Glu Glu Glu 

83S 840 845 

Pro Gin Pro He Asp Asp Ser Ser Met Gin Phe Leu Leu Ala Ala Lvs 

8S0 855 860 

Asp Ala Val Ser Gly Gly He Lys Gin Asn Pro Asp Phe Ser Ala Gin 
865 870 875 BfiO 

Asp Leu Ala Glu Lys Met Cys Met Ser Pro Ser Gin Leu Asn Arg Lyr 

890 895 
Leu Thr Ser Val Val Gly Cys Ser Thr He Gly Tyr He Gin Gin H* 

?00 905 910 

Lys He Lys Leu A] a Cys Lys Leu Leu Ala Asp Glu Ser Lys Asn He 

91^> 920 925 

Ser Asp He Ser He Glu Ala Gly Phe Ser Asp Pro Ala Tyr Phe Ser 

530 935 
Arg Thr Phs Lys Arg Tyr Met Asn Cys Ser Pro Ser Gin Tyr Arg Gin 

950 955 
Lys L-u Leu Ala ilet Pro Gly Ser Asp Lys Glu Thr Val 
965 970 

(2) IIIFOWIATIOH rOR SEQ ID HO: 382 

(i) SEQUF.riCE CHARACTER I ST ICS: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ti^^j )l=ni -ip-''" U^i M =™-M -iP^ -rf- fi«H> -iT"}* -n="ii 

:il„ii ^'X^ r3 O ^« U3 iu lUl O iC; ^iUi 



WO 99/29870 PCT/AU98/01023 

318/490 

(A) LENGTH: 563 amino aclcJr; 

(B) TYPE: amino acid 
(D> TOPOLOGY: linear 

(li) MOLECULE TYPt: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A} ORCAMISH: Porphyromonas yingivalis 

UxJ FEATURE: 

(A) NAHt/fXi: niisc featurf* 
(E) LOCATION I. . .563 

(xi) SEQUENCE DESCRIPTION: SEQ ID MO:3B2 

His His Lvs Thr Tyr Gin Thr Met Lys Lys Leu Leu Gin Ala Lys Ala 

Leu lie Leu Ala Leu Gly Leu Phe Gin Leu Pro Ala lie Ala Gin Thr 

20 25 30 

Gin Met Gin Ala Asp Arg Thr Asn Gly Gin Phe Ala Thr Glu Giu Wet 

35 40 45 

Gin Arg Ala Phe Gin Glu Thr Asn Pro Pr- Aia Gly Pro Val Arg Ala 

55 60 
lie Ala Glu Tyr Glu Arg Ser Ala Aln Val L-u Val Arg Tyr Pro Ph- 

7 0 75 
Gly He Pro Met Glu Leu He Lys Glu Leu Ala Lys Asn Asp Ly... Val 

He Thr He Val Ala Ser Giu Ser Gin Lys fKsrx Thr Val II- Thr Gin 

10b no 

Tyr Thr Gin Ser Gly Val Asn Leu Ser A^n Cys A^p Phe He He Ala 

120 125 

Lys Thr A^rp ser Tyr Trp Thr Arg Asp Tyr Thr Gly Trp Phe Ala Met 

135 2.40 



Tyr A^p Thr Asn Lys Vol Gly Leu Val Asp Phe He Tyr Asn Arg Pro 

150 155 ^ 

Arg Pro A/m Asp Asp Glu Phe Pro Lys Tyr Glu Ala Gin Tyr Leu Gly 
165 

He Giu Met Phe Gly Met Lys Leu Lys Gin Thr Gly Gly As-n Tyr M-t 



100 

Thr Arp Cly Tyr Gly Ser Ala Val Gin Ser Hie H- Ala Tyr Thr Glu 

195 200 205 

Asn Ser Ser Leu Ser Gin Ala Gin Val Asn Gin Lys Met Lys Asp Tyr 

Leu Gly He Thr His His Asp Val Val Gin A.-^p Pro Asn Gly Giu Tyr 

230 ''SS 
He Asn His Val Asp Cys Trp Gly Lys Tyr Leu Ala Pro Asn Lys H- 

245 2£0 255 

Leu He Arg Lys Val Pro Asp Asn His Pro Gin His Gin Ala Leu Glu 

260 265 270 

Asp Met Ala Ala Tyr Phe Ala Ala Gin Thr Cys Ala Trp Gly Thr Lys 

280 285 
Tyr Glu val Tyr Arg Ala Leu Ala Thr Asn Glu Gin Pro Tyr Thr Asn 

'^'^ 295 300 

Ser Leu He Leu Asn Asn Arg Val Phe Val Pro Val Asn Gly Pro Aia 
310 

Ser Val Asp Asn Asp Ala Leu Asn Val Tyr Lys Thr Ala Mot Pro Gly 

■^^^ 330 335 

Tyr Glu He He Gly Val Lys Gly Ala Ser Gly Thr Pro Trp Leu Gly 

345 350 
Ihr Asp Ala Leu His Cys Arg Thr His Glu Val Ala Asp Ly..- Gly Tyr 

360 365 i i-^ 

Leu Tyr He LyP His Tyr Pro He Leu Gly Glu Gin Ala Gly Pro Asp 

Tyr Lys He Glu Ala Asp Val Val Ser Cys Ala ^n Ala Thr He Ser 

390 395 
Pro Val Gin Cys Tyr Tyr Arg He Asn Gly Ser Gly Ser Phe Lys Aia 

Ala Asp Met Thr Met Glu Ser Thr Cly His Tyr Thr Tyr Ser P^e Thr 

Gl, Leu A.-r. Lys Asn Asp Lys Val Glu Tyr Tyr He Ser Ala Ala A^p 
■i^S 440 4^5 ^ 

Asn Ser Gly Arg Lys Glu Thr Tyr Pro Phe He Gly Glu Pro Asp Pro 
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Phe 


Lys 


Phe 


Thr 


Cys Met 


Asn 


Glu 


Thr 


Asn 


Thr 


Cys Thr 


Val 


Thr 


Gly 


46S 








170 










475 






480 


Ala 


Ala 


Lys 


Ala 


Leu Arg 


Ala 


Trp 


Phe 


Asn 


Ala 


Gly Arg 


Ser 


Glu 


Leu 










463 








4 90 








495 




Ala 


Val 


Ser 


Val 


Ser Leu 


Asn 


lie 


Ala 


Gly 


Thr 


Tyr Arg 


lie 


Ly-r 


Leu 








500 








505 






51C 




Tyr 


Aj?n 


Thr 


Ala 


Gly Glu 


Glu 


Val 


Ala 


Ala 


Met 


Thr Lys 


Glu 


Leu 


Val 






515 








520 








525 








Ala 


■31 y 


Thr 


Ser 


Val Phe 


Ser 


Met 


Asp 


Val 


Tyr 


Ser Gin 


Ala 


Pro 


Gly 




530 








535 








54 0 






Thr 


Tyr 


Val 


Leu 


Val Val 


Glu 


Gly Asn 


GlV 


lie 


Arg Glu 


Thr 


Met 


Lys 


545 








550 










555 








560 


lie 


Lsu 


T.y.-. 

























t2) IMFORMATIOM fOR SEQ ID MO; 383 

(il SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 437 amino acids 

(B) TYPE: amino acid 
{D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(iiij HYPOTHETICAL: YES 

tvi) ORIGIHAL SOURCE: 

(A) ORGAJJISH: Porphyromonas gingiva li^ 

(ix) FEATURE: 

(A) NAME/ KEY: ml 3c_f eature 

(B) LOCATION 1...437 



(Xi) SEQUENCE DESCRIPTTOtJ : SEQ ID WO: 383 



Thr 


Thr 


Asn 


Arg 


Lys 


Pro 


Asn 


Thr 


Asn 


Met 


Li'3 


Leu 


Ser 


Ser 


Lys 


Lys 


1 








5 










10 










15 


lie 


Leu 


Ala 


lie 


He 


Ala 


Leu 


Leu 


Thr 


Met 


Gly 


His 


Ala 


Val 


Gin 


Ala 








20 










25 










30 






Gin 


Phe 


Val 


Pro 


Ala 


Pro 


Thr 


Thr 


Gly 


He 


Arg 


Met 


Ser 


Val 


Thr 


Thr 






35 










40 








45 








Thr 


Lys 


Ala 


Val 


Gly 


Glu 


Lys 


He 


Glu 


Leu 


Leu 


Val 


Hts 


Ser 


He 


Glu 




50 










55 










60 










Lys 


Lys 


Gly 


lie 


Trp 


He 


Asp 


Leu 


Asn 


Gly 


Asp 


Ala 


Thr 


Tyr 


Gin 


Gin 


65 










70 










75 








80 


Gly 


Glu 


Glu 


Tift 


Thr 


Val 


Phe 


Asp 


Glu 


Ala 


Tyr 


His 


Glu 


Tyr 


Thr 


He 










85 










90 










95 




Gly 


Thr 


Gin 


Thr 


Leu 


Thr 


Tie 


Tyr 


Gly Asn 


Thr 


Thr 


Arg 


Leu 


Gly 


cys 








100 










105 








110 


Arg 


Ser 


Thr 


Gly Ala 


Thr 


Ala 


Val 


Asp Val 


Thr 


Ly^T 


Asn 


Pro 


Asn 


Leu 






115 










120 








125 








Thr 


Tyr 


Leu 


Ala 


Cys 


Pro 


Lys 


Asn 


Asn 


Leu 


Lys 


Ser 


Leu 


Asp 


Leu 


Thr 




130 










135 










14 0 








Gin 


Asn 


Pro 


Lys 


Leu 


Leu 


Arg 


Val 


Trp 


Cys 


A.-^p 


Ser 


Asn 


Glu 


He 


Glu 


145 










150 










155 










160 


Ser 


Leu 


Asp 


Leu 


Ser 


Gly 


Asn 


Pro 


Ala 


Leu 


He 


He 


Leu 


Gly 


Cys 


Asp 










165 










170 








175 


Arg 


Asn 


Lys 


Leu 


Thr 


Glu 


Leu 


Lys 


Thr 


Asp 


Asn 


Asn 


Pro 


Lys 


Leu 


Ala 








180 










185 










1 00 






Ser 


Leu 


Trp 


Cys 


Ser 


Asp 


Asn 


Asn 


Leu 


Thr 


Glu 


Leu 


Glu 


Leu 


Ser 


Ala 






1 95 










200 










205 








Asn 


Pro Arg 


Leu 


Asn 


Asp 


Leu 


Trp 


Cys 


Phe 


Gly 


Asn 


Arg 


He 


Thr 


Lys 




210 










215 








220 








Leu 


Asp 


Leu 


Ser 


Ala 


Asn 


Pro 


Leu 


Leu 


Val 


Thr 


Leu 


Trp 


Cys 


Ser 


Asp 


225 










230 










235 






240 


A£n 


Glu 


Leu 


Ser 


Thr 


Leu 


Asp 


Leu 


Ser 


Lys 


Asn 


Ser 


Asp 


Val 


Ala 


Tyr 










2 45 










250 










255 


Leu 


Trp 


Cys 


Ser 


Ser 


Asn 


Lys 


Leu 


Thr 


Ser 


Leu 


Asn 


Leu 


Ser 


Gly 


Val 








260 










265 










27 0 




Lys 


Gly 


Leu 


Ser 


Val 


Leu 


Val 


Cys 


His 


Ser 


Asn 


Gin 


11* 


Ala 


Gly 


Glu 






275 










280 










285 






Glu 


Met 


Thr 


Lys 


Val 


Val 


Asn 


Ala 


Leu 


Pro 


Thr 


L*:u 


Ser 


Pro 


Gly Ala 




290 










295 










300 










Gly Ala 


Gilt 


Ser 


Lys 


Phe 


Val 


Val 


Val 


As-p 


Leu 


I.ys 


A£?p Thr 


Asp 


Glu 
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Ly« Asn lie Cys Thr Val Lys Asp Val G)o Lys Ala Ly^ Ser Lys A^n 

330 

Trp Arq Phe A^p Phe Asn Gly A»p Ser Asp A^n Met L=u Pro" Tyr 

5 Glu Gly S=r Pro Thr Ser Asn Leu Vol Asp Ala Pro Thr Val Arg 

360 

Tyr Pro A.^n Pro Val Gly Arg Tyr Ala Leu Val Glu lie Pro Glu 

375 

10 385 390 ""^^ 

val ryr Sc. Phe Ala Val Glu Ser Leu Arg Gl'n Asn lie A^p Leu ^hr 

410 

His Leu Pro A.jp Gly Thr Tyr Phe Phe Arg L^u Asp A.^n Tyr Thr Thr 
1 c 425 j^n 

15 Lys Leu He Lys Gin 

435 

(2) IHFORIIATION FOR SEQ ID MO: 384 

20 (i) SEQUENCE CHARACTERISTICS- 

(A] LEiJGTH: 318 amino acids 
IB) TYPE: amino acid 
<DI TOPOLOGY: linear 

25 (11) MOLECULE TYPE: protein 

(ill) HVPOTMETICAr.: YES 



30 



35 



(vi) ORIGINAL SOURCE: 

(A) ORGAIJISH: Porphyromonas glngivalis 
(ix) FEATURE: 

(A) HAIIE/KEY: misc featur- 

(B| LOCATION 1. - .318 

(xij SEQUENCE DESCRIPTION: SEQ ID NO:384 



A^n His Leu Lys Thr A^n He Lys Met Ary Lys Thr He He Phe Cys 

40 L.„ Leu Leu Ala Leu Phe Gly Cys Ser ^rp Ala Gin Glu Arg vfl Asp 
, 25 30 

Glu Ly. Val Ph« ser Ala Gly Thr Ser He Phe Arg Gly lie Leu Glu 

45 lo^ ''^''^ '^f '■V'^ '^^y "'P Val Trp Gly Met 

Ala Arg Ala Ser Glu A^p Phe Phe Phe Tie Leu Pro Val Thr A^p Asp 
Leu Thr Pro Val L=u Phe Tyr A.n Arg Leu ^hr A«n Glu Pro Cys Phe 

50 Val ser A.p Gin oty He Thr Glu Tyr Phe Ly^ Phe Ala Gin "u Gly 
Tvr Ue Glu Val Glu Gly Ser Ser Val Phe Met Ala Leu Leu 

55 Til ""^ ^""^ -^""^ ^'-^ Asn Ala Pro He 

Glu Gly val Val Scr Ly3 Thr Gly A.n Pro Ala P^f Thr He Pro Met 

150 1 55 , , 

Leu Pro Gly val Ser A.p Cys 11= clu He Ser Asn Asn Arg Ly. Va? 

60 Phe Leu Thr Asn Gl„ Lou Gly Val Val il.*! He Thr Asp Gly .'It Glu 

Pro Pro He He Ala Gly Val Ser Ma Ser Tyr Gly Ser Ser Val Arg 

200 205 

65 "'^ ^" l\2 """^ "'"^ 'y-" 



Leu Asp He Tyr Pro Thr Asn Cys Tyr Pro Leu Ser Thr Lys Pro Val 
220 235 9<in 

Ala Gly A_.p Asp Glu Val Phe Val Lys Gin Gin Gly Arg Gin He g" 

70 He Asp ser Asn Ser Pro Ho Val Gin wll Val Val" Tyr Asp Leu Glu 

Gly Lys ser Val Phe Arg Lys Arg Met Thr Glu Asn Ala Tyr Thr Leu 

280 205 
Ser Phe Arg Ala Pro Met Leu Gly Phe MeL Thr Tie Met He Glu Thr 



75 



295 



300 
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Gin Afn Ser lie 11- Am Lys Lys Leu Asn Val Thr Gin L*u 
305 310 

iZ] IHFORMATTOIJ FOR SEQ ID NO: 385 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: -J 61 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: prot*in 

(ill) HYPOTHETICAL: YES 



15 (vi) ORIGIirAL SOLiRCF.r 

(A) ORGAJJISII: Porphyromonas gingivali« 

(iy;) FEATURE: 
«n <A) NWIE/KT.V: misc feature 

(B) LOCATIOJJ 1...461 

(xi) SEQUEIICE DESCRIPTION: SEQ ID HO:385 

Arg Arg Al;i Val Lys He Arg Ser Pro Pro His He His Ser Leu Ph- 
J- 1 rt - _ " 



val A£« Lys Cy^ Leu Phe Ser Asp Ph* Lys Tyr Leu Tyr Leu Ser Arg 
Lys lie Thr ..In Glu Arg Leu Gly Krg Leu Ser He Ary Leu I.y., s-r 



30 Tyr Asn Pro He n« He lie Glu Met Lys Lys Thr Thr He He Ser 



SO SS 



L|u He val Phe Gly Ala ^he Pho Ala Ala Val G^y Gin Thr Lys A.,p 

Ash ser ser Ti't Lys Pro Phe Ser Lys Glu lip He Ala Gly Gly Val 

Tyr ser Leu Pro Thr Gin Asn Arg Ala «n Lys Asp Asn Ala Glu Trp 

105 110 
Leu Leu Thr Ala Thr Val Ser Thr Asn Gin Ser Ala A^p Thr HI, Phe 

40 De Phe Asp Glu Asn A^n Arg Tyr He Ala Arg Asp He Lys Ala A=n 

Gly Val Arg Ly. Ser Thr Arp Ser He Tyr Tyr ^^p Ala Asn Gly Arg 



Tie ser 111. Vai A.^ Leu Tyr He Ser Phe Ser Gly Gly Glu Pro w'a 

Leu A..-P Thr Arg Phe Lys Tyr Thr Tyr top A..p Glu Gly Ly^ till Thr 

Val Arg Glu Val Phe Met Leu Val "t A^-p Pro A.^n Thr Pro He S-r 

50 Arg Leu Glu Tyr Hi.. Tyr A^p Ala Gin Gly Arg Leu ?Ar His Trp He 

ser Phe Ala Phe Gly Ala Clu Ser Gin Lys Asn Thr Tyr His Tyr Asn 

235 7jn 
Glu Lyc Gly Leu Leu Val Ser Glu Val Leu Ser Asn Ala Met Gly Thr 

Thr Tyr Ser Asp Thr Gly Lys Thr Glu "r Ser Tyr A^p Asp L" Asp 
Asn Met Val Uy. Ala Glu Tyr Phe Vaf Val cin Gin Gly fys Ala Trp 
Gin Val Leu Ly* Arg Glu Glu Tyr Thr Tyr Glu Aep Ksn He Cys He 
Gin Tyr Leu Ala He Asn Gly Thr Asp Thr Lys Va^ Tyr Lys Arg Asp 
65 ^" '^l- Val n« A.'p He Pro S« 



ISO 



Met Pro Glu Gin Thr Trp Pro Asn Met Tyr Gly Ph. Asn Ala Lys Arg 
Leu Lys Glu Thr Tyr Ser Ser Tyr 111 Gly Asp Va . Al, Thr Pro He 
70 Phe Asp Tyr He Tyr Thr Tyr Lys Ala Leu Thr Ser Het Ala Thr Pro 
ser Thr Glu Ala Gin Val Ala Val Tyr Leu A.,n Pro Ser Thr Asp Arg 
75 :it-.^v5i^: '.'^ ^'^'^ ",[1 ryr Asp ieu 
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Gin Gly Lys L|u He Arg Asp Cys Al^ Leu Ser Gly Az.p Lys Val Glu 

net Gly yal Gly Ser Leu Thr Lys Thr Tyr Leu Leu Vys Val Asn 

Thr A^p Gin Gly Ala Phe Vnl tit Lys Val Val He Vrl 

455 460 
(2) INFORIiATIOM FOR SEQ ID NO:386 

(i) SEQUtHCE CHARACTERISTICS- 

(A) LENGTH: 4bl amino acldc 
(8) TYPE: amino acid 
(D) TOPOLOGY: linear 

(tl) nOLECULfc: TYPE: protein 

(iii) HYPOTHETTCAL: YES 

(v1) ORIGXHAL SOURCE: 

(A» ORGAfUSM: Porphyromonac gl m^i val 
(ix) ftLATURE: 

(A) IJA/IE/KEY: miac feature 

(B) LOCATIOH 1. . . 451 

(Ki) SEQUtllCE DESCRIPTIOM: SEQ ID HO:386 
net Ala Ly.. Val lie Lys Thr Lya Lys Gly Leu Ala Leu A^n Leu Ly.^ 
30 Gly Ly. Pro Leu Jro Glu Met Leu Ala Glu Pro Ala Gin Ser Pro Thr 
Tyr Ala Val Val Pro Asp A^p phe Glu Gly Val He Pro Lys Val Thr 

35 '''' 5^ ""'^ "-^^ XV '^^^ ^eu Met His Hi. 

Ly. Ala Tyr Pro Glu Het Ly^ Phe Thr Ser Pro Val Ser Gly Glu Val 
lie Ala Val Asn Arg Gly Ala Ly, Arg Lys Val Leu Ser He Glu Val 
40 Lys Pro A:,p Gly Leu Asn Glu Tyr Gl^. Ser The Pro Val Gly "p Pro 
Ser Ala Leu Ser Ala Glu Gin He lyl Glu Leu Leu Leu Ser Sor Gly 



10 



15 



20 



25 



45 



Met Trp Gly Phe He Lys Gin Arg Pro Tyr Asp He ^al Ala Thr Pro 
Asp He Ala Pro Ar, A|p H, Tyr He Thr Ala Phe Thr Ala Pro 



Leu Ala Pro Asp Phe iLIp Phe He Val Arg g" Glu Glu Arg Ala ieu 

G,n Thr Ala He Asp Ala Leu Ala Lys 12 Thr Thr Gly Lys vlf Tyr 

Val Gly Leu Lys Pro Gly ser Ser ieu Gly Leu His Asn 111 Glu li- 

200 ont; 

55 no '''' "'^ '''^ '^n val G?y Val Leu He 

Asn His Thr Lys Pro He Asn Arg Gly Glu Thr "l Trp Thr Leu Lys 
Ala Thr ASP Leu lie Val He Gly Arg Phe "u Leu Thr Gly Lys Ma 

60 Asp Phe Thr Arg Me^ He Ala Met Thr Gly Ser Asp Ala Ala Ala His 

Gly ryr Val Arg He Met Pro Gly Cys Arn Val Phe Ala Ser Phe Pro 
" 280 ^Qc 

65 ^Al ^"'^ ''^^ -^1" Arg val He Asp Gly A.n 

Val Leu ihr Gly Lys Lys Leu Ci-s Glu Lys Glu Pro Phe Leu Ser Ala 

Arg cys Asp Gin He Thr Val He Pro Glu .'iy Asp Asp Val Asp G^u 
70 Leu Phe Gly Trp Ala Ala Pro Arg Leu Asp Gin Ty'r Ser Met Ser Arg 

Ala Tyr Phe Ser Trp Leu Gin Oly Lys Asn Lys Glu Tyr Va'l Leu Asp 
75 37° "-^'^ >^^' »«t He net Ser Asn Glu Tyr 

375 3gy 
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A5P Arg Val Phe Pro Met Asp He Tyr Pro Glu Tyr Leu Leu Lys Ala 

395 

He He Ala Phe Ajp He Asp Lys Met Glu Asp Leu Gly H- Tyr Glu 

4<35 gio 
Val Ala Pro Glu Asp Phe Ala Thr Cys Glu Phe Val Asp Thr S-r L\-=^ 

"20 47.S 430 

He G]u Leu Gin Arg He Val Ary Glu Gly Leu Aflp Met Leu Tyr Lvs 

•»35 445 
Glu Met Asn 
450 

(2) ItfrOWlATIOM FOR SEQ ID NO: 307 

fi) SEQUEHCE CHARACTERISTICS: 

(A) LKNGTH: 195 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULt: TYPE: protein 

(iiij HYPOTHETICAT.: YES 

(vl) ORIGINAL SOLIRCE: 

(A) ORGANISM: Porphyromonas gingiva lis 

( ix ) FLATUKE : 

(A) NAi IE/KEY: misc feature 

(B) LOCATION 1 . . ,195 

(xi) SEQUENCE DESCRIPTION: SRQ ID NO:307 

Met Asp Lys Val Ser Tyr Ala Leu Gly Leu Scr He Gly Asn Asn Phe 

^ lO 15 

Lys Ser Ser Gly He Asp Ser Val Val Met Asp Asp Phe M-t Gin GZv 

20 25 30 

Leu Ser Asp Val Leu Glu Glu Lys Ala Pro Gin Leu Ser Tyr Asp Glu 

35 40 45 

Ala Lys Arg Glu He Glu Ala Tyr Phe Met Asp Leu Gin Gin Lys Al,n 

50 55 60 

Val Lys Leu Asn Lys Glu Ala Gly Glu Glu Phe T.eu Lys He Asn Ala 

70 75 p_ 

His Lys Glu Gly Val Thr Thr Leu Pro Ser Gly Leu Gin Tyr Glu Val 

85 90 95 

Hg Lys Met Gly Glu Gly Pro Lys Pro Thr Leu Ser Asp Thr Val Thx 

100 105 no 

Cys His Tyr Hi^- Gly Thr Leu He Asn Gly He Val Phe Asp Ser Ser 

120 i2b> 
Met. r.ly Glu Pro Ala Ser Phe Pro Lou Arg Gly Val He Ala 

Gly rrp Thr Glu He Leu Gin Leu Met Pro Val Gly Ser Lys Trp Ly 

1^0 155 
Val Thr He Pro Ser Asp Leu Ala Tyr Gly A^p Arg Gly Ala Gly Glu 

165 170 
His He Ly« Pro Gly Ser Thr Leu He Phe He He Glu Leu Leu S-r 
180 185 190 

He Asn Lys 
195 

(2) INFORIIATiON FOR SEQ ID NO: 388 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 273 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY; linear 

(ii) nOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(Ix) FEATURE: 

(A) NA/^E/KEY: miec_featur- 

(B) LOCATION 1 . . .273 
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t-Mi) 3EQUE1ICE DESCRIPTION: SEQ ID MO: 388 



10 



15 



20 



25 



30 



35 



40 



65 



70 



75 



net 


Lys 


Lys 


Ala 


Leu 


Leu 


lie 


Gly 


Ala 


Ala 


Leu 


Leu 


Gly Ala 


Val 


Ser 


1 








5 










1 0 








15 




Fh« 


Ala 


Ser 


Ala 


Gin 


Scr 


Leu 


Ser 


Thr 


He 


Lys 


Val 


Gin Asn 


Asn 


Ser 








20 










25 








30 






Vai 


Gin 


Gin 


Pro 


Arg 


Glu 


Glu 


Ala 


Thr 


He 


Gin 


Val 


Cys Gly Glu 


Leu 
















40 










45 






Ala 


Glu 


Gin 


Val 


Aep 


Cys 


11- 


Gly 


Thr 


Gly 


Asn 


Ser 


Ala He 


He 


Ala 




50 








55 










60 








Ala 


Aid 


Ala 


Ly.- 


Phe 


Glu 


Ser 


Asp 


Arp 


Leu 


Glu 


Ser 


Tyr Val 


Gly Trp 


65 










70 










75 








80 


GIu 


He 


net 


Sec 


Val 


Asp 


Phe 


Phe 


Pro 


GLy 


Tyr 


Lys 


Ala cys 


Lys 


Tyr 










36 










90 








95 




Thr 


S^f 


Alu 


VaV 


Trp 


Ala 


Asp 


Asp 


Met 


Thr 


He 


Leu 


Gly Gin 


Sor 


Glu 








100 










105 








110 






Asp 


Sec 


Arp 


Pro 


Glu 


Met 


Gin 


Thr 


He 


Asn 


Asn 


Leu 


Ala Leu 


Lys 


Thr 






: IS 










120 










125 






Ser 


Val 


Lyr 


11 e 


Glu 


Ala 


Gly 


Lysr 


Asn 


Tyr 


lie 


Val 


Gly Tyr 


lis 






IJO 










135 










140 










Thr 


Ala 


Gly 


Gly 


His 


Pro 


He 


•31 y 


cyt- 


Asp 


Gin 


Gly Pro 


Ala 


Vol 


145 










150 










155 








160 


Asp 


Gly 


Tyr 


Gly 


A3P 


Leu 


Val 


Ser 


He 


Ser 


Glu 


Asp 


Gly Gly 


Ala 


The 










165 










170 








175 




Phe 


Pro 


?ro 


Phe 


Glu 


Ser 


Leu 


lli* 


Gin 


Ala 


Val 


Pro 


Thr Leu 


Asn 


Tyi 








ISO 










185 








190 






Asn 


lis 


Tyr 


Val 


Val 


Val 


His 


Leu 


Lys 


Lys 


Gly 


Glu 


Gly Val 


Glu 


Ala 






195 










200 










205 






VaL 


Leu 


The 


Asn 


Asp 


Ly« 


Ala 


Asn 


Ala 


Tyr 


Val 


Gin 


Asn Gly 


Val 


He 




210 










215 










220 








Tyr 


Val 


Ali 


Gly 


Ala 


Asn 


Gly 


Arg 


Gin 


Val 


Ser 


Leu 


Phe Asp Met 


Asn 


22S 










230 










235 








240 


Gly Lys 


Val 


Val 


Tyr 


Thr 


Cly 


Vol 


Ser 


Glu 


Thr 


He 


Ala Ala 


Pro 


Gin 










245 










250 








255 




Lys 


Gly 


Met 


Tyr 


lie 


Leu 


Arg 


Vsl 


Gly 


Ala 


Lys 


Ser 


He Lys 


Leu 


Ala 








260 










265 








270 






Tie 































(2) IHfORllATIOlI FOR SEQ ID HO; 389 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEKGTH: 55-1 amino acids 
45 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

<ii> MOLECULE TYPE: protein 

50 (iii) HiPOTHETICAL: YES 

(ri) ORIGINAL SOURCE: 

(A) ORGATIISM: Porphyromonas gingival is 

55 (IK) FEATURE : 

(Al NAME/KEY: miac feature 

(E) LOCATION 1. . .554 



60 





{xi I SEQUENCE DESCRIPTION: SEQ ID MO:3P9 








net 


Pro Arg 


lis 


Met 


Lys 


Leu 


Lys 


He 


Ala 


Leu 


Arg 


Leu Lsu 


Leu 


Ala 


1 






S 










10 








15 




Thr 


The Ala 


He 


Val 


Leu 


Phe 


Ser 


Pro 


Leu 


Ala 


Lys 


Ala Gin 


Wet 


Asp 






20 










25 








30 






lie 


Gly Gly 


Asp 


Asp 


Vai 


Leu 


Tie 


Glu 


Thr 


llet 


Ser 


Thr I>eu 


Ser 


Gly 




35 










•JO 










45 






Tyr 


Ser Giu 


A&p 


Phe 


Tyr 


Tyr 


Lys 


Met 


Ala 


Val 


Ala 


Asp Asn 


Gly 


Trp 




50 








55 










60 








He 


Tyr Val 


Met 


Leu 


Asp 


Phc 


Ser 


Arg 


He 


Tyr 


Phil" 


Asp Asp 


Val 


Arg 


65 








70 










75 








BO 


Leu 


Tyr Arq 


Ser 


Lys 


Asp 


Gly 


Gly 


Ala 


Thr 


Tyr 


Gin 


Lys Leu 


Gly 


Ser 








85 










90 








95 




Leu Gly Ser 


Leu 


Val 


Pro 


Tyr 


Asp 


Phe 


Asp 


Vol 


Ser 


His Cys 


Asp 


Phfe 






100 










105 








HO 






He 


Val Thr 


Gly 


Lys 


Asp 


Glu 


Asp 


Asp 


He 


Asn 


Val 


Trp Thr 


Val 


net 



- X 
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115 










120 










125 










Al a 


Phe 


Gi u 


Tyr 


Val 


Gly Gly Thr 


lie 




Asn 


Gl y 


Vai 


L'^^u 


Leu 




1 30 










135 










14 0 










Met 


His 


Arcj 


Mis' 


Asp 


Ala 


Arp 


Tie 


Asn 


Asn 


Thr 


Glu 




Val 


Tyr 


Lys 


145 










150 










155 










160 


Lye 


Asp 


PJie 


Pro 


Asn 


Asn 


Arg 


Leu 


Met 


Gly 


Va 1 


Ala 


Xle 


Ala 


Ser 


Asn 








165 










170 










17 5 




Tyr 


Ar/3 


Ala 


Pro 


Ser 


Pro 


Tyr 


Gly Leu 


Gl y 


Gly 


Asp 


Pro 


Ph° 


Ala 


Let 1 






ISO 










185 










1 90 








V « J. 


Al a 


Val 


Scr 


Gly 


Ser 


Gly Ser 


Asp 




S^r 


Phe 


Leu 


Asp 








19S 










200 








205 








Tl * 


Pns 


Ser 


Leu 


Asp Gly Gly Val 


His 


Phe 


Giu 


Gin 


i*y 5 


Arg 


11** 


Tyr 




210 










215 










220 










Tht 


Accj 


P ro 


<a I r\ 


Lys 


Leu 


Thr 


lie 


Asn 


Arg 


val 


Asp 






eu 


. 

Giy 


225 










230 




















24 0 


Ser 


Thr 




Pro 


Ser 


Leu 


Gly 


Phe 


Apn 


Thr 


Trp 


Pro 


Leu 


Het 




\'u 1 










24 5 










'*5C 










255 




V«l 


Phe 


C7I u 


Meu 




Lys 


Affn 


Leu 


Asp 


Gly 


Phe 


Asp 


T 1 « 

J. re 


Gl 


Phe 


He 


















265 










270 






Ser 


Afn 


Fne 


Vn 1 


Asp 


Tyr 


Asp 


Pro 


Arg 


Tyr 


Al a 


Trp 


Ser 


Glu 


Pro 


1 1*^ 






"^75 










280 










285 








lie 


lie 


Glu 


Gl u 


Asp 


Cy.^ 


Gly 


Trp 


Thr 


Asp 


Pne 


Asn 


Pro 


7.1 *u 


Gly 


Al a 














295 










300 










Leu 


Ser 


lie 


C I u 


lie 


Gin 


Met 


Mst 


Leu 


Asp 


Asp 


Asn 


Ser 


Asp 


Asn 


Thr 


30b 










310 










315 










320 


Val 


lily 


Gly 


Giu 


Arg 


Sgc 


His 


Apn 


Fhe 


Leu 


lie 


Thr 


Tyr 


Pro 


Gly 


His 










325 










330 










335 




Tyc 


Val 


Tyc 


Pro 


Lyar 


Gin 


Ser 


Phe 


Asn 


Tyr 


Ser 


Pro 


Gly 


His 


Thr 


Pro 








34 0 










34 5 










350 






Thr 


L'/K 


Lys 


Aep 


Leu 


Val 


Phe 


Lys 


Hie 


Cys 


lie 


Gly 


lie 


Pro 


Ala 


Leu 






355 










360 










365 








Ala 


Tyr 


A5P 


Lys 


Glu 


Gly Asp Arg Tyr 


Leu 


Thr 


Thr 


Phe 


Gin 


Asp 


His 




370 










375 










380 










Am 


Leu 


llet 


Arg 


Tyr Arg 


Xrp 


lie 


Lys 


Tyr 


A.*jp 


Asp 


lie 


Asn 


Ser 


Phe 


3CS 










390 










395 










400 


Tyr 


Gly 


Trp 


Ser 


Trp 


Pro 


Tyr 


Val 


Tyr 


Ala 


Lys 


Glu 


Ala 


Lys 


Asp 


Lys 










4 05 










410 










415 






Arg 


Arg 


Arg 


Pro 


Gin 


Val 


Ala 


Leu 






Thr 






Lys 


Ala 








420 










425 










4 30 






Cye 


Trp 


Val 


Trp 


His 


Thr 


Arg 


Lys 


Ser 


Pro 


Tyr 


Asp 


Glu 


Thr 


Lys 


Pro 






435 










440 










445 








His 


Pro 


Thr 


Pro 


Val 


lie 


lie 


Lys 


His 


Phe 


Leu 


Trn 


Ser 


Asp 


Thr 


Glu 




450 










455 










460 










Trp 


Val 


His 


Ala 


Leu 


Asp 


Val 


Gly 


Asp 


Val 


Leu 


Gin 


Lys 


Gl'J 


Gly 


Ser 


4 65 










470 










475 










460 


Met 


Lys 


Luu 


Tyr 


Pro 


Asn 


Pro 


Ala 


Lye 


Glu 


Tyr 


Val 


Leu 


He 


Asn 


Leu 










485 








■190 










495 




Pre 


Ly.' 


Glu 


Gly 


Gly 


His 


Glu 


Ala 


Val 


Val 


Tyr 


Asp 


Het 


Gin 


Gly 


Arg 








500 










505 










51 C= 






lie 


Val 


Glu 


Lys 


Val 


Ser 


Phe 


Ser 


Gly 


Lyr 


Glu 


Tyr 


Lys 


Leu 


Asn 


Val 






515 










520 










525 








Gin 


Tyr 


Leu 


Ser 


Lys 


Gly 


Thr Tyr 


Met 


Leu 


Lye 


Val 


Val 


Als 


Asp 


Thr 




530 










535 










540 










Glu 


Tyt 


Phe 


Val 


Glu 


Lys 


lie 


lie 


Val 


Glu 














545 










550 






















(2) 


IN FORMAT I OH 


FOR 


SEQ 


ID NO: 390 

















ti) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 550 amino acids 
{D) TYPE: amino acid 
(D) TOPOLOGY: linear 



fii> MOLECULE TYPE: protein 
(iiil HYPOTHETICAL: YES 

tvi) ORIGIIIAL SOURCE: 

(A) ORGAJilSiM: Porphyroroonas gingivalic 

(ix) FEATURR: 

lA) HAIIE/KEY: misc^feature 

(B) LOCATION 1. . .550 

iy.i) SEQUENCE DESCRI FTIOII : SEQ ID HO: 3 90 
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Met Ly? l.cu Lys He Ala Leu Arg Lea Leu Leu Ala Thr Phe Ala Tl- 

Vai Leu PC- Ser Pro Leu Ala Lys Ala Gin M-t Asp He Gly Gly Asp 

25 30 
Asp Val Lou Tlo Rlu Thr Mec Ser Thr Leu Ser Gly Tyr Ser Glu Asp 

Phe Tyr Tyr Lys Met Ala Val Ala Asp A.-n Gly Trp iL Tyr Val Het 

5b 60 
Leu Asp Phe Ser Arg Tie Tyr fhe A^p Asp Val Arg Leu Tyr Arg Sor 

"^^^ 75 80 

Lys Asp Gly Gly Ala The Tyr Gin Lys I.eu Gly Ser Leu Gly Ser Leu 

95 ao 95 

Val Pro Tyr Asp Phe A.c-p Val b'er His Cys Asp Phe He Val Thr Gly 
100 

hys Asp Glu Asp Asp Xle Asn Val Trp Thr Val Met Thr Ala Phe Glu 

1*' 120 125 

Tyr Val Gly Gly Thr Tie Gly Asn Gly Val Leu Leu Met His Arg His 

130 135 
Asp Ala Asp lie Asn Asn Thr Glu Cys Val Tyr T.ys 7,y.<, Asp Phe Pro 
i . . 155 160 

Asn Asn Arg Leu Met Gly Val Ala He Ala Ser Asn Tyr Arg Ala Pro 

165 175 
Ser Pre Tyr Gly Leu Gly Gly Asp Pro Phe AJ a I.eu Ala Val Ala Val 

ISO 185 190 

Ser Gly Ser Gly Ser Asp His Ser Phe Leu Asp Tyr He Phe Ser Lea 

200 205 
Asp Gly Gly Val His Phe Glu Gin Lys Arg He Tyr Thr Arg Pro Gin 

215 220 
Lys Leu Thr He Asn Arg Val Asp Leu Ser Leu Gly Ser Thr S-r Pro 
230 235 240 

Ser Leu Gly Phe Asn Thr Trp Pro T.eu Met Gly Val Val Phe Glu M-t 

245 250 255 

Asn Lys Asn Leu Asp Gly Phe Asp He Gly Phe He Scr Asn Ph* Val 
n ^ . l^^ 265 270 

Asp Tyc Asp Pro Arg Tyr Ala Trp Ser Glu Pro Tie He He Glu Glu 
275 230 285 

^ nln '^"^ '^^'^ P^*^ Gly Ala Leu Ser He Glu 

290 295 300 

He Gin Het Met Leu Asp Asp Asn Ser Asp Asn Thr Val Gly Gly Glu 
310 315 320 

Arg Scr His Asn Phe Leu He Thr Tyr Pro Gly His Tyr Val Tyr Pro 

325 330 335 

Lys Gin Ser Phe Asn Tyr Ser Pro Gly His Thr Pro Thr Lys Lys Asp 

34S 350 
Leu Va. Pho Lys Hi a Cys He Gly He Pro Ala Leu Ala Tyr Asp Lys 

355 360 365 

Glu Gly Asp Arg Tyr Leu Thr Thr Phe Gin Asp His Asn Leu Met Arg 

^ 375 3B0 

Tyr Arg Trp TIr Lys Tyr Asp Asp He Asn Ser Phe Tyr Gly Trp S-r 
385 390 395 ^ 

arp Pre Tyr Val Tyr Ala I.ys Glu Ala Lys Asp Lys Lys Arg Arg Arg 

405 410 4ijj 

Pro Gin Vsl Ala Leu Asn Pro Thr Asn Gly Lys Ala Cys Trp Val Trp 

420 425 430 

His Thr Arg Lys Ser Pro Tyr Asp Glu Thr Lys Pro His Pro Thr Pro 
435 440 

III ^^'^ "^^ "^^P Thr Glu Trp Val His Ala 

450 4S5 4S0 

Leu Asp Val Gly Asp Val Leu Gin Lys Glu Gly Ser Met Lys Leu Tyr 

470 475 480 

Pro A-n Pro Ala Lys Glu Tyr Val Leu He Asn Leu Pro Lys Glu Gly 



485 



490 495 



Gly His Glu Ala Val Val Tyr Asp Het Gin Gly Arg He Val Glu Lys 

500 505 510 

Val Ser Phe Ser Gly Lyt Glu Tyr Ly^ Leu Asn Val Gin Tyr Leu Ser 

-•-5 520 525 

Lys Gly Thr Tyr Met Leu Lys Val Val Ala Asp Thr Glu Tyr Phe Val 
535 54 0 

Glu Lys He ile Val Glu 
545 550 

(2) IIIfORJIATIOtl FOR SEQ ID MO: 391 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




P«h 1!^" ri*"!* «« "=^> .tS'"^*' Tf^ '^"^^ 

's3f r^j Oi .jijj^^ o ai iui D ic::; =o iui juj 
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ti) SEOUEHCE CHAKACTERISTICS: 

(A) LENGTH: 390 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<li» nOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(Vil ORTOTHAL SOURCE: 

(A) ORGAMIStl: Porphyromonas yingivalis 

{ 1 X ) FEATURE : 

(A) HATir/KEY: inisc_f ea turs 

(B) LOCATIOil 1. . . 390 

ir.i) SEQUEtiCE DESCRIPTION: SBQ ID [rO:391 

H»L Lys Arg r.r-u Leu Pro ehe Leu Leu Leu Ala Gly T.»u Val Ala Val 

i S 10 IS 

Gly Ajn Val Ser Ala Gin Ser Pro Arg He Pro Gin Val Asp Val Uxs 

20 25 30 

Thr Ary He Ala Arg Asn Ala Arg Tyr Arg Leu Acp Lys He Ser Val 

35 40 45 

Pro Asp Ser Arg ain He Phe Asp Tyr Fh« Tyr Lys Glu Glu Thr He 

50 55 60 

Pro Thr Lyr Tie Gin Thr Thr Thr Gly Gly Ala H- Thr Ser He Asp 

70 75 00 

Ser Leu Phe Tyr Glu Asp Asp Arg Leu Val Gin Val Arg Tyr Phe Asp 

□5 90 95 

Asn Asn Leu Glu Leu Lys Gin Ala Glu Lys Tyr Val Tyr Asp Gly Ser 

100 105 110 

Lys Leu Val Leu Ary Glu Tie Arg Lys Ser Pro Thr Asp Glu Thr Pro 

lis 120 125 

He Lys Ly^ Val Ser Tyr His Tyr Leu Cys Gly Ser Asp Met Pro Phe 

130 135 140 

Glu lie Thr Thr Glu Met Ser Asp Gly Tyr Phe Glu Ser His Thr Leu 
1^5 150 155 160 

Asn Tyr Leu Asn Gly Lys He Ala Arg He Asp He Met Thr Gin Gin 

165 170 175 

Asn Pro Ser Ala Glu Leu He Glu Thr Gly Arg Met Val Tyr Glu Phe 

180 185 190 

Asp Ali Asn Asn Asp Ala Val Leu Leu Arg Asp Ser Val Phe Leu Pro 

195 200 205 

I.5U Gin Asn Lys Trp Val Glu Met Phe Thr His Arg Tyr Thr Tyr Asp 

210 215 220 

Asn Lys His Asn Cys He Arg Trp Glu Gin Asp Glu Phe Gly Thr Leu 
2-5 230 235 2J0 

Thr Leu Ala Asn Asn Phe Glu Tyr Asp Thr Thr lie Pro Leu Ser Scr 

2«I5 250 255 

Val Leu Pho Pro Thr His Glu Glu Phe Pho Arg Pro Leu Leu Pro Asn 

260 25b 270 

Fhe II*c Lys His Het Arg Thr Lys Gin Thr Tyr Phe Asn Asn Ser Gly 

275 280 285 

Glu Gly Leu Ser Glu Val Cys Asp Tyr Asn Tyr Phe Tyr Thr Asp MeL 

290 295 300 

Gin Gly Asn Ala Leu Thr Asp Val Ala Val Acn Glu Ser He Lys He 
305 310 315 320 

Tyr Pro Arg Pro Ala Thr Anp Phe Leu Arg He Glu Gly Ser Gin Leu 

325 330 335 

Leu Arg Leu Scr r.eti Phe Asp Met Asn Gly Lys Leu He Arq Ala Thr 

34 0 345 35.0 

Glu Leu Thr Gly Asp Leu Ala He He Gly Val Ala Ser Leu Pro Arc 

355 360 365 

Gly Thi Tyr lift Ala Glu He Thr Ala Ala Asn Ser Lys Thr He Arg 

370 37S 380 

Ala r.y." Val Ser Leu Arg 



(2) IliFOWlATIOM FOR SZQ ID HO: 3 92 

U) SEQUEIJCE CHARACTERISTICS: 

(A) LKHGTH: 411 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



385 



390 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ti^^i tl^ti IF«" iP^'-ii ail ""^^ (P=ii ..I*" -n™" *^'^> -i*^' 
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tii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl } ORIGIIIAL SOURCE: 

(Al ORGAH7SH: Porphyromonas gingiva lis 

(i::) FEATURE: 

(A) NAJIE/KEY: misc_feaLure 
IB) LOCATION 1 ... 411 

l.xl) SEQUEMCE DESCRlPTIOl^: SEQ ID MO: 392 

Met Arg Gin His Leu Ser I.eu Ph- Pro Phe lie Leu Phe Leu Leu L-u 

^ 10 15 

Ala Phe Ser Tyr Val Gly Cys Arg Thr Val Arg Gin Thr Pro Lys Gin 

2C 25 30 

Ser Glu Arg Tyr Val Vol Val Leu Ser Leu Asp Gly Phe Arg Pro Asp 

3S 40 45 

Tyr Thr Asp Arg Ala Arg Thr Pro Ala Leu Asp Arg Met Ala Gin Glu 

5^ 55 en 

Gly Leu Ser Gly Ser Leu Gin Pro Cys Phe Pro Scr Leu Thr Ph» Pro 

^- 70 75 00 

Asn His Tyr Ser Met Ala Thr Gly Leu Tyr Fro Asp His His Gly II- 

85 90 Q5 

Val Ala /V?ri ^lu Phe Val Asp Ser Leu Leu Gly H- Ph- Arg II- S*r 

105 no 

Asp Arq Lys Ala Val Glu Thr Pro Gly Phe Trp Giy Gly Glu Pro Val 

lis 120 125 

Trp Asn Thr Ala Ala Arg Gin Gly He Arg Thr Gly Val Tyr Phe Trp 

^•^^ 135 140 

Val Gly Ser Glu Thr Ala Val Asn Gly Asn Arg Pro Trp Arg Trp Lys 
145 ISO ^ ^J- 

Lys Phe Ser Ser Thr Val Pro Phe Arg Atp Arg Ala Asp Ser Val 11- 

165 170 175 

Ala Trp Leu Gly Leu Pro Glu Lys Glu Arg Pro Arg Leu Leu Met Trp 

180 185 190 

Tyr lie Glu Glu Pro Asp Met He Gly His Ser Gin Thr Pro Glu Ser 

195 200 205 

Pro Leu Thr Leu Ala Met Val Glu Arg Leu Asp Ser Val Val Gly Tyr 

215 220 
Phe Ary Lys Arg F.eu Asp Ser Leu Pro He Ala Ala Gin Thr Asp Phe 
230 235 240 

Tie He Val Ser Asp His Gly Met Ala The Tyr Glu Asn Glu Lys Cyr 

245 250 255 

Val Asn Leu Ser HI.- Tyr Leu Pro Ala Asp Scr Phe Leu Tyr M-t Aia 

2^0 265 P70 

Thr Gly Ala Phe Thr Hi^ Leu Tyr Pro Lys Pro Ser Tyr Thr Glu Arq 

280 285 

lin ^'-"^ ^"'^ Ser Val Tyr Arg Lys 

290 225 300 

Gly Glu Val Pro Lys Arg Leu Arg Cys Gly Thr Asn Pro Arg Leu Gly 
^•'^ 310 315 320 

Glu Leu Val Val He Pro Asp He Gly Ser Thr Val Phe Phe Ala II- 

325 330 335 

Asn Glu Asp Val Arg Pro Gly Ala Ala His Gly Tyr Asp Asn Gin Ala 

3**0 34S 350 

Pro Glu Met Arg Ala Leu Leu Arg Ala Val Gly Pro Asp Phe Axg Pro 

355 360 365 

Gly Ser Arg Val Glu Asn Leu Pro Asn He Thr He Tyr Pro Leu He 

3't> 375 
Cys Arg Leu Leu Gly He Glu Pro Ala Pro Asn Asp Ala Asp Glu Thr 
? , 39*^ 395 400 

Leu Leu Asn Gly Lnu He Arg Asp Lys Arg Pro 
405 4i3 

(2) IMFORIIATTOTJ FOR SEQ ID HO: 393 

(i) SEOUEUCE CHARACTERISTICS: 

(A) LEIIGTH: 24 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) 1K5LECULE TYPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



iUil ^il Oi O la li«H O ^ Oi iUti ilJi 



10 



15 
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liiil HYPOTHETICAL: YES 

(vi ) ORIGINAL SOURCE: 

tA) ORGAIilSH: f orphyromonas glngivalis 

(ix) FEATURE: 

(A) NAHK/KEY: mlscf eature 

(B) LOCATION 1. . .24 6 

(xi) SEQUEIICE DESCRIPTION: SEO ID MO; 393 

Met hys Val Gly Leu Ph= He Pro Cys Tyr Val Asn Ala Val Tyr Pro 
IS 10 15 

Glu Val Gly He Ala Thr Tyr Lys Leu Leu Lys Set Leu Atp He Asp 

20 25 30 

Vai Asp Tyr Pro Met Asp Gin Thr Cys Cys Gly Gin Pro Met Ala Asn 

35 40 4S 

Ala Gly Phs Glu Gin Lys AJ a Gin Lys Leu Ala Leu Arg Phe Glu Glu 
20 50 55 60 

Leu Phe Glu Sor Tyr Asp Val Val Val Gly Pro Ser Ala Ser Cys- Val 

70 75 80 

Ala Phe Vai Lys Glu A/^n Tyr Asp His He Leu Arg Pro Thr Gly Hi^ 

«5 9'-^ 95 

Val Cyc Lys Ser Ala Ala Lys Val Arg A^p He Cys Glu Phe Leu Hi <c 

100 105 2.10 

A^p As^p Leu Lys He Thr Ser Leu Pro Ser Arg Phe Ala Ills Ly- Val 

115 120 125 

Ser Leu His Asn Ser Cys Hie Gly VaJ Arg Glu Leu His Leu Ser Thr 
30 1:30 135 140 

Pro Ser Glu Val His Arg Pro Tyr His Asn Lys Val Arg Arg Leu Leu 
145 150 155 

Glu Mrt Val Gin Gly He Glu Val Phe Glu Pro Lys Arg He Asp Glu 

170 175 
Cys Cys Gly Phe Gly Gly Met Tyr Ser Val Glu Glu Pro Glu Val Ser 

100 105 190 

Thr Cys riot Gly ills Asp Lys Val Leu Asp His He Ser Thr Gly Ala 

195 200 205 

Glu Tyr He Thr Gly Pro Asp Ser Ser Cys Leu list: His Met Gin Gly 
'^O 210 215 220 

Val He Asp Arg Glu Lys Leu Pro He Lys Thr He His Ala Val Glu 
225 230 240 

Tl*i Leu Ala Ala Asn Leu 
24S 



45 



65 



70 



75 



(2) IHFORIIATION FOR SEQ ID NO; 394 



(1) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 24 6 amino acids 
50 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(J II MOLECULE TYPE: protein 

55 (Hi) HYPOTHETICAL: YES 

(vij ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyromonas gingiva lis 

60 {ixl FEATURE: 

{A) HAHE/KEY: mlsc feature 

(B) LOCATION I...2T6 



(xi) SEQUENCE DESCRIPTION: SEO ID NO: 394 

Met Asp He Val Ser Met Ala Asp Lys Ala Leu Val Val Glu Met Arg 

^ 5 10 15 

Asp Val Thr Leu Cys Gin Glu Glu Asn Val He Phe Gin Asn Leu Asn 

20 25 • 30 

Leu Thr Leu Ser Ala Gly Asp Phe Val Tyr Leu He Gly Scr Val G1-- 

36 40 45 

Ser Gly Lys Ser Thr Leu Leu Lys Ala Leu Tyr Ala Glu Val Pro II « • 

SO - 55 - ' 60 ' 

Ser Ala Gly Tyr Ala Arg Val He Asp Tyr Asp Leu Ala Lys Leu L -s 
6S 70 75 00 



SUBSTITUTE 



SHEET (Rale 26) (RCVAU) 
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Arg Ly« Gin Leu Pro Tyr Leu Arg Arg A^n Leu Gly lie Val Phe CXn 

8S 90 gc 

Asp Phe Gin Leu Leu Asn Gly Arg Thr Val Ala Glu A£n Lou Asp Phe 

105 no 
Val Leu Arg Ala Thr Asp Trp Lys Asn Arg Ala Asp Arg Giu Gin Arg 

^^-^ 120 125 

lie Glu Glu Val Leu Thr Arg Val Gly Met Ser Arg Lys Ala Tyr Lys 

Arg Pro His Glu Leu Ser Gly Gly Glu Gin Gin Arg Val Gly He Ala 
^^^^ 1-^5 160 

Arg Ala Leu Leu Ala Lys Pro Ala Leu He Leu Ala Asp Glu Pro Thr 

Gly Asn Leu Asp <jer Val Thr Gly Leu Gin He Ala Ser Leu Leu Tyr 

r^. ^® 1^0 

Glu He Ser Lys Gin Gly Thr Ala Val Leu Met Ssr Thr Ills Asn Ser 

^-^ 200 205 

Ser Leu Leu Ser Mis Leu Pro Ala Arg Thr Leu Ala Val Arg Lys Asn 

Gly Asp Ala Ser Ser Leu Val Glu Leu Ser Al^ A^p Ala Val Ser Arg 



225 230 
Lyc Acn Thr Glu He Asp 



23b 240 



(2 J INfORllATIOd FOR SEQ ID HO: 3 95 

(i) SEQtJEIICE CHARACTERISTICS: 

(A) LEiiGTH: 241 amino acids 

(B) TYPE: amino acid 
tD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAMISM: Porphyromonas gingival is 

iiy.) FEATURE: 

(A] NAME/KEY: mlsc feature 

(B) LOCATION 1. . .241 

iy.i) SEQUENCE DESCRIPTION: SEQ ID WO: 395 

Met Ala Asp Lys Ala Leu Val Val Glu Met Arg Asp Val Thr Leu Oys 

Gin Glu Glu Asn Val He Phe Gin A.;n ^eu Asn Leu Thr Leu Ser Ala 

Gly Asp Phe Val Tyr Lou He Gly Ser Val Gly Ser Gly Lys Ser Thr 

40 ,j5 
Leu Leu T.ys Ala Leu Tyr Ala Glu Val Pro He Ser Ala Gly Tyr Ala 

Arg Val He Asp Tyr Asp Leu Ala Lys Leu I.ys Arg Lys Gin Leu Pro 

Tyr Leu Arg Arg Asn Leu Gly He Val Phe Gin A.= p Phe Gin Leu L-u 

85 9Q 
Asn Gly Arg Thr Val Ala Glu Asa Leu Asp Phe Val Leu Arg Ala Thr 

ICS 110 
Asp Trp Lys Asn Arg Ala Asp Arq Glu Gin Arg He Glu Giu Val L-u 

IJS 120 12'" 

Thr Axg Val Gly Met Ser Arg Lys Ala Tyr Lys Arg Pro His Glu Leu 

Ser Gly Gly Glu Gin Gin Arg Val Gly He Ala ^ry Ala Leu Leu Ala 

Lys Pro Ala Leu He Leu Ala Asp Glu Pro ^hr Gly Asn Leu Asp Ser 

Val Thr Gly Leu Gin He Ala Ser Leu ^eu Tyr Glu He Ser Lys Gin 

Gly Thr Ala Val Leu Met Ser Thr Us Asn Ser Ser Leu L^u S^, Hi- 

200 - 205 

Lou Pro Ala Arg Thr Leu Ala Val Arg Lys Asn Gly A,^p Ala Ser Ser 

Leu val Glu Leu Ser Ala Asp Ala Val Ser Arg L^s Asn Thr Glu 11- 
230 

Asp 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



P-Hi ^P«i^ il^' ^^'h „B ^^ii ii^*' i'SL i'^^ -T^ f% 

LJji '^Sli Q .«.n« n:« a3 ^Ci* m iUv O -a^ ii-jJ il-fi 



10 



15 



65 



70 



75 
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(2) INFORHATIOIi FOR SEQ ID NO: 396 

( i ) SEQUEl JCE CHARACTERI STICS : 

(A) LENGTH ; 232 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi» ORIGINAL SOURCE: 

(A) ORGAJ'IISH: Porphyromonas gingivalls 

(ix) FEATURF.: 

{A) UAME/KEY: m1.?c_ feature 

(B) LOCATION 1. . .232 



20 (xl ) SEQlfEHCE DECCRIPTIOM: SEQ ID NO:396 



25 



Met 


Arg 


Asp 


Val 


Thr 


Leu 


Cy.o 


Gin Glu 


Glu 


Asn 


Val 


lis 


Phe 


Gin 


Asn 


1 








5 








10 










15 




Leu 


Af?n 


Leu 


Thr 


Leu 


Ser 


Ala 


Gly Asp 


Phe 


Val 


Tyr 


Leu 


He 


Gly 


Ser 








20 








25 










30 




Val 


oly 


Ser 


Gly 


Ly^ 


Ser 


Thr 


Leu Leu 


Lys 


Ala 


Ltiu 


Tyr 


Ala 


Glu 


Val 






35 










40 








45 








Pro 


lie 


Ser 


Al.i 


Gly 


Tyr 


Ala 


Axg Val 


He 


Asp 


Tyr 


A^p 


Leu 


Ala 


Lys 














55 








60 






Leu 


Lys 


Arg 


Lys 


Gin 


Leu 


Pro 


Tyr Leu 


Axg 


Arg 




Leu 


Gly 


Tic 


Val 


6£ 










70 








75 








80 


Phe 


Gin 


Asp 


Ph^i 


Gin 


Leu 


Leu 


Asn Gly Arg 


Thr 


Val 


Ala 


Glu 


Asn 


Leu 










85 








90 










95 




Asp 


Phe 


Val 


Leu 


Arg 


Ala 


Thr 


Asp Trp 


Lys 


Asn 


Arg 


Ala 


Asp Arc 


Glu 








100 








105 










110 






Gin 


Arg 


He 


Glu 


Glu 


Val 


Leu 


Thr Axg 


Val 


Gly Met 


Ser 


Arg 


Lys 


Ala 






115 










120 








125 




Tyr 


Lys 


Arg 


Pro 


Hi±j 


Glu 


Leu 


Ser Gly 


Gly 


Glu 


Glu 


Gin 


Arg 


Val 


Gly 




130 










135 








140 






lie 


Ala 


Arg 


Ala 


Leu 


Leu 


Ala 


Lys Pro 


Ala 


Leu 


Tie 


Leu 


Ala 


Asp 


Glu 


145 










150 








155 








160 


Pro 


Thr 


Gly Asn 


Leu 


Asp 


Scr 


Val Thr 


Gly 


Leu 


Gin 


He 


Ala 


Ser 


Lou 










165 








170 










175 




Leu 


Tyr 


Glu 


He 


Ser 


Lys 


Gin 


Gly Thr Ala 


Val 


Leu 


net 


Ser 


Thr 


His 








180 








185 










190 






Asn 


Ser 


Ser 


Lsu 


Leu 


Scr 


His 


Leu Pro 


Ala 


Axg 


Thr 


Leu 


Ala 


Val 


Arg 






195 










200 








2 05 






Lys 


fKsn 


Giy 


Asp 


Ala 


Ser 


Ser 


Leu Val 


Glu 


Leu 


Ser 


Ala 


Asp 


Ala 


Val 




210 










215 








220 






Ser 


Arq 


Lys 


Asn 


Thr 


Glu 


Ho 


A3P 
















225 










230 


















(2> 


IIIFORIIATION 


FOR 


SEQ 


ID HO: 397 

















30 



35 



40 



45 



50 



55 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

60 (11) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 



(vl) ORIGINAL SOURCE; 

(A) ORGATJISM: Porphyromonas gingival is 

(ix) FEATURE; 

(A) NAflE/KEY: nilsc_f eature 

(B) LOCATION 1. . . 219 

<::i> SEOUEIICE DESCRIPTION: SEQ ID NO: 397 

Met He Glu He Ser Asn Leu Thr Lys Val Phe Axg Thr Glu Glu He 

^5 10 15 

Glu Thr Val Ala Leu Asp Gly Val Ser Leu Lys Val Asp Lys Gly Glu 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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20 






25 










30 








Phe 


li- 


Ala 
35 


He 


MeC Gly Pro 


Ser 
40 


Gly 


Cys 


Gly 


Lys 


Ser 
45 


Thr 


Leu 


Leu 




Affn 


ne 


Leu 


Gly 


Leu Leu /\sp 


Asn 


Pro 


Thr 


Ser 


Gly 


lie 


Tyr 


Lys 


Leu 


5 




50 






55 










60 












Arp 


Gly 


Ala 


Glu 


Val Gly Pi£n 


Leu 


Arg 


Glu 


Lys 


Asp 


Arg 


Thr 


Ala 


Val 




65 








10 








75 










30 




Arg 




Gly 


A^n 


He Gly Fhe 


Val 


Phe 


Gin 


Ser 


Phe 


Asn 


Lsu 


He 


Glu 


10 










85 






90 










95 




Glu 


Met 


Thr 


Val 
100 


Ser Glu Aen 


Val 


Glu 
105 


Leu 


Pro 


Leu 


Val 


Tyr 
110 


Leu 


Giy 




Val 


Lys 


Ala 
115 


Ser 


Glu Arg Lys 


Glu 
120 


Arg 


Val 


Glu 


Glu 


Ala 
125 


Leu 


Arg 


Lys 




rist 


Sec 


He 


Ser 


His Arg Ala 


Gly 


His 


Phe 


Pro 


Asn 


Gin 


Leu 


Ser 


Gly 


15 










135 










140 












Gly 


Gin 


Gin 


Gin 


Arg Val Ala 


He 


AJa 


Arg 


Ala 


Val 


Val 


Ala 


Asn 


Pro 




145 








150 








155 










160 




Ly.? 


Leu 


He 


Leu 


Ala A£?p Glu 


Pro 


Thr 


Gly 


Asu 


Leu 


Asp 


S^r 


Lys 


A5n 










165 






170 










175 




20 


Gly 


Ala 


Arp 


Val 
160 


Met Glu Leu 


Leu 


Arg 
165 


Gly 


Leu 


Asn 


Arg 


Glu 
190 


Gly 


Ala 




Thr 


He 


Val 
195 


Met 


Va) Thr Hi 5 


Ser 

200 


Glu 


His 


Asp 


Ala 


Arg 
205 


Ser 


Ala 


Giy 




Arg 


He 


He 




Leu Phe Asp 


Gly 


Lys 


He 


Arg 












25 




210 






215 





















t2) ItlFOWlATIOH t'OR SEQ ID MO: 398 

(ij SEQL'EHCF. CHARACTERISTICS: 
30 I A) LEHGTH: 595 amino acids 

(E) TYPE: amino acid 
(D) TOPOLOGY: linear 



35 



40 



55 



60 



65 



70 



75 



(li> llOLECULE TYPE: protein 

(iii> HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(AJ ORGAMISM: Porphyromonas gingiva lis 

( Ix ! FEATURE : 

(A) HAIIE/KEY: mi£c_£eature 
(B» LOCATION 1 . . .595 



45 (xij SEQUEIICE DESCRIPTIOW: SEQ TD lJO:3 98 



50 



Met 




Glu 


Phe 


Phe 


Lys 


list 


Phe 


Phe 


Ala 


Ser 


He 


Leu Gly 


Val 


He 










5 










10 










15 




Thr 


Ala 


Gly 


He 


He 


Leu 


Phe 


Cys 


lie 


Phe 


Leu 


Phe 


He 


Phe 


Phe 


Gly 








20 










25 










30 






He 


Val 


Ala 


Gly 


He 


Ala 


Ser 


Lys 


Ala 


Thr 


Gly 


Gly 


Thr 


He 


Pro 


Lys 






35 










40 










45 








He 


Glu 


A] a 


Asn 


Ser 


He 


Leu 


His 


He 


Unk 


Asn 


Ser 


Ser 


Phe 


Pro 


Glu 




SO 










55 










60 










I] » 


Val 


Ser 


Ala 


Afc'ti 


Pro 


Trp 


Sec 


Met 


Leu 


Thr 


Gly 


Lys 


Afp 


Glu 


Sar 


65 










70 










75 










80 


Val 


Ser 


Leu 


Ser 


Gin 


Ala 


Val 


Glu 


Ala 


He 


Gly 


Gin 


Ala 


Lys 


Asn 


Asn 










BS 










9C 










95 




Pro 


Affn 


He 


Thr 


Gly 


He 


Ph- 


Leu 


Asp 


Leu 


Asp 


A£rn 


I.eu 


Ser 


Val 


Gly 








100 










105 










110 






Mfet 


Ala 


Scr 


Ala 


Glu 


Glu 


Leu 


Arg 


Arg 


Ala 


Leu 


Gin 


Asp 


Pile 


Lye 


Hot 






115 










120 










125 








set 


Gly 


hys 


Phe 


Val 


Val 


Ser 


Tyr 


Ala 


Asp 


Arg 


Tyr 


Thr 


Gin 


Lys 


Gly 




130 










135 










140 










Tyr 


Tyr 


Leu 


Ser 


Sec 


He 


Ala 


Asp 


Lys 


Leu 


Tyr 


Leu 


Asn 


Pro 


Lys 


Gly 


145 










150 










155 










160 


Met 


Leu 


Gly 


Leu 


He 


Gly 


He 


Ala 


Thr 


Gin 


Thr 


Met 


Phe Tyr 


Lys 


Asp 










165 










170 










175 




Ala 


Leu 


Asp 


Lys 


Phe 


Gly 


Val 


Lyff 


Met 


Glu 


He 


Phe 


Lys 


Val 


Gly 


The 








180 










185 










190 






Tyr 


Lys 


Ala 


Ala 


Val 


G] u 


Pro 


Phe 


Met 


Leu 


Asn 


Arg 


Met 


Ser 


Asp 


Alo 






155 










200 










205 








Asn 


Arg 


Glu 


Gin 


He 


Thr 


Thr 


Tyr 


He 


Asn 


Gly 


Leu 


Trp Asp 


Lys 


Hs 




210 










215 










220 










Thr 


Ser 


Arp 


Hs- 


Ala 


Glu 


Ser 


Arg 


Lys 


Thr 


Ala 


Met 


Asp 


See 


Val 


Ly-T 



PCT/AU98/010Z3 
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22S 230 



235 240 



Met Phe Ala Airp Lys Gly Glu Met Phe Gly Leu Ala Glu Lys Ala Val 

24b 250 255 

Glu Met Lys Leu Val Asp Glu Leu Ala Tyr Arg Thr Asp Val Glu Lvs 

260 265 270 

Glu Leu Lys Lys Met Ser Gin Arg Gly Glu Lys Asp Glu Leu Arn Ph- 

2''5 280 285 

Val Sor Leu Ser Gin Val Leu Ala Asn Gly Pro Met Asn Lys Thr Lvs 

290 295 300 

Gly Ser Arg He Ala Val Leu Phe Ala Glu Gly Glu lie Thr G3 u Glu 
310 315 320 

lie Tie Lys Lys Pro Phe Asp Thr Asp Gly Ser Ser He Thr Gin Glu 

•^25 330 
Leu Ala Lys Glu lis Lys Ala Ala Ala A^p Asp Asp Asp IJe Lys Ala 

340 345 
Val Val I.eu Arg Val Asn Ser Pro Gly Gly Ser Ala Phe Thr Ser Glu 

355 360 365 

Gin He Trp Lys Gin Vol Ala Asp Leu Lys Ala Lys Lys Pro T1-' Val 

370 375 380 

Val Ser Het Gly Asp Val Ala Ala S-r Gly Gly Tyr Tyr He Ala Cy^ 
385 350 395 4OO 

Ala Ala Asn Ser He Val Ala Glu His Thr Thr Leu Thr Gly Ser II- 

40b 410 415 

Gly n- Phe Gly Het Phe Pco Asn Phe Ala Gly Val Ala Lys Lys He 

'»25 430 
Gly Val Asn Met Asp Val Val Gin Thr Ser Lys Tyr Ala A.-p L-u Glv 

4j5 440 445 

Asn Thr Phe Ala Pro Het Thr Val Glu Asp Arg Ala Leu He Gin Ara 

450 455 460 

Tyr He Giu Gin Gly Tyr Asp Leu Phe Leu Thr Arg Val S-r Glu Gl- 
470 4 ,5 ^^f, 

Arg Asn Arg Thr Lys Ala Gin He Asp Ser He Ala Gin Gly Arg Vdl 

485 490 495 

Trp Leu Gly Asp Lys Ala Leu Ala Leu Gly Leu Val Asp GMi Leu Glv 

SOO 505 - 510 

Gly Leu Asp Thr Ala He Lys Arg Ala Ala Lys Leu Ala Gin Leu Glv 

S15 520 525 

Gly Asn Tyr Ser He Glu Tyr Gly Lys Thr Lys Arg Asn Phe Phe Glu 

530 535 540 

Glu Leu Leu Ser Ser Ser Ala Ala Asp Met Lys Ser Ala He Leu Ser 
550 S55 ^^(j 

Thr He Leu Ser A.->p Pro Glu He Glu Val Leu Arg Glu Leu Arg Ser 

565 570 575 

Het Pro Pro Arg Pro Ser Gly He Gin Ala Arg Leu Pro T-r Tyr Ph- 
50C> 585 590 

Met Pro Tyr 

595 

(2) INFORJIATIOrJ fOR SEQ ID HO: 399 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEJJGTH: 589 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOCr^: linear 

(Jl) MOLECUUl TYPE: protein 

fiii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

I A) ORGAT^ISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/ KEY: misc_feature 

(B) LOCATIOII 1, . , 589 

txi) SEQUENCE DRSCRIPTlOtl : SEQ ID MO: 399 

net Phe Ph* Ala Ser He Leu Gly Val He Thr Aia'Gly He Ho Leu 

5 10 15 

Phe Cys He Pho Leu Phe He Phe Phe Gly He Val Ala G1-- n- Ala 

20 2b 3-^ 

Ser Lys Ala Thr Gly Gly Thr He Pro Lys He Glu Ala P^n Ser He 



35 



Leu His He tink Asn Ser Ser Ph 



40 45 



Pro Glu He Val Ser Ala Asn Pro 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 



-vP^h -i?^''-' ti^'h r*H ="^1 ri=Hi iF^i -rtf-^ ";^! -^Pfii -ti'"h 

"*^if- rr'ji in^i o nj :u -a ilj3 
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60 

Trp 3-r tlst Leu Thr Gly Lys Asp Glu Ser Val Ser Leu Ser Gin Ala 



65 7 0 



75 80 



Val Giu Ala lie Gly Gin Ala Lys Asn Asn Pro A^-n Tic Thr Gly He 

85 90 95 

Phe Leu Asp L^iu Asp Asn Leu Ser Vai Gly Met Ala Ser Ala Glu Glu 

100 105 no 

Lf^n Arg Arg Ala Leu Gin Asp Phe Lys Hct Ser Gly I.y.s Phe Val Val 

115 120 125 

Ser Tyr Ala Atp Arg Tyr Thr Gin Lys Gly Tyr Tyr Leu Ser Ser 11- 

. 135 140 
Ala Asp Lys Leu Tyr Leu Asn Pro Lys Gly M-t. L-u Gly Leu He Glv 
^'^^ iSO 155 j^gj 

He Ala Thr Gin Thr Met Phe Tyr Lys Asp Ala Leu Asp Lys Phe Gly 

165 170 175 

Val Lys liet Glu He Phe Lys Val Gly Thr Tyr Lys Ala Ala Val Giu 

If^O 185 190 

Pro Phe MeL Leu Asn Arg Met Ser Asp Ala Asn Arg Glu Gin II- Thr 

200 205 
Thr Tyr He Asn Gly Leu Trp Asp Lys He Thr s-r Asp He Ala Glu 

= iO 215 2''0 

Set- Arg Lys Thr Ala Met Asp Ser Val Lys Met Phe Ala Asp Lys Gly 
230 235 240 

Glu Met Phe Gly Leu Ala Glu Lys Ala Val Glu Met Lys Leu Val Asp 

250 755 
Glu ueu Ala Tyr Arg Thr Asp Val Glu Lys Glu Leu Lys Lys Met Ser 

i^C^O 265 270 

Gin .-srg Gly Glu Lys Asp Glu Leu Arg Phe Val Ser Leu Ser Gin Val 
^'^^ 2B0 285 

^'"^ '^^'^ '^^^ ^""'^ "-'^ ^^"^ "^^^ ^ly Ser Arg He Ala Val 

295 300 
Lou Phe Ala Glu Gly Glu He Thr Glu Glu He lie Lys Lys Pro Phe 
305 310 315 

Asp Thr Asp Gly Ser Scr He Thr Gin Giu Leu Ala Lys Glu He Lys 

325 335 
Ala Ala Ala Asp Asp Asp Asp He Lys Ala Val Val Leu Arg Val Asn 

^'^O 345 350 

Ser Pro Gly Gly Ser Ala Phe Thr Ser Glu Gin He Trp Lys Gin Val 

^55 360 365 

Ala Asp Leu Lys Ala Lys Lys Pro He Val Val Ser Met Gly Asp Val 

375 380 
Ala Ala Ser Gly Gly Tyr Tyr He Ala Cys Ala Ala Asn Ser He Val 

390 395 
Ala Glu His Thr Thr Leu Thr Gly Ser He Gly He Phe Gly Met Phe 

405 415 
Pro Atn Pho Ala Gly Val Ala Lys Lys He Gly Val A.-rn Met Asp Val 
420 ' -■ - 



42b <I30 



Val Gin Thr Ser Lys Tyr Ala Asp Leu Gly Asn Vhr Phe Ala Pro Met 



'I'lO 445 



Thr Val Glu Arp Arg Ala Leu He Gin Arg Tyr He Glu Gin Gly Tyr 

455 460 
Asp Leu Phe Leu Thr Arg Val Ser Glu Gly Arg Asn Arg Thr Lys Ala 

if r, . ^'^^ 480 

Gin He Asp Scr H*x Ala Gin Gly Arg Val Trp Leu Gly Asp Lys Ala 

485 490 455 

Leu ^J.a Leu Gly Leu Val Asp Glu Leu Gly Gly Leu Asp Thr Ala H» 

500 505 510 

Lys Arg Ala Ala Lys Leu Ala Gin Leu Gly Gly Asn Tyr Ser He Glu 

515 520 525 

Tyr ^ly Lys Thr Lys Arg Asn Phe Phe Glu Glu Leu Leu Ser Ser Ser 

535 54 0 

Ala Ala Asp Mot Lys Ser Ala He Leu Ser Thr He Leu Ser Asp Pro 
550 555 5gf5 

Glu He Glu Vai Leu Arg Glu I.eu Arg Ser Met Pro Pro Arg Pro Ser 

565 570 575 

Gly lie Gin Ala Arg Leu Pro Tyr Tyr Phe Met Pro Tyr 
580 5(j5 

{2) IIIFORMATiOM FOR SEQ ID HO:400 *- — ..v 

(i) SEQUEMCR CHARACTERISTICS: 

(A) LEIJGTH: .^13 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(ii) MOLECULE rVFfc:: protein 

liii) HYPOTHETICAL: YES 

(vl> ORTGIIIAL SOURCE: 

(Ai OR6AHISH: Porphyromonas gingivalis 

(ixj FEATURE; 

(AI uma/KZY: misc f eature 
(B) LOCATlOi: 1 . . .313 



(Xi) SEOUEMCE DESCRIPTIOH: SEQ ID HO: 400 



Met 


Arg 


Ala 




lie 


Trp 




He 




Ser 


Val 


Ser 


Val 


Leu 


Phe 


Phe 


1 






5 










10 










IS 




Phe 


Gly 


Thr 


Al A 


lie 


Glv 


Gin 


Ala 


Gin 


Ser 


Arg 


Asn 


Arg 


Thr 


Tyr 


Glu 








20 










25 










30 






Ala 


Tyr 


Val 




Gl n 


T'/r 


Ala 


Asp 


Glu 


Ala 


He 


Arg 


Gin 


Met 


Ser 


Arg 




35 










40 










45 








Tyr 


A^ri 


lie 


Pro 


Ala 


Se:r 


He 


Thr 


He 


Ala 


Gin 


Ala 


Leu 


Val 


Glu 


Thr 


50 










55 










60 










Gly 


Ala 


Gly 


Ala 


Ser 


Thr 


Leu 


Ala 


Ser 


Val 


Hitf 


AsfU 


Asn 


Hie 


Phe 


Gly 


65 










TO 










75 












lie 


hys 


Cys 


His 


Lys 


Ser 


Trp 


Thr 


Gly 


Ly.-? 


Arg 


Thr 


Tyr Arg 


Thr 


Asp 










85 










90 










05 




Arp Ala 


Fro 




Glu 


Cys 


Phe 


Arg 


6*er 


Tyr 


Ser 


Ala 


Ala 


Arg 


Glu 


Scr 








ICO 










105 










110 






Tyr 


Gl'J 


Afp 


Mis 


Ser 


Arg 


Phe 


Lf;u 


Leu 


Gin 


Pro 


Arg 


Tyr 


Arg 


Pro 


Leu 






lis 










120 










125 








Phe 


Lys 


Leu 


Asp 


Arg 


Gill 


Afp 


Tyr 


Arg 


Gly 


Trp 


Ala 


Thr 


Gly 


Leu 


Gin 




130 










135 










140 










Arg 


Cys 


Gly 


Tyr 


Aid 


Thr 


Asn 


Arg 


Gly 


Tyr 


Ala 


Asn 


Leu 


Leu 


He 


Lys 










150 










155 










160 


Het 


Val 


Gl u 


Leu 


Tyr 


Glu 


Leu 


Tyr 


Ala 


Leu 


Asp 


Arg 


Glu 




Tyr 


Pro 










165 










170 










175 




Ser 


•rrp 


ehe 


Hiif 


Lys 


Ser 


Tyr 


Pro 


Gly 


Ssr 


Asn 


Lys 


Lys 


Ser 


His 


Glr. 








180 










1B5 










190 






Thr 


Thr 


Lys 


Gin 


Lys 


Gin 


Ser 


Gly 


Leu 


Lys 


His 


Glu 


Ala 


Tyr 


Phe 


Ser 






195 










200 










205 








Tyr 


Gly 


Leu 


Leu 


Tyr 


He 


He 


Ala 


Lys 


Gin 


Gly 


Asp 


Thr 


Phe 


Asp 


Ser 




210 










21S 










220 










Leu 


AJa 


Glu 


Glu 


Phe 


Asp 


Met 


Arg 


Ala 


Ser 


l.ys 


Lsu 


Ala 


Lys 


Tyr 


Asn 


225 










230 










23S 










240 


Asp 


Ala 


Pro 


Val 


Asp 


Phe 


Pro 


li- 


Glu 


Lys 


Gly 


Asp 


Val 


He 


Tyr 


Leu 




















2 50 










255 




Glu 


Ly.T 


Lys 


Wis 


Ala 


eys 


Ser 


ne 


Ser 


Lys 


His 


Thr 


Gin 


His 


Val 


Val 








260 










265 










270 






Ai.g 


Val 


Gly 


Asp 


Ser 


Iter, 


His 


Ser 


He 


Sec 


Gin 


Arg 


Tyr 


Gly 


He 


Arg 






275 










280 










285 








M-t 


Lys 


Asn 


Leu 


Tyr 


Lys 


L&u 


A&'ii 


Asp 


Lys 


Asp 


Gly 


Glu 


Tyr 


He 


Pro 




290 










295 










300 










Gin 


Glu 


Gly 


Asp 


He 


Leu 


Arg 


Leu 


Arg 

















305 310 



(£) IHFOWiATION FOR SEQ ID KO:4 01 

ti) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 523 amino acido 
(R) TYPE: amino acid 
(Di TOPOl^Y: linear 

(HI UOLECULE TYPE: protein 

(iii) H^-POTHF.TICAL: YES 

(vl) ORIGIHAL SOURCE: 

{A) ORCArasjl: Porphyrpmonat- gingivalic 

(ix) FEATURE: 

(A) NAUE/KEY: misc_f eature 
(D) LOCATIOII 1. . .523 

txH SEQUEHCE DESCRI PTIOTJ i SEQ 3D »O:401 

Met A^p Gly Arg Arq Tyr Ser Asp Gly Leu His Gin Ala Tic Glu Ala 



SUBSTITUTB SHEET (Rule 26) (RO/AU) 
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1 








5 










10 










15 




Li s 


Glu 


Hlr 


Val 


Ly5 


Val 


Glu 


Ala 


Ala 


Thr 


Gin 


Thr 


Phe 


Ala 


Thr 


He 






20 










25 










30 






Thr 


Lsu 


Gin 


At'n 


Tyr 


Phc 


Arg 


Met 


Tyr 


His 


Lys 


Leu 


Ala 


Gly 


Met 


The 






35 










40 










45 








Gly 


Thr 


Ala 


Gl u 


The 


Glu 


Ala 


Gly 


Glu 


I.eu 


Trp 


Asp 


He 


Tyr 


Lys 


Leu 


50 










55 










60 










Asp 


Vil 


Val 


Val 


Hg 


Pro 


Thr 


Asn 


Lys 


Pro 


He 


Ala 


Arg 


Lys 


Asp 


Met 


65 










70 










7G 










30 






Ara 


He 


Tyr 


Lys 


Thr 


Ala 


Arg 


Glu 


Lys 


Tyr 


Ala 


Ala 


Val 


He 










85 










90 










95 






Glu 


He 


Val 


Arg 


Le\j 


Val 


Glu 


Glu 


Gly 


Arg 


Pro 


Val 


Leu 


Val 


Gly 








100 










105 










110 






Thr 


Th r 


Ser 


Val 


Glu 


He 


Sar 


Glu 


Leu 


Leu 


Ser 


Arg 


Met 


Leu 


Arg 


Leu 






115 










120 










125 








Arg 


Gly 


He 


Gin 


His 


Ai^n 


Val 


Leu 


Asn 


Ala 


Lys 


Leu 


HI J 


Gin 


Lys 


Glu 


130 










135 










14C 










Ala 


GlLi 


He 


Val 


Ala 


Gin 


Ala 


Gly Gin 


Lys 


Gly 


Thr 


Val 


Thr 


He 


Ala 


145 










150 










155 










160 


Thr 


As 11 


Met 


Ala 


Gly Arg 


Gly Thr 


Asp 


He 


Lys 


Leu 


Ser 


Ala 


Glu 


Vol 










165 










1-70 










175 




Lyr 




Ala 


Gly 


Gly 


Leu 


Ala 


He 


He 


Gly 


Thr 


Glu 


Arg 


His 


Glu 


Ser 




lOO 










185 










190 






Arq 


Arg 


Val 


A-p 


Arg 


Gin 


Leu 


Arg 


Gly 


Arg 


Ser 


Gly 


Arg 


Gin 


Gl y 


Asp 






15*5 










200 










205 








Pro 


<?iy 




Ser 


He 


S'he 


Tyr 


Val 


Ser 


Leu 


Glu 


Asp 


His 


Leu 


Met 


Arg 




210 










215 










220 










Leu 


yhe 


Ala 


Thr 


Glu 


Lys 


He 


Ala 


Ser 


Leu 


Met 


Asp 


Arg 


Leu 


Gly 


Phe 


225 










230 










235 










24 0 


Lys 


GTn 


Gly 


Clu 


Val 


Leu 


Glu 


Asn 


Asn 


Het 


Leu 


Ser 


Lys 


Ser 


Val 


Glu 








245 










250 










255 




Arg 


Ala 


Gin 


Jrfi to 


Lys 


Val 


Glu 


Glu 


Asn 


Asn 


Phs 


Glv 


He 


Arg 


Lys 


His 






260 










265 










270 






Leu 


Leu 


Glu 


T\ r 


Asp 


Asp 


Val 


llct 


Asn 


Ser 


Gin 


Ara 


Glu 


Val 


He 


Tyr 






275 










280 










205 








Thr Arg Arg 


- 


His 


Ala 


Leu 


Het 


Gly Glu 




He 


Gly Met 


Asp 


Val 




290 










295 










300 










Leu 


Asn 


Thr 


He 


Tyr Asp 


Val 


Cys 


Lys 


Ala 


Leu 


He 


Asp Asn 


Tvr 


Ala 


305 










310 










315 










320 


Glu 


Ala 


Asn 


A^p 


Phe 


Glu 


Gly 


Phe 


Lys 


Glu 




L*u 


Met 


Arg 


Ala 












325 










330 










335 




Ala 


He 


Glu 


Ser 


Pro 


He 


Thr 


Gin 


Glu 


lie 


Phe 


Ar 


Gly 


Lys 


Lys 


Ala 








340 










34 5 










350 






Glu 


Glu 


Leu 


Thr 


A3p 


Met 


Leu 


Ph« 


Asp 


Glu 


Ala 


Tyr 


Lya 


Ser 


Phe 


Gin 






355 










360 










365 








Arg 


Lys 


tiet 


Asp 


Leu 


He 


Ala 


Glu 


Val 


Ala 


His 


Pro 


Val 


Val 


H\s 


Gin 




370 










375 










300 










Val 


Ph- 


Glu 


Thr 


Gin 


Ala 


Ala Val Tyr Glu 


Arg 


He 


Leu 


He 


Pro 


He 


385 










390 










3?5 










400 


Thr 


A.-P 


Gly 


Lys 


Acq 


Val 


Tyr Asn He 


Gly 


Cys 


Asn 


Leu 


Arg 


Glu 


Ala 










405 










410 










415 




Asp 


Glu 


Thr 


Gin 


Gly Lys 


Ser 


He 


He 


Lys 


Glu 


Phf? 


Glu 


Lys 


Ala 


He 








420 










425 










430 






Val 


Leu 


His 


Thr 


He 


Asp 


Glu 


Sec 


Trp 


Lys 


Glu 


His 


Leu Arg 


Glu 


Ner 






435 










440 










445 








Asp Glu 


Leu 


Arg 


Asn 


Ser 


Val 


Gin 


Asn 


Ala 


Ser 


Tyr 


Glu 


Asn 


Lys 


Asp 




4t>C 










455 










460 










Pro 


L-u 


Leu 


He 


Tyc 


Lya 


Leu 


Glu 


Ser 


Tyr 


Glu 


Leu 


Phe 


Arg 


Lys 


Het 


4 65 










470 










475 










430 


Val 


Glu 


Ala 


Met 


A-n 


Arg 


Lys 


Thr 


Val 


Ala 


He 


Leu 


Met 


Arg 


Ala 


Aro 










405 










4 90 










495 




lie 


Pro 


Val 


Pro 


Glu 


Ala 


Pro 


Ser 


Gin 


Glu 


Glu 


Leu 


Glu 


Hi? 


Arg 


Arg 








500 










505 










510 






Gin 


lie 


Glu 


He 


Arg 


His 


Ala 


Thr 


Gin 


Cln 


Arg 
















515 










520 



















12) ItirORllATIOH iOR SEQ ID tJO: 4 02 

( i 1 SEQU E! ICE CHAKACT ERI ST I CS : 

(A) LENGTH: 375 amino acids 
(BJ TYPE: amino acid 
(Dl T0POLCX3Y: linear 

(ii) MOLECL'LE TVPE: protein 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



•if"% 4?=^ -i}^"!! 1,51 »H«Ji ^ir' ti^ if=ii ff^i 

iLii ninji o .^a.. igr: a 113 m u-J o ius tuJ 
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liii) HYPOTHETICAL: YES 

(vj ) ORIGJNAl- SOURCE: 

(A) ORGANISt-I: Porphyromonas gingivalls 

(ix) E-EATURE: 

[A) IJAI'IE/K-EY: mlffc_feature 

(B) LOCATION 1 . . .375 

(xi) SEQUEhfCE DESCRIPTION: SEQ ID NO: 402 

M^t rvcn Ph^i Leu Lys Lys Glu Pro Phe I.y^ lie The Ser Met II- Tyr 

Leu Leu Leu Asp Thr He Thr Asn Arg Ala Gly Thr Glu Arg Ala Val 

He A.^n I.cu Ala Asn Asn Leu His Ma Am Gly His Arq Val Ser Leu 

Val Ser Val Cy^ Thr Lyr Glu Gly Glu Pro Ser Phe Gl„ Val G] u Ly^ 

55 go 
Gly Glu Val IK., Leu Gly He Arg Leu Tyr Gly Asn Ala Leu 

Ala Arg L^-' Thr Val Ty, ph, Lys Al. Tyr Arg Arg He Ly. A.,, Leu 

T/r Lye Lyr Ar, Glu Pro Vai Leu Leu Uc Gly Thr A^n He Phe II- 

■ lOS no 

A.,r. Thr lie L.u Ser Gin tie Ser A.n Arg Gly Arg He Phe Thr He 

Gly cy. Glu HI., He Ser Tyr Asp He Ala Arg Pro He Thr Lys Arg 

lie Arg Gly The Leu Tyr Scr Gly Leu Asp Ala Val Val Ala Leu Thr 

Lys Arg Asp Gin Gin Ser Phe Glu Ala Ho J." Arg Gly Arg Ser Lyr 



Ala Tyr Val He Pro Asn Gin Val Ser Phe Thr Thr Val Gin irq Asp 

185 1 on 

Ala Thr Thr His Lys Gin ^>et Leu Ala He Gly Arg Leu Thr Tyr Gin 

200 205 
Lys Gly Phe Glu Phe Met lie Glu Asp Ala Ser Arg Val Leu Arg Glu 

Ara Pro A.p Trp Ly. L«u He He Val Gly Asp Giy Glu Asn Glu Ser 

735 '9i!A 

M^t Leu Arg Lys Glu He Ala Ser Arg Asn Net Glu Ser Gin He Glu 

250 2CC 
I.e HI.. Pro ser Thr Pro Glu He Arg Lys Tyr Tyr Glu Ser S^r Ala 

He Tyr Leu Met Thr Ser Arg Phe g!u Gly Leu Pro Met v" Leu Leu 

- 2B0 285 

Glu A.a Glu Ala Tyr Ala Leu Pro He He Ser Tyr A«p Cy. Pro Thr 



Gly Pro Arg Glu Leu lie clZ Asn Gly Arg A^n Gly Phe Leu Val Pro 

Met Glu Ala Hi, Glu Asp Phe Ala Asp Lys Le! Arg Leu Leu Met A«p 

330 

Asp Glu Thr Leu Arg Lys Lys Met Gly Gin Glu Ser Glu Leu Met Val 
Lys ser Tyr Ser Pro Ala Asn He ^yr Glu Cys Trp Lys Lys Leu Ph= 

360 -acc 

Val Glu He Gly Tyr Met A^n 
••'70 375 

12) iriFORi lATIOII FOR SEQ ID NO: 403 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 362 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLEOJLE TYPE: protein 

(Hi) MYPOTHETICAL: YES 

(Vij ORIGINAL SOURCE: 

(A) ORGAJUSIl: Porphyromonas gingivalis 
{ ir ) FEATURE : 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



1U3 "-SS ^ii C1> 



ii~it ■•i"'^' "fl 
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PCT/AU98/01023 



338/ 



490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



I A) NAI-1E/KE\: inisc_featur- 
tQ> LOCATIOII 1 . . .362 



(>:ij SEQUEIUCE DESCRIPTION; SEQ ID MO: 403 



Met 


He 


Tyr 


Leu 


Leu 


Leu 


Asp 


Thr 


He 


Thr 


Asn 


Arg 


Ala 


Gly 


Thr 


Glu 


1 








5 










] 0 










15 




Arg 


Ala 


Val 


lie 


Asn 


Leu 


Ala 


Arn 


A.sn 


Lea 


Hie 


Ala 


Affn 


Gly 


His 


Arg 








20 










25 










30 






Val 


Ser 


Leu 


Val 


Ser 


Val 


Cys 


Thr 


Ly^ 


GlLl 


Gly 


Glu 


Pro 


Ser 


Phe 


Gin 






35 










40 










4 5 








Val 


Glu 


Lys 


Gly 


He 


Glu 


Val 


His 


His 


Leu 


Gly 


He 


Arg 


Leu 


Tyr 


Gly 




50 










55 










60 








Acn 




Leu 


Ala 


Arg 


Lys 


Thr 


Val 


Tyr 


Phe 


Lye 


Ala 


Tyr Arg Arg 


He 


6S 




















■75 










00 


Lys 


Ala 


Leu 


Tyr 


Lye 


Lys 


Arg 


Glu 


Pro 


Val 


Leu 


Leu 


He 


Gly 


Thr 


Asn 










85 










90 










95 




lie 


Phe 


Tie 


Aen 


Thr 


He 


Leu 


Ser 


Gin 


He 


Scr 


A.-'n 


Axg 


Gly 


Arg 


He 








100 










105 










110 






Phe 


Thr 


He 


Giy 


Cys 


Glu 


Hl3 


He 


Ser 


■ryr 


Asp 


He 


Ala 


Arg 


Pro 


He 






115 










120 










125 








Thr 




Arg 


He 


Arg 


Gly 


Phe 


Leu 


Tyr 


Ser 


Gly 


Leu 


Asp Ala 


Val 


Val 




130 










135 










140 










Ala 


Leu 


Thr 


Lys 


Arg 




Gin 


Gin 


Ser 


Phe 


Glu 


Ala 


He 


Leu 


Arg 


Gly 


145 










150 










155 










160 


Arg 


Ser 


T.y.7 


Ala 


Tyr 


Val 


He 


Pro 


Asn 


Gin 


Val 


Ser 


Phe 


Thr 


Thr 


Val 










165 










170 










17S 




Gin 


Arg 


Asp 


Ala 


Thr 


Thr 


Hi? 


Lys 


Gin 


Met 


Leu 


Ala 


He 


Gly Arg 


Leu 








100 










185 










190 






Thr 


Tyr 


Gin 


Lys 


Gly 


Phe 


Glu 


Phe 


Met 


He 


Glu 


A5p 


Ala 


ser 


Arg 


Val 






:i 95 










200 










205 








Leu 


Axg 


Glu 


Arg 


Pro 


Asp 


Trp 


Lys 


Leu 


He 


Ho 


Val 


Gly 


Asp 


Gly Glu 




210 










215 










220 












Glu 


Ser 


Met 


Leu 


Arg 


Lys 


Clu 


He 


Ala 


Ser 


Arg 


Asn 


Met 


Glu 


Sep 


225 










230 










235 










240 


Gin 


He 


Glu 


He 


His 


Pro 


Ser 


Thr 


Pro 


Glu 


He 


Arg 


Lys 


Tyr 


Tyr 


Glu 










245 










250 










255 




Ser 


Ser 


Ala 


He 


Tyr 


Leu 


Met 


Thr 


Ser 


Arg 


Phe 


Glu 


Giy Leu 


Pro 


hJet 








260 










265 










270 






Val 


Leu 


Leu 


Glu 


Ala 


Glu 


Ala 


Tyr 


Ala 


Leu 


Pro 


He 


He 


Ser 


Tyr 


Asp 






27& 










280 










295 








Cys 


Pro 


Thr 


Gly 


Pro 


Arg 


Glu 


Leu 


He 


Glu 


Asn 


Gly 


Arg 


Asn 


Gly 


Phe 




290 










295 










300 










Leu 


Val 


Pro 


Mot 


Glu 


Ala 


HLs 


Glu 


Asp 


Phe 


Ala 


Asp 


Lys 


Leu 


Arg 


Leu 


305 










310 










315 










32 0 


I.CII 


J-Iet 


Asp 


Afp 


Glu 


Thr 


Leu 


Arg 


Lys 


Lys 


Met 


Gly 


Gin 


Giu 


Ser 


Glu 










325 










330 










335 




Leu 


Met 


Val 


Lys 


Ser 


Tyr 


Ser 


Pro 


Ala 


Asn 


He 


Tyr 


Glu 


Cys 


Trp 


Lys 








340 










345 










350 






Lys 


L-u 


Phe 


Val 


Glu 


He 


Gly 


Tyr 


Met 


Asn 















12) IHFORIIATIOW FOR SEQ ID MO: 404 

(1) SEQtJEUCE CHARACTERISTICS: 

(At LEMGTH: 64 0 amino acids 
(B) TYPE: amino acid 
(D) TOPOLCX3Y: linear 

(li) MOLECULE TYPE: protein 

(liiJ HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

lA) ORGAHISM: Pcrphyromonas glngivslls 

(ir.) FEATURE: 

<A) UAllE/KEY: misc_feature 
(B) LOCATION' 1...640 

(Xi) SEQUEUCE DESCRIPTIOH: SEQ XD 140:404 

Met Gly Lys He He Gly He Asp Leu Gly Thr Thr Asn Ser Cys Val 

15 10 IS 

Ser Val Leu Glu Gly Asn Glu Pro He Val Us Thr Asn Ser Glu Gly 
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2^ -^S 30 

Ly^ Arg Thr Thr Pro Ser Val Val Ala Phe Val Asp Gly Giy Glu Arg 

Lys Val Gly Arp Pro Ala Lys Arg Gin Ala lie Thr Arn Pro Thr Lys 
55 

Thr lie Tyr Sei Il« Lyr Arg Phe riet CJly Glu Thr Tyr Asp Gin Val 

Scr Arg Glu Val Glu Arg Val Pro Pho Lys Val Val Arg Gly A^p l^n 

8ft 90 
A.^n Thr Pro Arg Val A^p Tie Asp Gly Arg Leu Tyr Thr Pro ^In Glu 

]05 

11= Scr Ala Met He Leu Gin Lys Met hyc Lys Thr Ala Glu A^p Tyr 

Leu Gly Gin Glu Val Thr Glu A^a Val He Thr Val Pro Ala Tyr Phe 

A^n Asp Ala Gin Arg Gin Ala Thr Ly^ Glu Ala Gly Glu He Ala Gly 

Leu Lys Val Arg Arg lU Val Asn Glu Pro Thr Ala Ala Ser Leu ^la 

i65 3^75 

Tyr Gly Leu J\ap Lys Ser Asn Lys Asp HeL Lys Ho Ala Val Phe Asp 

105 

Leu Gly Gly Gly Thr Phe /urp He Ser He Leu Glu Leu Gly A^p Gly 

200 205 

Val Phe Glu Val Lys Ser Thr Asn Gly Asp Thr His Leu Gly Gly Asp 

Asp Phe Asp His Val He Ho Asp Trp Leu Ala Glu Glu Phe Lys S-r 

' ^"^^^ 23S 240 

Gin Glu Gly Val Asp Leu Arg Gin Atp Pro Met Ala Met Gin Arg Leu 

^■^^ 250 255 

Lys Glu Ala Ala Glu Lys Ala Lys He Glu Leu Ser Ser Thr S^.r Ser 

Thr Glu He Asn Leu Pro Tyr He Met Pro Val Asn Gly He Pro Lys 

Mis Leu Oal Met Thr Leu Thr l^g Ala- Lys Phe Glu G^n Leu Ala Asp 

295 200 
Arg Leu He Gin Ala Cy^ Val Ala Pro Cys Glu Thr Ala Leu Lys Asp 
310 

Ala Gly Met Ser Arg Gly Asp He Asp Glu Val He Leu Val Gly cly 

•^25 330 33c 

scr Thr Arg He Pro Ala He Gin Glu He Val Glu Lys He Phe Gly 

3'*5 350 
Lys Ala Pro Ser Lyc Gly Val Asn Pro Asp Glu Val Val Ala Val Gly 

360 ^ 
Ala Ala He Gin Gly Gly Val Leu Thr Gly Glu Val Lys Asp Val Leu 

Leu Leu Asp Val Thr Pro Leu Ser Leu Gly He G^u Thr Met Gly Gly 

Val Met Thr Arg Leu He Asp Ala Asn Thr Thr He Pro Thr Lys Zy^ 
4 05 'IS 

Ser Giu He Phe Thr Thr Ala Val Asp Asn Gin Pre Ser Val Clu He 

His val Leu Gin Gly Glu Arg Ser l;u Ala Lys Asp Asn Lys Ser He 
Gly Arg Phe Asn Leu Asp Gly He Ala Pro Ala Pro A^g Gin Thr Pro 



Gin He Glu Val Thr Phe A^^ He Asp Ala Asn Gl^y He Leu A^n Val 

Thr Ala His A^p Lys Ala Thr Gly Lys Lys G^n A^^n Ha Arg He Gl'u 

^ 490 

Ala Ser Ser Gly Leu Ser Asp Asp Glu He Lys Arg Met Lys Glu Glu 

505 

Ai* Glr. Ala A.-,„ Ala Glu Ala Asp Ly:. Lys Glu Lye Glu Arg He Asp 

Ly.- Ue Asn Gin Ala Asp Ser Met He fhe Gin Thr gIu Lys Gin Leu 

535 540 
Lys Glu Leu cl-. A.-p Lys Phe Pro Ala Asp Lys Lys Ala Pro He Asp 

Thr Ala Leu Asp Lys Leu Ly., Glu Ala His Lys Ala Gin A.p Val "a 



Ala He Asp Thr Ala Met Ala Glu Leu Gin Thr Ala Leu Ser Ali. Ala 

585 590 
Gly eiu Glu Leu Tyr Ly= Asn Ala Gly Ala Ala Gl,. Gly Gly Ala Gin 

600 ^05 
Pro Gly Pro Asp Phe Gly Gly Ala Gin Gly Pro Ser Ala Gly Asp Gin 



620 
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Pio Ser A.-p Asp Lys Asn Vai Thr Asp Val A.-p Phe Glu Glu Val Ly- 
^30 635 640 

(2) IHFORIIAriOfJ FOR SEQ ID HO:J05 

li) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4<9 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOG-^: linear 

(il) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingival is 

Ux» FEATURE: 

(A) NAIIE/KEY: mlsc_feature 

(B) LOCATION 1. , .44 9 

(Xi) SF.QUEIICE DESCRIPTIOII: SEQ ID NO: 405 

Met Arg Tyr Asp Leu Ala He Tie Gly Gly Gly Pro Ala Gly Tyr The 

^ S. 10 15 

Ala Ala Glu Arg Ala Ala Lys Gly Gly Leu Lyj: Thr Leu Leu II- Glu 

20 25 30 

Lys A.«n Ala Leu Gly Gly Val Cys Leu Asn Glu Gly Cys He Pro Thr 

3-"^' 4 0 45 

Lys Thr Leu Leu Tyr Ser Ala Lys Val Leu His Gin Tie Ala Thr Ala 

55 60 
Scr Lyf^ Tyr Ala Val Ser Gly Thr Ala Asp Gly Leu Asp Leu Gly Lys 

■'O 75 80 

Val He Ala Arg Lys Gly Lys He He Arg Lys Leu Thr Ala Gly H« 

85 90 95 

Arg Ser Arg Leu Thr Glu Ala Gly Val Glu M«l Val Thr Ala Glu Ala 

100 105 110 

Thr Val Thr Gly Cys Asp Ala Asp. Gly He Tie Gly He Thr Ala Gly 

115 12C 125 

Glu Ala Gin Tyr Lys Ala Ala Asn Leu Leu Leu Cye Thr Gly Ser Giu 

l-'^O 135 140 

Thr Phe Ho Pro Pro He Pro Gly Val Glu Gin Thr Glu Tyr Trp Thr 
1*^0 155 160 

Asn Arg Glu Ala Leu Gin Asn Lys Glu He Pro Thr Ser Leu Val He 

155 170 175 

He Gly Gly Gly Val He Gly Met Glu Phc Ala Ser Phe Ph- Asn Glv 

180 185 190 

T-ic Gly Thr Gin Val Hie Val Val Glu Met Leu Pro Glu He L-u A^n 

1?5 209 205 

Gly He A^-p Pro Glu Ills Ala Ala Het Leu Arg Ala His Tyr Glu Lys 

210 215 220 

Glu Gly He Lys Phe Tyr Leu Gly His Lys Val Thr Ser Val Arg Asn 
230 235 240 

Gly Ala Val Thr Val Glu Tyr Giu Gly Glu Ser Lye Glu He Glu Gly 

245 250 255 

Glu Arg He Leu Met Ser Val Gly Arg Arg Pro Val Leu Gin Gly Ph- 

260 265 270 

Glu Ser Leu Gly Leu Val Leu Ala Gly Lys Gly Val Lys Thr Asn Glu 

275 280 285 

Arg Met Gin Thr Ser Leu Pro Asn Val Tyr Ala Aln Gly Asp He Thr 

290 295 3no 

Gly Phe Ser Leu Leu Ala His Thr Ala Val Arg Glu Ala Glu Val Ala 
310 315 32(j 

Val Asp Gin He Lou Gly Lys Thr Asp Glu Thr Met Ser Tyr Arg Ala 

325 330 335 

Val Pro Gly Val Val Tyr Thr Asn Pro Glu Val Ala Gly Val Gly Glu 

•^^0 345 
Thr Glu Giu Ser Leu Arg Lys Ala Gly Arg Ala Tyr Thr Val Arg Arg 

3^^ 360 — 365 

Leu Pro Met Ala Phe Ser Gly Arg Phe Val Ala Glu Asn Glu Gin Gly 

370 375 380 

Asn Gly Glu Cyr Lys Leu Leu Leu Asp Glu Glu Arn Arg Leu He Gl - 

390 395 
Ala Hi? Leu He Gly Asn Pro Ala Gly Glu Leu He Val Thr Ala Ala 
405 410 415 
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net Ala lie Glu Thr Gly Met Thr Asp Arg Gin lis Glu Arg lie He 
The Pro Ki^ Pro Thr Val Gly Glu 111 Leu Ly. Glu Thr Leu Ala Gly 
5 Gly '"^^ 

(2 J ItIFX)RJlATIOfl FOR SEQ ID nO:'l06 

10 *n SEQUKHCE CHARACTERISTICS: 

(A) LENGTH: 941 amino acid* 
(Bl TYPE: amino acid 
{ D) TOPOLOGY: linear 

15 (li) MOLECULE TYPE: protein 

HYPOTHETICAL: YES 



20 



25 



(vi) ORIGIIIAL SOURCE: 

(A) ORGAI/ISH: Porphyromcnas ginqivalis 
(ix) tlLATURE: 

{A) HAjIE/KEY: ml sc_ feat ur- 

(B) LOTjtvTIOM 1. . . 941 

(>:i) SEQUEUCE DLSCRJPTION : SEQ ID 110:406 
Met Glu Leu Lye Arg Phe Leu Ser Lou Gly Leu Leu Leu Val Gly Phc 



30 ri. Pro Met Ly. Lu Ser Ala Gin Gin 11. Gin Pro Leu Pro Thr Asp 
Pro Ala Val Arg Val Gly Lys Leu A.^n Gly Leu Thr Tyr Phe He 

35 '''''' so" """" ""^^ ""^P ^"^P Oln 

Lv-£ Val Cly s,r II. Leu Glu Glu Asp Ser Gin Ser Gly Leu Ala His 

Phe Leu Glu His Met A].". Phe A^n Gly Thx Lys Arn Phe Pro Gly lys 

40 Asn Lsu Ilo A-.n Tyr Leu Glu Thr He lly Val Arg eh« Gly tin Asn 

Leu A:.n Ala Ser Thr Gly FKe Asp Lys Tl.r Glu Tyr Thr lit Met Asp 

120 12= 
^ val rrc Thr Thr Arg Gin Gly He He Asp Ser Cj^ Leu Leu He Leu 

Hl^ A,=p Trp Ser As,. Arn He Thr Leu Asp Gly nil GJ u He Asp Glu 

Glu Ar, Gly V„l He G^n Glu Glu Trp Arq a" Arg Arg Asp Ala lit 

50 Leu Arg Met Phe Glu Ala He Leu Ala iys Ala Met Pro Gly 'fin Lys 

Tyr Ala c.lu Arg Het Pro He Gly lit „e^ val Val ^eu Asn Phe 

200 205 
55 "il '^'^ "■'^ ■'■^f Tvr Arg Pro Asp 

Leu Gin Gly Leu Val He Val Gly Asp He Asp yll Asp Tyr Val Glu 
Asn Lys He Lys Gl„ Leu Phe Lys Asp Val Pro Ala Pro Val Asn Fro 

60 Ala Glu Aig He Tyr Thr Pro Val Glu "p Asn A.p Glu Pro He Val 

265 97n 
Ala He Ala Thr Asp Ala Glu Ala Thr Thr Thr Gin Leu Ser He S-r 

280 2S5 
65 2O0 ^" |S| val Arg Gly s^. He Phe Gly 

Leu val Glu Asp Tyr Met Lj's Gin Val He Thr Thr Ala Val Asn Glu 
Arg I.e„ ser .Jlu He Thr His Lys Pro Asn Ma Pro Ph= Leu Ser Wa 
70 Gly Ala Phe Phe slt Asn Phe Het Tyr He Thr Gin Thr Lys ^p Ala 

Phe Asr. Phe val Ala Thr Val Arg gJu Gly Glu Ala Glu Lys Ala Met 

360 

75 3,1 "'^ l\l I--" '^'^ °ly lie 'ihr 

380 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



■"J 


Gly 


Glu 


Tyr Asp 


Arg 


Ala 


Ara 


Thr 




Val 


Leu 


Lys 


Arg 


Tyr 


Glu 


385 








390 










395 










4 00 


JKsn 


Gin 


Tyr 


Asn Glu 


Arg 


Asp 


Lys 


Arg 


Lys 


Asn 


Asa 


Ala 


Tyr 


Ala 


Asn 








40b 










410 










415 




Qlu 


Tyr 


Ser 


Thr Tyr 


Phe 


Thr 


Asp 


Gly 


Gly 


Tyr 


He 


Pro 


Gly 


He 


Glu 








420 








425 










430 






Val 


(»lu 


Tyr 


Gin Thr 


Val 


Asn 


Ala 


Phe 


Ala 


Pro 


Gin 


Val 


Pro 


Leu 


Glu 






435 








440 










445 








Ala 


Phe 


A»n 


Gin Ala 


He 


Ala 


Gin 


Met 


11 r 


Asp 


Pro 


Val 


Lys 


Asn 


Ala 




450 








453 










460 










Vai 


Val 


Thr 


Leu Thr Gly 


Pro 


s-r 


Lys 


Ala 


Glu 


Ala 


Lvs 


He 


Pro 


Ser 


4 65 








470 










47S 










480 


Gl u 


Ai a 


A^p 


Phe L-u 


Ala 


Aid 


Phe 




Al a 


Al a 


Ara 


Gin 


Gin 


Lys 


Val 








485 










4 90 










495 




Gl u 


Ala 


L''s 


Lys Ai-p Glu 


Val 


Ser 


Asp 




L.VS 


Leu 


Met 


Glu 




Ala 








500 








505 










510 






Pro 


L'*s 


Aia 


Gly Lys 


He 


Vai 


Ser 


Gl u 


Lys 






Gin 


Lys 


Phe 


Gly 






515 








520 










525 










Thr 


Glu 


Leu Thr 


Leu 


Ser 


Asn 




lie 




Val 


Tyr 


Leu 


Lys 


Lys 




S30 








535 










540 














PhP- 


Lys Ser 


Asn 


Glu 


He 


L u 


Met 


Ser 


Ala 


Leu 


s^r 


Pro 


Gly 


54 5 






550 










555 










560 








Ser Gly Lys 


His 


Ala 


Pro 


Afn 


Gin 


Ser 


Val 


Met 


Asn 


Ser 








S65 










570 










575 










Val Gly 


Gly Leu 


Gly 


Aj=!n 


Ph** 


Asp 


Ala 


He 


Gin 


Leu 


Arp 








seo 








585 










£90 






T 


Val 


L'>u 


Thr Gly Arg 


Sec 


Ala 




V*a 1 


cr 


Pro 


S«r 


Leu 


S«r 


Leu 






595 








600 










605 










St 


Glu 


Gly Leu 


Ser 


Gly Lys 


Thr 


Thr 




Glu 


Asp 




Glu 


Thr 




610 








615 










620 










Ph= 


Ph^ 


in 


Leu lie 


Tyr 


Leu 


Gin 


Met 


Thr 


Al a 


Asn 


Arg 




Asp 


Pro 


625 








630 










635 










640 


Giu 


Al a 




Lys Ala 


Thr 


Gin 


Glu 


Lys 


Leu 


Tyr 


Asn 


Asn 


eu 


Lys 


Asn 








645 










650 










655 




Gin 


Glu 


Ala 


Asn Pro 


MRt. 


Ala 


Ala 


Leu 


Met 


Asp 


Ser 


Ho 


Ara 


His 


Thr 








6G0 








665 










670 






Met 


J /r 


Gly 


A?p Asn 


Fro 


Met 


Met 


Lys 


Pro 


M-t 


Lys 


Ala 


Ala 


Asp Val 






675 








680 










685 








Glu 


Lys 


Val 


Asn Tyr Asp Gin Val 


Met 


Ala 


Phe 


Tyr 


Asn 


Glu 


Arg 


Phe 




690 








695 










700 










Ala 


Asp 


Ala 


Gly Asp 


Phe 


Met 


Phe 


Phe 


Phe 


He 


Gly Asn 


Leu 


Asp 


Glu 


70S 








710 










715 










720 


Ala 


Lys 




Lys Pro 


Leu 


He 


Glu 


Thr 


Tyr 


Leu 


Ala 


Ser 


Leu 


Pro 


Asn 






725 










7 30 










735 




Leu 


... 


Arg 


Gly Asp Lys 


net 


Asn 


Lys 


Ala 


Gin 


Val 


Pro 


Ala 


Ala 


Arg 








740 








745 










750 






Ser 


Gly 


Lys 


He Asp 


Cys 


Lys 


Phe 


Glu 


Lys 


Glu 


Met 


Asp 


Thr 


Pro 


Ser 






755 








760 










765 








Thr 


Thr 


lie 


Phe Asp 


Val 


Val 


Ser 


Gly Asn 


Val 


Glu 


Tyr 


Thr 


Leu 


Lys 




,770 








775 










760 










Asn 


Ser 


Leu 


Leu Leu 


Glu 


Val 


Phe 


Ser 


Ala 


Val 


M^it 


Asp 


Gin 


Val 


Tyr 


785 








790 










795 










800 


Thr 


Ala 


Thr 


Val Arg 


Glu 


Lys 


Glu 


Gly 


Gly Ala 


Tyr 


Ser 


Val 


Ala 


Ala 








805 










810 










815 




Phe 


Gly 


Gly 


Leu Giu 


Gin 


Tyr 


Pro 


Gin 


Pro 


Lys 


Ala 


Leu 


Met 


Gin 


He 








820 








825 










830 






Tyr 


Phe 


Pro 


Thr Asp 


Pro 


Ala 


Arg 


Ala 


Glu 


Glu 


Met 


Ae-n 


Ala 


He 


Val 






835 








640 










845 








Phtj 


Ala 


g:u 


Leu Glu 


Lys 


Leu 


Ala 


Lys 


Glu 


Gly 


Pro 


Asn 


Val 


Glu 


Tyr 




850 








055 










860 










t'h* 


Lys 


Lys 


Thr lie 


Glu 


Asn 


Leu 


Asn 


Lys 


Gin 


His 


Lys 


Glu 


Ser 


Leu 


865 








870 








075 










880 


Acg 


Giu 


Asn 


Arq Phe 


Trp 


Leu 


Glu 


Ala 


tl*t 


Lys 


Ala 


Ser 


Phe 


Phe 


Glu 








885 










890 










895 




Clv 


Asn 


Asp 


Phe He 


Thr 


As-p 


Tyr 


Glu 


Ser 


Val 


Leu 


Asn 


Gly 


Leu 


Thr 








900 








905 










910 






PrD 


Aid 


Glu 


Leu Gin 


Lyr 


Phe 


Ala 


Ala 


Asp 




Leu 


Lys 


Gin 


Gin 


Asn 






915 








920 










925 








Arg 


Val 


Val 


Val net 


Mer 


Ma 


Pro 


Val 


Ala 


Lys 


Aia^ 


Gin 










930 








935 










940 











(2) IttFORllfiTlON FOR SEQ ID HO: 407 



(i) SEQUENCE CHARACTERISTICS: 
75 (A) LEIIGTH: 684 amino acids 
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(B) TYPE: amino acid 
(D) TOPOLOGY: llnsar 

(ii) IIOLECt'LE TYPC: protein 

tiii) HVPOTHETICAL; YES 

(vl) ORIGINAL SOURCE: 

lA) ORGANISE: Porphyromonas girn;;iYaliE 

(ix) ftATURE: 

(A) HAI-IE/KEV: mis cofeature 

(B) LOCATION 1. . .604 



txi) SEQUEHCE DESCRIPTION: SEQ ID HOr^O"? 



Met 


Ser 


Lyr 


Lys 


Gly 


Thr 


He 


Gly 


Val 


Thr 


Ser 


Asp 


Asn 


He 


Phe 


Pro 


1 








5 










10 










IS 




Val 


lie 


L^'s 


Lyp 


Phe 


Leu 


Tyr 


Ser 


Asp 


His 


Glu 


He 


Phe 


Leu 


Ary 


Glu 








20 










25 










30 






lie 


Val 


Ser 


Acn 


Ala 


Val 


Asp 


Ala 


Thr 


Gin 


Lyi; 


Leu 


Lys 


Thr 


Leu 


Thr 






3£ 










40 










45 








Ser 


Val 


Gly 


Glu 


Phe 


Lys 


Gly 


Glu 


Thr 


Gly 


Asp 


Leu 


Arg 


Val 


Thr 


Val 




50 










55 










50 










Ser 


Vnl 


Asp 


Glu 


Val 


Ala 


Arg 


Thr 


He 


Thr 


Val 


Ser 


Asp 


Arg Gly 


Val 


65 










70 










75 










80 


Gly 


Hst 


The 


Glu 


Glu 


Glu 


Val 


Glu 


Lys 


Tyr 


He 


Asn 


Gin 


He 


Ala 


Phe 








85 










90 










95 




Ssr 


Sor 


A1 A 


Glu 


Glu 


Phe 


Leu 


Glu 


Lys 


Tyr 


Lys 


Asp 


Asp 


Lys 


Ala 


Ala 








100 










105 










110 






lie 


lie 


Gly 


Hiff 


Phe 


Gly 


Leu 


Gly 


Phe 


Tyr 


Ser 


Ala 


Phe 


Met 


Val 


Ser 






lis 










120 










125 








Glu 


Arg 


Val 


Asp 


Val 


lie 


Thr 


Arg 


Ser 


Phe 


Arg 


Glu 


Asp 


Ala 


Thr 


Ala 




130 










135 










140 










Val 


Lys 


Trp 


Ser 


Cys 


Asp 


Gly 


Ser 


Pro 


Glu 


Tyr 


Thr 


Leu 


Glu 


Pro 


Ala 


145 










150 










155 










IGO 


Asp 


Lys 


Ala 


Asp 


Acq 


Gly 


Thr 


Asp 


lie 


Val 


net 


His 


He 


Asp 


Glu 


Glu 








165 










170 










175 




As^n 


Ser 


Glu 


Phe 


Leu 


Lys 


Lys 


Glu 


Lys 


He 


Glu 


Gly 


Leu 


Leu 


Gly 


Lys 








100 










185 










190 






Tyr 


Cys 


Lys 


Phe 


Leu 


Thr 


Val 


Pro 


He 


He 


Phe 


Gly 


Lys 


Lys 


Gin 


Glu 




195 










200 










205 








Trp 


hys 


Asp 


Giy 


Lys 


Met 


Gin 


Asp 


Thr 


Asp 


Glu 


Asp Asn 


Gin 


He 


Asn 




7.\ 0 










215 










220 










Asp 


Thr 


His 


Pro 


Ala 


Trp 


The 


Lys 


Lys 


Pro 


Ala 


Asp 


Leu 


Lys 


Asp 


Glu 


22S 










230 










235 










240 


Asp 


Tyr 


Lyb- 


Glu 


Phe 


Tyr 


Arg 


Ser 


Leu 


•l*yr 


Pro 


Met 


Ser 


Glu 


Glu 


Pro 










245 










250 










255 




Leu 


Phe 


Trp 


lie 


His 


Leu 


Asn 


Val 


Asp 


Tyr 


Pro 


Phe 


Asn 


Leu 


Thr 


Gly 








260 










265 










270 






lie 


Leu 


Tyr 


Phe 


Pro 


Lys 


He 


Lys 


Asn 


Asn 


Leu 


Asp 


Leu 


Gin 


Arg 


Asn 






275 










280 










285 










lie 


Gli\ 


Leu 


Tyr 


Cys 


Asn 


Gin 


Val 


Tyr 


Val 


The 


Asp 


Glu 


Val 


Gin 




290 










255 










300 










Gly 


He 


Val 


Pro 


Asp 


Phe 


Leu 


Thr 


Leu 


Lsu 


His 


G]y 


Val 


He 


Asp 


Ser 


305 










310 










315 










320 


Pre 


Asp 


He 


Pro 


Leu 


Asn 


Val 


Ser 


Arg 


Ser 


Tyr 


Leu 


Gin 


Ser 


Asp 


Ala 










325 










330 










335 




Asn 


Val 


Lye 


Lys 


He 


Ser 


Ser 


Hi5 


He 


Thr 


Lys 


Lys 


Val 


Ala 


Asp 


Arg 








3-10 










345 










350 






Leu 


GJ u 


Glu 


lie 


Phe 


Lys 


Asn 


Asp 


Ary 


Pro 


Thr 


Phe 


Glu 


Glu 


Lys 


Trp 






355 










36G 










365 








Asp 


Ser 


Leu 


Lys 


Leu 


Pho 


Val 


Glu 


■ryr 


Gly 


Met 


Leu 


Thr 


Asp 


Glu 


I.ys 




370 










375 










300 










Phe 


Tyr 


Glu 


Arg 


Ala 


Ala 


Lys 


Phe 


Phe 


Leu 


Phe 


Thr 


Asp 


Met 


Asp 


Gly 


385 










390 










395 










400 


His 


Lys 


Tyr 


Thr 


Phe 


Affp 


Glu 


Tyr 


Arg 


Thr 


Leu 


Val 


Glu 


Gly 


Val 


Gin 










4 05 










410 










415 




The 


Asp 


Lys 


Asp 


Giy 


Gin 


Val 


Val 


Tyr 


Leu 


Tyr 


Ala- 


■Thr 


Asp Lyt 


His 








420 










425 










4 30 






Gly 


Gin 


Tyr 


Sf:r 


His 


Val 


Lys 


Arg 


Ala 


Ser 


Asp 


Lys 


Gly 


Tyr 


Ser 


Val 






435 










440 










445 








net 


Leu 


Leu 


Aep 


Gl y 


Gin 


Leu 


Asp 


Pro 


His 


He 


Val 


Ser 


Leu 


Leu 


Glu 




450 










455 










460 










Gin 


Lys 


Leu 


Glu 


Lys 


Thr 


Hifc- 


PhG 


Val 


Arg 


Val 


Asp 


Ser 


Asp 


Thr 


He 
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4 65 










4 7 0 










■J 7 5 










480 


Ain 


Asn 


Lsu 


lie 


Arg 
4 85 


Lys 


Glu 


Glu 


Arg 


Ala 
4 90 


Glu 


Vrt I 


L.ys 


L»u 


Ser 
4 95 


Asp 


Thr 


Glu 


Arg 


Ala 


Thr 


Leu 


Vai 


Lys 


L-u 


Phe 


Glu 


Ala 


Arg 


Leu 


Pro Arg 








500 










505 










510 






Asp 


Glu 


515 


hys 


Hiff 


Phe 


Asn 


Val 
520 


Aid 


Phe 


Glu 


Ser 


Leu 
525 


Gly 


Ala 


Glu 


Gly 


Glu 
530 


Ala 


ile 


Leu 


lie 


Thr 

535 


Gin 


Ala 


Glu 


Phe 


Met 
S-JO 


Arg 


Arg 


Met 


Arg 


Asp 


net 




Gin 


Leu 


CJln 


Pro 


Gly 


net 


Sor 


Phe 


T-r 


Gly 


Glu 


Leu 


Pro 


545 










550 








555 










560 


Asp 


ser 




Asn 


Leu 


Val 


Leu 


hsn 


Thr 


Asp 


His 


Pro 


Leu 


He 


Asp Arg 










565 










570 










575 




Val 


Leu 


Ser 


GXy 
580 


Glu 


Lyr 


Glu 


Ser 


Vsl 
585 


Glu 


Pro 


£tc 


Leu 


Thr 

s?o 


Glu 


Leu 


Arg 


Ala 


Lys 
555 


lie 


Ala 


Glu 


Leu 


Lys 
600 


Ala 


Glu 


Glu 


Ala 


Lys 
505 


Leu 


Leu 


Asp 


Glu 


Glu 
610 


Lys 


Gly 


Lys 


Lys 


Pro 
615 


Glu 


Glu 




Pro 


Val 
620 


Ala 


7hx 


Lys 


Glu 


Ala 


tys 


(ilLi 


Asn 


As 11 


Ala 


Val 


Glu 


Gin 


Ala 


Lys 


Thr 


Glu 


Gly 


Ser 


He 


625 










630 










635 










64Q 


Asn 


nep 


Cln 


Leu 


Thr 


Lys 


Tyr 


Ala 


Gin 


Asp 


Asn 


Glu 


Leu 


He 


Gly 


Gin 








645 








650 










655 




Leu 


He 


Ab-p 


Lsu 
660 


Al a 


Lftu 


Leu 


Gly 


Sei- 
ses 


Gly 


Lou 


Levi 


Thr 


Gly 
57 0 


Glu 


Ali> 


Lsu 


Ala 


Glu 

67S 


The 


lie 


Ara 


Arg 


Scr 
680 


Gin 


Arg 


Leu 


Leu 











(2) INFORIIATIOH FOR SEQ ID HO: 4 08 

(i) SEQUEMCF. CHARACl'tlRISTICS: 

lA) LENGTH: -164 amino acids 
(B) TYPE: amino acid 
(r>) TOPOLOGY: iin«ar 

(iij IIOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vl) ORIGIMAL SOURCE: 

I A) ORGAHISH: Porphyromonas gingivaiis 

{ix! FEATURE: 

(A) WAIIE/KEY; misc_feature 
(H) LOCATION 1 . . .464 



(xi> SEQUENCE DESCRiPTION: SEQ ID HO: 408 



Met 


Glu 


Lys 


Leu 


He 


Asp 


He 


Leu 


Val 


Val 


Asp 


Asp 


Asp 


Val 


Ala 


Val 


1 








5 










10 










15 




Cys 


Ala 


Ala 


Leu 


Arg 


Leu 


Val 


Leu 


Ly.i 


Arg 


Ala 


Gly 


Tyr Asn 


Pro 


Vai 








20 










25 










30 






lie 


Ala 


Asn 


Ser 


Pro 


Asp 


Glu 


Ala 


Leu 


Ser 


He 


Met 


Arg 


Asn 


Pro 


Asp 






35 










40 










45 








Gly 


Gly 


C^s 


Lys 


Pro 


Ala 


Val 


Tie 


Leu 


Met 


Asp 


Met 


Asn 


Phe 


Scr 


Leu 




50 










55 










60 










Ser 


Thr 


Ser 


Gly 


Arg 


Glu 


Gly 


Leu 


Glu 


Leu 


Leu 


Glu 


Lys 


Met 


Gin 


He 


65 










70 










75 










80 


Phe 


Thr 


Ser 


Cys 


Pro 


Val 


He 


Leu 


Met 


Thr 


Ala 


Trp 


Ala 


Ser 


He 


Pro 










85 










90 










95 




Leu 


Ala 


Val 


Glu 


Gly 


Met 


Arg 


Leu 


Gly 


Ala 


Phe 


Asp 


Phe 


He 


Gly 


Lys 








100 










105 










110 






Pro 


Tcp 


Asp 


Asn 


Asp 


Arg 


Leu 


Leu 


Arg 


Thr 


He 


Asp 


Thr 


Ala 


Leu 


HlG 






lit 










120 








12S 








Leu 


Ala 


Ala 


Pro 


Ser 


Ala 


Val 


Ala 


Asn 


Pro 


Ser 


Glu 


Gin 


Ser 


Asp Arg 




130 










135 










140 










Asp 


The 


Ala 


Arg 


Gin 


Pro 


Lys 


Ala 


Thi- 


Val 


Gin 


Glu 


Asn 


Asp 


Pro 


Cys- 


145 










150 










J 55 










160 


Ala 


Mis 


He 


He 


Gly 


Arg 


S-r 


Asp 


Aia 


He 


Cys 


Lysr 


-He 


Lys 


Glu 


Arg 










165 










170 










175 




He 


Arg 


Arg 


He 


Ala 


Pro 


Thr 


Hl.T 


Ala 


Ser 


Val 


Leu 


He 


Thr 


Gly Glu 








180 










185 










150 






Ser 


Gly 


Thr 


Gly 


Lys 


Glu 


Leu 


He 


Ala 


Glu 


Ala 


Leu 


His Arg Gly 


Ser 






l?o 










200 










205 








Lys 


Arg 


Ala 


Ser 


Ala 


Pro 


Phe 


Vai 


Lys 


Val 


Asn 


Leii 


Gly 


Gly 


He 


Pco 
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210 










213 










:»20 










GI It 


Ser 


Leu 


Phe 


Glu 


Ser 


Glu 


Leu 


Phe 


Gly 


Hia 


Lys 


Lys 


Gly Ala 


Phe 


225 










230 










235 










240 


Thr 


Asn 


Ala 


Phe 


Ser 


Asp 


Arg 


Lys 


Gly Arg 


PhG 


Glu 


Leu 


Ala 


Asp 


Gly 










245 










250 










255 




Gly 


Thr 


rie 


Phe 


Leu 


Asp 


Glu 


He 


Gly 


Glu 


Leu 


Pro 


Val Gly Asn Gin 






260 










265 










270 






Val 


Lys 


Leu 


Leu 


Arg 


Val 


Leu 


Gin 


Glu 


Gin 


Thr 


Phe 


Glu 


Pro 


Leu Gly 




275 










200 










285 








G)u 


Ser 


Val 


Ser 




Arg 


Val 


Aftp 


I]e 


Arg 


Val 


Val 


Ser 


Ala 


Thr 


Asn 




290 










295 










300 










Ala 


Sbi 


Leu Glu Arg 


Met 


Val 


Ala 


Glu 


Gly Arg 


Phe 


Axg 


Glu 


Aj?p 


Lou 


305 










310 










315 










320 


Tyr 


Tyr 


Arg 


He 


Asn 


Leu 


He 


His 


Leu 


His 


Leu 


Pro 


Pro 


Leu 


Arg 


Glu 








325 










330 










335 




Ary Gin 


Glu 


A.-P 


lie 


Gin 


Leu 


Leu 


Val 


Glu 


Ala 


Phe 


Ser 


Glu 


Ala 


Phe 








34G 










345 










350 






Ala 


Gla 


Ser 


Asn 


Gly Leu 


Pro 


His 


Ala 


Val 


Trp 


Ser 


AlA 


Glu 


Ala 


Met 






355 










360 










365 








Ary 


Arg 


lie 


Cys 


Ala 


Met 


Pro 


Leu 


Pro 


Gly ?isn 


Val 


Arg 


Glu 


Leu 


Lys 




370 










375 










380 










Asn 


Val 


Val 


Glu 


Arg 


Thr 


Leu 


Leu 


Leu 


Ser 


Gly 


Ser 


Arg 


Glu 


He 


Ser 


365 










390 










395 










4 00 


Ala 


Arg 


Aap 


Val 


Ala 


Asp 


Phe 


Gly 


Sftr 


Gin 


Val 


Thr 


Ala 


Ala 


Asp 


His 










405 










410 










415 




S*r 


Asp 


Glu 


Arg 


Ala 


Leu 


Thr 


Asp 


Met 


Glu 


Glu 


Ala 


Ala 


He 


Arg 


Glu 






420 










42S 










430 






Thr 


Leu 


Thr 




Tyr 


Asn 


Gly 


Asn 


Val 


Ser 


Arg 


Ala 


Ala Ajrg 


Ala 


Leu 






435 










440 










445 








Gly Leu 


£»er 


Arg 


Ala 


Ala 


Leu 


Tyr 


Arg 


Arg 


Met 


Glu 


Lys 




Gly 


Leu 




4 50 










455 










460 











(2) IMFORHATIOM FOR SEQ ID ZJO:409 

(i) SEQUEUCE aiARACTERISTICS: 

(A) LEMGTH: 250 amino acids 
IBi TYPE: amino acid 
(DI TOPOLOGY: linear 

fll) MOLECULE TYPE: protein 

(iii) HVPOTHETICAl,: YES 

(Vl) ORIGIllAL SOURCE: 

(A) ORGANISH: Porphyroroonas glngivalis 

(ix) fEATURE: 

(A) I W IE/ KEY: mi£'c_f eature 
(B» LOCATIOfi 1. . .250 



Ixi) SEQUEIICE DESCRIPTION: SEQ ID HO: 4 09 



Het 


Leu 


Lys 


He 


Lye 


Asn 


Leu 


His 


Ala 


Thr 


Val 


Gin 


Gly Lys 


Glu Tie 


1 








5 










10 










15 


Leu 


Lys 


Gly 


He 


Asn 


Leu 


Glu 


He 


Asn 


Ala 


Gly 


Glu 


He 


His 


Ala He 








20 










25 










30 




Het Gly 


Pro 


Asn 


Gly 


Ser 


Gly 


Lys 


Ser 


Thr 


Leu 


Ser 


Ser 


Val 


Leu Val 






35 










40 










45 






Gly 


His 


Pro 


Ser 


Phe 


Glu 


Val 


Thr 


Glu 


Gly 


Glu 


Val 


Thr 


Phe Asn Gly 




SO 










55 








60 








He 


Asp 


Leu 


Leu 


Glu 


Leu 


Glu 


Pro 


Glu 


Glu 


Arg 


Ala 


His 


Leu 


Gly Leu 


65 










70 










75 








90 


Ph^ 


Leu 


Ser 


Phe 


Gin 


Tyr 


Pro 


Val 


Glu 


He 


Pro 


Gly Val 


Ser 


Het Val 










85 










90 










95 


Asn 


Phe 


Met 


Arg 


Ala 


Ala 


Vai 


Asn 


Glu 


His 


Arg 


Lys 


Ala 


He 


Gly Ala 








100 










105 










110 


G)ii 


Pro 


Val 


Ser 


Ala 


Ser 


Afp 


Phe 


Leu 


Lys 


Met 


Wot 


Arg 


Glu 


Lys Ary 






J 15 










120 










125 






Ala 


11% 


Val 


Glu 


Leu 


Asp 


Asn 


Lys 


Leu 


Ala 


Ser 


Acg' 


-Ser 


Val 


Asn Glu 




130 










135 










140 








Gly 


Phe 


Ser 


Gly 


Gly 


Glu 


Ly^ 


Lys 


Arg 


Asn 


Glu 


He 


Phe 


Gin 


Het Ali 


145 










150 










155 








160 


Met 


Leu 


Glu 


Pro 


Lva 


Leu 


Ala 


He 


Leu 


Asp 


Glu 


Thr 


Aep 


S»r 


Gly Leu 










} 65 










170 










175 


Asp 


He 


A=p 


Ala 


Lou 


Arg 


He 


Val 


Ala 


Gly 


Gly 


Val 


Asn Arg 


Leu Arg 
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180 






185 










1 90 






Pro Glu A^n 


Ala 


Ala 


lie Val He 


Thr 


HI.T 


Tyr 


Gin 


Arg Leu 


Leu 




195 






200 








205 






Glu 


Tyr 11a Lys 


Pro 


Aep 


Phs Val His 


Val 


Leu 


Tyr 


Lyf 


Gly Arg 


Tic 




210 






215 






220 








Val 


Lys Ser Gly 


Gly 


Ala 


Gla Leu Ala 


Le»u 


Thr 


Leu 


Glu 


Glu Ly^ 


Gly 


226 






230 






235 








240 


Tyr 


Asp Trp lie 


Lys 


Glu 


Glu He Gly 


Glu 






















250 












(2) 


IMFOWIATIOH 


FOR 


SEQ 


ID MO: 410 















til SEQUENCE CHARACTERISTICS : 

(A) LEJIGTH: 4 61 amino acid.-s 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(11) MOLECULE TYPE: protein 
liii) H-^POTHBTICAL: YES 

(vi) ORIGIWAL SOURCE: 

(A) ORGA)JISl-l: Porphyronvonas ginc:ivalis 

lix) FEATURE: 

tA> HAllE/KEY: micc_fBatur% 

(B) LOCATIOM 1 , , .461 



(Mi) SEOUEHCE DESCRIPTIOH: SZQ ID HO: 4 10 





AU 


Lys 


Glu 


I>ys 


Thr 


He 


Tyr 


Vol 


Cys 


Arg 


Ser 


Cys 


Gly 


The 


Lys 


1 








5 










10 










IS 




Tyr 


Ala 


Lys 


Trp 


Gin 


Gly 


Atn 


Cys 


Asn 


Ala 


Cys 


Gly 


Glu 


Tcp 


Asn 


Cys 








20 










25 










30 






lie 


Asp 


Glu 


Glu 


Lys 


Val 


Pro 


Ala 


Pro 


Ala 


Ser 


Gly 


Lys 


His 


Ala 


Ala 






35 










40 










45 








L-ys 


Ser 


Phe 


Met 


Pro 


Arg 


Glu 


Gin 


Asp 


Asn 


Arg 


Pro 


Arg 


Leu 


Leu 


Gin 




50 










55 










60 










As'p 


Val 


Glu 


f?Rr 


Gly 


Asp 


Glu 


Glu 


Arg 


He 


Arg 


Leu 


Gly Asp 


Glu 


Glu 


65 










70 










75 










90 


Phe 


Asp 


Arg 


Val 


Leu 


Gly 


Gly 


Gly 


He 


Val 


Lys 


Gly 


Ala 


Phe 


Val 


Leu 










05 










90 










95 




Leu 


Giy 


Gly 


Glu 


Pro 


Gly 


He 


Gly 


hys 


Ser 


Thr 


Leu 


He 


Leu 


Gin 


Thr 








100 










105 










110 






Val 


Leu 


Arg 


Leu 


Pro 


Gin 


Leu 


Arg 


Thr 


Leu 


Tyr 


Val 


Sor 


Gly 


Glu 


Glu 






lis 










120 










125 








Ser 


Ala 


Arg 


Gin 


Leu 


Lys 


Met 


Arg 


Ala 


Glu 


Arg 


Leu 


Gly 


Gin 


Ala 


Met 




130 










135 










140 










Asn 


Gly 


Cys 


Tyr 


Val 


Tyr 


Cys 


Glu 


Thr 


Asn 


He 


Glu 


Arg 


He 


Leu 


Ser 


145 










150 










155 










160 


Arg 


Ala 


Glu 


Glu 


Leu 


Thr 


Pro 


Asp 


Leu 


Leu 


Val 


He 


Asp 


Ser 


He 


Gin 










165 










170 










175 




Thr 


Val 


Tyr 


Thr 


Glu 


Glu 


Met 


Glu 


Ser 


Ser 


Ala 


Gly 


Ser 


Val 


Gly 


Gin 








180 










105 








190 






He Arg 


Glu 


Cys 


Ala 


Ala 


Leu 


Leu 


Leu 


Lys 


Tyr 


Ci's 


Lys 


Thr 


Thr 


Gly 






195 










200 










205 








He 


Pro 


Val 


He 


Val 


He 


Gly 


His 


He 


Thr 


Lys 


Glu 


Gly Ser 


He 


Ala 




210 










215 










220 










Gly 


Pro 


Lys 


Val 


Lsu 


Glu 


His 


He 


Val 


Asp 


Thr 


Val 


Leu 


Leu 


Phe 


Asp 


225 










230 










235 










240 


Gly Asp 


Lye 


His 


His 


T.eu 


Tyr 


Arg 


He 


Leu 


Arg 


Gly 


Gin 


Lys 


Asn 


Arg 










245 










250 










255 




Tyi: 


Gly 


Ser 


Thr 


Ser 


Glu 


Leu 


Gly 


Tl- 


Tyr 


Glu 


Met 


Arg 


Gin 


Asp 


Gly 








260 










265 










270 






Leu 


Arg 


Gly 


Val 


Glu 


A^n 


Pro 


Ser 


Glu 


Miff 


Leu 


He 


Thr 


Arg 


Asn 


Arg 






2-7 5 










280 










285 






Glu 


hep 


L-u 


Ser 


Gly 


He 


Ala 


Hq 


Ala 


Val 


Ala 


Met 


Glu 


Gly 


He 


Arg 




290 










295 










300 








Pro 


lie 


Leu 


He 


Clu 


Ala 


Gin 


Ala 


Leu 


Val 


Ser 


Seif" 


Ala 


He 


Tyr 


Ala 


305 










310 










315 








32 0 


Asn 


Pro 


Gin 


-Arg 


Ser 


Ala 


Thr 


Giy 


Phe 


Asp 


He 


Arg 


Arg Met 


Asn 


Met 










325 










330 










335 




Leu 


Leu 


Ala 


Val 


Leu 


Glu 


Lys 


Arg 


Ala 


Gly 


Phe 


Lys 


Leu 


He 


Gin 


Lys- 








340 










345 










350 






Asp 


Val 


Phe 


Leu 


Asn 


He 


Ala 


Gly 


Gly 


He 


Lys 


He 


Ala 


A*p 


Pro 


Ala 
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355 360 365 



Thr 


Asp 


Leu 


Ala 


Val 


Tic 


Ser 


Ala 


Val 


Leu 


Ala Ser 


Ser 


Leu 


Asp 


Tie 




370 










375 








360 








Val 


He 


Pro 


Pro 


Ala 


Val 


Cys 


Met 


Thr 


Gly Glu Val 


Gly 


Leu 


Ser 


Gly 


385 










390 










39£ 








400 


Glu 


Il« 


Arg 


Pre 


Val 
405 


Sec 


Arq 


He 


Glu 


Gin 
410 


Arg He 


Thr 


Glu 


Ala 
415 


Arq 


Arg 


He 


GlV 


Phe 


Lys 


Glu 


He 


Leu 


Val 


Pro 


Ala Asp 


Asn 


Phe 


Arg 


Gin 






4?.0 










425 








4 30 






Glu 


Asp 


Alo 


Gly Arg 


Phft 


Gly 


He 


Arg 


Leu 


Val Pro 


Vdl 


Arg 


Lys 


Val 






4J5 










440 








445 








Glu 


Glu 
450 


Ala 


Phe 


Arg 


His 


Lbu 
455 


Phe 


Ser 


Lys 


Gly Arg 
4 60 


Glu 









(2) IHFORIIATIOH FOR 3EQ ID NO: 4 11 

(i> SEQUEMCE CHARACTERISTICS: 

(Al T.RNGTH: 271 amino acids 
(B) TYPE: amino acid 
(D) T0POLO*3Y: linear 

(iiJ llOLFa.fLE TYPE: protein 

( ii i ) Hi POTHETICAL : YES 

(vil ORIGIMAL SOURCE; 

(AI ORGANISM: Porphyromona « gingivalis 

llX) FEATt-rRE: 

(A) IJAHF./KEY: mi sc_ feature 

(B) LOCATIOtI 1- . .271 



(xl) SEQUE^4CE DESCRIPTIOM: SEQ ID HO: 411 
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ID HO: 412 

















H) SEQUEilCE CHARACTERISTICS*. 

<A) LEMGTH: 417 and no acids 
(Bl TYPE: amino acid 
(D) TOPOLO^jY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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(ii) nOLKCULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A> ORGAWISM: Porphyromonas gingival Is 

{ix» FEATURE: 

(A) HAHE/KEY: mlsc feature 
IB) LOCATIOM 1. . .4T7 



{xil SKQl.lE(/CE DESCRIPriOJ4: SEQ ID HO: 412 



Met 


Ser 


Thr 


Asr. 


He 


Asp 


Va 1 


Gin 


Gin 


He 


Lys 


Glfi 


Arq 


Phe Cly 


He 


1 








5 










10 










15 




lie 


Gly 


Ssr 


Ser 


Tro 


Leu 


Met 


Glu 


His 


Ala 


Tl » 


Arg 


Val 


Ala 


Ala 


Gin 








20 










25 










30 






Val 


Ala 


Pro 


Thr 


Asp 


Met 


Ser 


Val 


Leu 


Val 


Thr 


Gly 


Glu 


Scr Gly 


Ser 
















iQ 










45 








Giy 


Lys 


Glu 


Phs 


Phe 


Pro 


Gin 


He 


ll'i 


His 


Tyr 


Tyr 


Ser 


Ala 


Arg 


Lys 




50 










55 










60 










His 


His 


Sec 


Tyr 


He 


Ala 


Val 


Asn 


cys 


Gly 


Ala 


He 


Pro 


Glu 


Gly 


Thr 


65 










70 










75 










80 


lie 


Asp 


Ser 


Glu 


Leu 


Phe 


Gly 


His 


Arq 


Lys 


Gly 


Ser 


Phe 


Thr 


Gly 


Ala 










85 










90 










95 




Val 


Ssr 




Arq 


Lys 


Gly 


Tyr 


Phe 


Glu 


Glu 


Ala 


b'er 


Gly Gly Thr 


He 








100 










105 










110 






Phe 


Leu 


Asp 


Glu 


Val 


Gly 


Glu 


Leu 


Pro 


Leu 


Pro 


Thr 


Gin 


Ala 


Arg 


Leu 






115 










120 










125 








Leu 


Arg 


Val 


Leu 


Glu 


Thr 


Gly 


Glu 


Phe 


He 


Pro 


Val 


Gly Ala 


Sor 


Gin 




130 










135 










140 










Ser 


GXn 


Lyf 


Thr 


Asp 


Val 


Arg 


He 


Val 


Ala 


Ala 


Thr 


Asn 


Val 


Asn 


Leu 


145 








150 










155 










160 




Glu 


Ais 


Val 


Aid 


Asn 


Gly 


Lys 


Phe 


Arg 


Glu 


Asp 


Leu 


Phe 


Phe 


Arg 










16b 










170 










175 




Leu 


Asn 


Thr 


VaI 


Pro 


He 


Glu 


Vai 


Pro 


Ala 


I.eu 


Arg 


l!et 


Arg 


Pro 


Asp 








180 










105 










190 






Asp 


Val 


Pro 


Leu 


Leu 


Phe 


Arg 


Arg 


Phe 


Ala 


Ala 


Asp 


Ser 


Ala 


Glu 


Lys 






195 










200 










205 








Tyr 


Arg 


net 


Pro 


Pro 


Leu 


Arg 


Leu 


Ser 


Afp 


Glu 


Ala 


Axg 


Thr 


He 


Leu 




210 










215 










220 










rtet 


Arq 


Tyr 


Arg 


Trp 


Pro 


Gly 


Asn 


Val 


Arg 


Glu 


Leu 


Arg Asn 


He 


Thr 


225 










230 










235 










240 


A^p 


Arg 


Leu 


Scr 


He 


Leu 


Glu 


Glu 


Glu 


Arg 


Thr 


Val 


Ser 


Ala 


Glu 


Thr 










245 










250 










255 




Hp 


Thr 


Arg 


ryr 


Leu 


Asp 


Ala 


Gl\i 


Gly 


Met 


Gin 


Asp 


Leu 


His 


Pro 


Val 








260 










265 










270 






Val 


He 


Arg 


Arg 


Asn 


Glu 


Thr 


Thr 


Glu 


Ala 


Asp 


L'yS 


Gin 


He 


Pro 


His 






275 










280 










285 








Tyc 


Glu 


Arq 


Glu 


He 


lift 


Tyr 


Gin 


Val 


Leu 


Tyr 


Afip 


Het 


Lye 


Lys 


Glu 




290 










295 










300 










lie 


Ala 


Asp 


Leu 


Lys 


Giy 


Met 


Het 


Asn 


Arg 


Leu 


Ala 


His 


His 


Glu 


Gin 


305 










310 










315 










320 


Pro 


Ser 


Trp 


Pro 


Val 


Gly 


Ser 


Aap 


Val 


Trp 


Gly 


J\sn 


Asp Asp Lys 


Arg 










326 










330 










335 




The 


Ala 


Asp 


pro 


Lv*^ 


Trp 


Gly 


Val 


Ser 


Thr 


His 


Lyr 


Ala 


Pro 


He 


Ala 








340 










345 










350 






Asn 


Ala 


Ala 


Glu 


Pro 


Val 


Glu 


Pro 


He 


Gin 


Glu 


Ala 


Ssr 


Glu Tyr 


Thr 






355 










360 










365 








Glu 


Asp 


Pro 


Val 


Ser 


Leu 


Glu 


Glu 


Val 


Gin 


Lys 


Lys 


Met 


He 


Ser 


Leu 




37 0 










375 








380 










Ala 


Leu 


Glu 


Arq 


His 


Gly 


Gly 


Arg 


Arg 


Lys 


Glr> 


Thr 


Ala 


Glu 


Glu 


Leu 


385 










390 










395 










400 


Lys 


He 


Ser 


Glu 


Arg 


Thr 


L«u 


Tyr 


Arg 


Lys 


He 


Lys 


Glu 


Tyr Gly 


Leu 










405 










410 










415 




Glu 

































(2 J ItlFOWtATION FOR SEQ ID HO: 413 

(ij SEQUENCE CHARACTERISTICS: 

I A) LENGTH; 602 amino acids 
(B) TYPE: amino acid 
ID) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: pcotein 
(lii) HYPCn-HETICAL; YES 

5 

(vl) ORIGINAL SOURCE: 

(A) ORGANISM: Po rphyromonas gingivalis 

iix) FRATURE: 
10 (A) NA;>IE/KEY: tnlsc feature 

(B) LOCATIOH 1 - . .602 

ixi) SEQUENCE UESCRIPTIOK: SEQ ID MO: 413 
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12) IHFORt lATIOIJ FOR SF,Q ID HO: 414 

tl) SEQUEIICE CHARACTERISTICS: 

(A} LF.NGTH: 4 43 amino a cider 
(S) TYPE: amino acid 
( D J TOPOLOG V : linear 

(ii) rrOLECULK TYPE; protein 

(ill) ir/ ?OTHE'riCAL; YES 

tvi) ORIGIMAL SOURCE: 

(A) OROAIIISH: Porphyromonas gingival is 

(iX> FF.ATURE: 

(A) NAIIE/KEY: misc_f eature 

(B) LOCATION 1. . .44 3 



SEQUEHCE I>ESCRIPTIOH: SEQ ID lIO:414 
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10 



15 



20 



25 



45 



50 



55 



60 



65 



70 



75 



305 








310 










315 








320 


Pro 


Glu 


A5n 


Thr Gly 
32b 


Lys 


lie 


lie 


Gly 


Gly 
330 


Gly 


Pro 


Met 


Met 


Gly Arg 
335 


Ala 


Leu 


Leu 


Ser Pro 
340 


Asp 


Val 


Pro 


Val 
345 


Thr 


Lye 


Gly 


Scr 


Ser 
350 


Gly Val 


Leu 


lie 


Leu 


Asp Acq 


Glu 


Glu 


Ala 


Val 


Arg 


Lys- 


Pro Met Arq 


Asp Cys 






3SS 








360 










365 






IJc* 


Arg 

370 




Ala Lys 


Cys 


Val 
375 


Gly 


Val 


cys 


Pro 


380 


Gly 


Lsu 


Asn Pro 


Ala 


Phe 


Lfeu 


Met Arq 


Asp 


Thr 


Leu 


Tyr 


L'/s 


Ser 


Trp 


Glu 


Thr 


Ala Gin 


385 








390 










395 








400 


Lys 


Gly 




Val Vai 
405 


Asp 


Cys 


He 


Glu 


Cys 
410 


Gly 


Ser 


Cys 


Ser 


Phe Thr 
415 


cys 


Pre 


Ala 


Asn Arg 


Pro 


Leu 


Leu 


Afjp 
425 


Tyr 


lie 


Arq 


Gin 


Ala 


Lys Li"s 








420 
















43Q 




Thr 


Val 


Met 
435 


Gly Tie 


Gin 


Arg 


Ala 
440 


Arg 


Lys 


Gin 











(2) IJIFORHATIOfJ FOR SEQ ID HO: 415 

li) SEOUEIJCE CHARACTERISTICS: 

(A) LEI;gTH: 47 9 amJ no acids 
(D) TYPE: amino acid 
(D) TOrOLCHXi: linear 

(li) MOLECULE TVPE: protein 

(iiil HYPOTHETICAL: YES 

30 (vll ORIGINAL SOURCE: 

(A> ORGAMISM: Porphyromonas qingivalis 

tixl FEATURE: 

lA) NAME/KEY: mlsc feature 
35 (bl LOCATION 1...479 

\y.±) SEQUEIICE DESCRIPTIOt-l : SEQ ID HO: 415 



40 



Mftt 


Lys 


Arg 


H« 


Gin 


Leu 


Thr 


Leu 


He 


Ala 


Leu 


Phe 


Ala 


Aia 


Vai 


Ala 


1 








5 










10 










15 




Gly 


Leu 


Val 


Ala 


Gin 


Asn 


Ala 


Tyr 


Glu 


Gly 


Val 


He 


Ser Tyr 


Lys 


He 








20 










25 








30 






Ser 


Leu 


Asp 


Lys 


Thr 


Gly 


Asn 


Lys 


Val 


Val 


Leu 


Asn 


Gly Aia 


Ala 


Asp 






35 










40 










45 








Met 


Ser 


Asn 


L«^u 


Lys 


Leu 


Lys 


Ser 


Thr 


Gin 


Met 


Ho 


He 


Val 


Thr 


Pro 




50 










55 










60 










li- 


Leu 


Arq 


Ser 


Glu 


Asp 


Gly 


Thr 


Ser 


Arq 


Vai 


Glu 


Ph«: 


Pro 


Ser 


Val 


es 










TO 








75 










80 


Val 


He 


Thr 


Gly 


Arq 


Asn 


Arg 


Thr 




Ala 


Leu 


Ly." 


Arg 


Glu 


He 


Aia 










85 










90 










95 






Ser 


Ser 


Ala 


Leu 


Pro 


Gin 


Ala 


Lys 


His 


Ala 


Ala 


Gin 


Tyr 


He 


Arg 








100 










105 










110 






Arg 


Hi£ 


A£n 


Gly 


T.y.o 


Ser 


Glu 


Gin 


Phs 


Ala 


Pb- 


Thr 


Gly Glu 


llis 


Ala 






115 










120 










125 








Tyr 


Ala 


Ser 


Trp 


Met 


Met 


Acp 


Aia 


Lys 


Phe 


Val 


Val 


Arg Glu 


Glu 


Val 




130 










135 










140 










Arg 


Gly 


Cys 


Ala 


Lys 


Cys 


Pro 


Val 


Gly 


Leu 


Ser 


Ser 


Asn 


He 


Val 


Pro 


145 










ISO 








155 










160 


Phe 


Asp 


Pro 


Leu 


Phe 


Asn 


Pro 


Ala 


Glu 


Ala 


Pro 


Tyr 


Leu 


Leu 


Ala 


His 










165 










170 










175 




He 


Thr 


Pro 


Ala 


Glu 


Glu 


Val 


Glu 


Lys 


Gin 


Arg 


Glu 


Ger 


Ser 


Phe Asp 








180 










165 










ISO 






Ala 


Tyr 


He 


Asn 


Phe 


L/s 


Val 


Asn 


Lys 


Ala 


Asp 


Val 


Leu 


Pro 


Glu 


Tyr 






195 










200 










205 








Arg 


Asn 


Asn 


Lys 


Aia 


Glu 


Leu 


Glu 


Lys 


H- 


Lys 


Glu 


Phe 


Val 


Ser 


Thr 




210 










215 








220 










Val 


Lys 


Ala 


Asn 


Pro 


Asn 


Tyr 


Ser 


Val 


Asn 


Lys 


Met 


He 


He 


Glu 


Gly 


225 










230 










235 










240 


Phe 


Ala 


Ser 


Pro 


Glu 


Ala 


Scr 


He 


Aia 


His 


Asn 


Lys 


Ala 


Lsu 


Ser 


Glu 










245 










2S0 










255 




Arq 


Acq 


Ala 


Ly.-. 


Arg 


Leu 


Ala 


Glu 


Glu 


Leu 


Val 


Arg 


Lys 


Tyr 


Gly 


Ly.- 








260 










265 










270 






Thr 


Leu 


Pro 


Asn 


He 


Thr 


Thr 


Glu 


Phe 


Gly 


Gly 


Glu 


Asp Trp 


Lys 


Gly 






275 










280 










285 








Leu 


Lys 


Leu 


Ala 


He 


Glu 


Lys 


Ser 


A&p 


112 




Asp 


Arg 


Asp 


Arg 


Val 
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35 



45 



50 



55 



60 



65 



70 



75 





2 90 










295 








300 












Glu 


lie 


lie 


Asn 


Ser 


Asp 


Lys 


Tyr Ala 


A.?p 


Asp 


Asp 


Ala 


Arg 


Glu 


305 










3J0 








315 










320 


Gin 


Ala 


Leu 


Lys 


Gin 


Leu 


Ser 


Ser 


Tyr Arg 


Tyr 


He 


Leu 


Asp 


Gin 


Tie 










325 








330 










335 




Tyr 


Pro 


Ajrn 


Leu 


Arg 


Arg 


Arn 


The 


He Thr 


Het 


Gly 


Tyr 


He 


Val 


Arg 








34 0 










345 








350 






Arp 


Tyr 


Thr 


Leu 


Glu 


Glu 


Ala 


Arg 


Glu He 


He 


Lys 


Thr 


Al a 


Pro 


Lyr 






355 










360 








365 








Glu 


Leu 


Scr 


Giu 


Ala 


Glu 


ttet 


Tyr 


Arg Val 


Ala 


Met 


Sec 


Tyr 


Pro 


Glu 




370 










375 








380 










Glr 


Ills 


Gin 


Glu 


Arg 


Leu 


Phe 


Ala 


Leu Asn 


Thr 


Thr 


Leu 


Lys 


Tyr 


Phe 


385 










39Q 








395 










400 


Pro 


Glu 


Ser 


Val 


Thr 


Gly Arg 


lie 


Asn Leu 


Ala 


Val 


Ala 


Ala 


Phe 


Asn 










405 








410 










415 




Gly 


Gly 


Asp 


Val 


Gin 


Gin 


Ala 


Tie 


Ala Leu 


Leu 


Ser 


Pro 


lie 


Gin 


Thr 








420 










425 








430 






Giu 


Lys 


Glv 


Val 


Ser 


A^ n 


lie 


Leu 


Gly Ala 


Ala 


Tyr 


Ala 


Arg 


Thr 


Gly 






435 










440 








445 








Asp 




Ala 


Arg 


Ala 


Glu 


Thr 


Phe 


Phe Arg 


Lys 


Ala 


Val 


Ala 


Glu 


Gly 




450 










455 








460 










Asp 


Ala 


Asn 


Ala 


G] n 


Arg 


Asn 


Leu 


Asp Met 


Leu 


Leu 


Gly 


Lys 


Lys 




465 










410 








475 












(2) 


lUFORIIATION 


FOR 


SEQ 


ID NO: 415 















10 



15 



20 



25 

(i) SEQUEUCE CHARACTERISTICS: 

(A) LENGTH: 383 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(iil MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 



Ivi) ORIGIHAL SOURCE: 

(A) ORGAJ'JISM: Porphyromonas gingivalis 



ti>^) FEATURE: 
40 (A) HAl IE/KEY: mlsc_feature 

fBJ LOCATION 1. . .383 



(.xi> SEQLiEHCE DESCRIPTIO^^: SEO ID MO: 4x6 



Met 


Ala 


Glu 


Lys 


Arg 


Asp 


Tyr 


Tyr 


Glu 


Val 


Leu 


Gly 


Val 


Ser 


Lys 


Asn 


1 








5 










10 










15 




Ala 


Thr 


Asp 


Asp 


Glu 


Leu 


Lys 


Lys 


Ala 


Tyr 


Arg 


Lys 


Lys 


Ala 


He 


Gin 








2C 










25 










30 






Tyr 


His 


Pro 


Asp 


Lys 


Asn 


Pro 


Gly 


Asp 


Ly? 


Giu 


Ala 


Giu 


Glu 


Hie 


Phe 






35 










4 0 










45 








Lys 


Glu 


Val 


Ala 


Glu 


Ala 


Tyr 


Asp 


Val 


Leu 


Scr 


Asp 


Pro 


Gin 


Lys 


Arg 




50 










55 










60 










Ser 


Gin 


Tyr 


Asp 


Gin 




Gly 


His 


Ala 


Gly 


Leu 


Gly 


Giy 


Ala 


Ala 


Gly 


65 










70 










7b 










80 


Gly 


Gly 


Phe 


Ser 


Gly 


Gly 


Gly 


Met 


Ser 


Met 


Glu 


Asp 


He 


Phe 


Ser 


Arg 










SS 










90 










95 




Phe 


Gly 


Asp 


Leu 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly 


Phe 


Gly 


Gly 


Phe 


Ser 


Asp 








100 










105 










HO 






Met 


Gly 


Gly 


Gly 


Ser 


Arg 


Arg 


Arg 


Val 


Arg 


Arg 


Giy 


Ser 


Asp 


Leu 


Arg 






115 










120 










125 








Val 


Arg 


Val 


Lys 


Leu 


Ser 


Leu 


Ala 


Asp 


He 


Ser 


hys 


Gly 


Val 


Glu 


Lys 




130 










135 










140 










Lys 


Val 


Lys 


Val 


Lys 


Lys 


Gin 


Val 


Val 


Cys 


Ser 


Lys 


Cys 


Arg 


Gly 


Asp 


145 










150 










155 










160 


Gly 


Thr 


Glu 


Giu 


Ala 


Asn 


Gly 


Lys 


Thr 


Thr 


Cys 


Gin 


Thr 


Cys 


His 


Giy 










16b 










170 










175 




TJir 


Gly 


Val 


Val 


Thr 


Arg 


Val 


Ser 


Asn 


Thr 


Phe 


Leu 


Gly 


Ala 


Met 


Cln 








100 










18 5 










190 






Thr 


Gin 


Ser 


Thr 


Cy.? 


Pro 


Thr 


Cys 


His 


Gly 


Glu 


Gly 


Glu 


He 


He 


Thr 






195 










200 










205 








Lys 


Pro 


Cys 


Ser 


Lys 


Cys 


tys 


Gly 


Glu 


Gly 


Val 


Glu 


Ha 


Gly 


Glu 


Glu 




210 










215 










220 










Val 


He 


Ser 


Phe 


His 


He 


Pro 


Ala 


Gly 


Val 


Ala 


Glu 


Gly 


Met 


Gin 


Met 


225 










230 








235 








240 


Ser 


Val 


Asn 


Gly 


Lys 


Gly 


Atn 


Air. 


Ala 


t'ro 


Arg 


Giy 


Gly 


Val 


Asn 


Giy 
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25 



40 



45 



50 



55 



60 



65 



70 



75 











2^5 










250 








25S 




Afp 


Lsu 


Tl* 




Vai 


He 


Ala 


Glu 


Glu 


Pro 


Asp 


Pro 


Asn 


Leu He 


Arg 






260 










265 










270 


Afn 


Gly 


Airi 


Asp 


Leu 


He 


Tyr 


Asn 


Leu 


teu 


He 


Ser 


Val 


Pro Leu 


Al5 
















280 










205 






He 


hys 


Gly 


Gly 


Ser 


Val 


Glu 


Val 


Pro 


Thr 


Ilfl 


As-p 


Gly 


Arg Ala 


Lys 




290 










2?b 










300 








Il« 


Atg 


lis 


GIu 


Al3 


Gly 


The 


Gin 


Pro 


GJLy 


Lys 


Het 


Leu Arg Leu 


Arg 


305 










310 










315 








320 


Aen 


l.ys 


Gly 


Leu 


Pro 


Ser 


Val 


Asn 


Gly 


Tyr 


Gly 


net 


Gly 


AsD Gin 


Leu 










3?S 










330 








335 




Val 


Asn 


Val 


Asn 


Val 


Tyr 


He 


Pro 


Glu 


Ser 


He 


Asp 


Ala 


Lys Asp 


G]ii 


















345 










350 




'Jin 


Ala 


He 


Ala 


Ala 


Met 


Glu 


Asn 


Ser 


Asp 


Ssr 


Phe 


Lys 


Pro Thr 


Asp 






355 










360 










365 






Ala 


Ala 


Arv 


Lys 


Asp 


1\Q 


A?p 


Lys 


Ly5 


Tyr Arg Glu 




Leu 7\sp 






370 










375 










380 








(21 


IIIFORUATIOII 


FOR 


SEQ 


ID MO: 417 















10 



15 



20 

ti) seouEncE characteristics: 

I A) LKMGTH: 253 amino acids 
iP.) TYPE: amino acid 
(D) TOPOLOGY: linear 

ai) MOLECULE TYPE: protein 

aii) HVFOTHETTCAL; YES 

30 (vl) ORIGINAL SOLIRCE: 

(AJ ORGAMISH: Pocphyroraonas glnqlvalts 

tlx) FEATURE: 

(A) NAl-lE/KEY: miffc_feature 
35 (B) LOCATION 1-..293 





(xll 


} SEQUENCE DESCRIPTION: SEQ ro NO; 417 










Ust 


I.yn 


Lys 


Leu 


He 


Leu 


Ala 


Thr 


Lou 


Gly 


Lsu 


net 


Ala 


He 


Ala 


Met 


1 








5 










10 










IS 




Leu 


Ser 


Cys 


Ser 


Ser 


Asn 


Asn 


Lys 


Asp 


Leu 


Glu 


Asn 


Lys 


Gly Glu 


Ala 








20 










25 










30 






Thr 


Leu 


Lsu 


Val 


Thr 


Phe 


Gly 


Ser 


Ser 


Tyr 


Lys 


Ala 


Pro 


Arq 


Glu 


Thr 






35 










40 










45 








Tyr 


Ala 


Lys 


11% 


Glu 


Lys 


Thr 


Phe 


Ala 


Ala 


Ala 


Tyr 


Pro 


Asp Gin 


Arg 




50 










55 










60 










life 


Ser 


Trp 


Thr 


Tyr 


Thr 


Ser 


Ser 


lis 


He 


Arg 


Lys 


Lys 


Leu 


Ala 


Gin 


65 










70 










75 










80 


Gin 


Gly 


He 


Tyr 


He 


Ai-p 


Ala 


Pro 


Asp 


Glu 


Ala 


L«»u 


Glu 


Lys 


Leu 


Ala 










85 










90 










95 




Arg 


Leu 


GJy 


Tyr 


Lys 


Lys 


He 


Asn 


Val 


Gin 


Ser 


Leu 


His 


Val 


He 


Pro 








100 










105 










110 






Gly Arg Glu 


Tyr 


Asp 


Glu 


Met 


He 


Asp 


Phe 


Val 


Asn 


Lys 


Phe 


Lys 


Ala 






115 










120 










125 








Ala 


His 


Ser 


Asp 


He 


Thr 


Val 


Lys 


Val 


Gly 


Ala 


Pro 


Leti 


Phe Asp 


Thr 




130 










135 










140 










Asp Glu Asp 


Hot 


Arg 


Glu 


Val 


Ala 


Glu 


He 


Leu 


His 


Lys 


Arg 


Phe 


Gin 


145 










ISO 










155 










160 


Gin 


Thr 


He 


Glu 


Li*s 


Gly 


Glu 


Ala 


He 


Val 


Phe 


Met 


Gly His 


Gly 


Thr 










165 










170 










175 




Gl'j 


His 


Ala 


Ala 


Asn 


Asp 


Arg 


Tyr 


Ala 


Arg 


He 


Asn 


Lys 


He 


Met 


Lys 








180 










185 










190 






Asn 


Tyr 


Ser 


Lys 


Phe 


MeL 


He 


Val 


Gly 


Thr 


Val 


Glu 


Ser 


Asp 


Pro 


Ser 






195 










200 








205 






He 


Asn Asp 


V;tl 


He 


Ala 


Glu 


Leu 


Lys 


Glu 


Thr 


Gly 


Ala 


Thr 


Ala 


Val 




210 










215 










220 










Thr 


Met 


Met 


Pre 


Leu 


Met 


Ser 


Val 


Ala 


Gly 


Asp 


His 


Ala 


TLr 


Asn 


A-«p 


225 










230 










235 










240 


UttU Ala Gly 


Asp 


Glu 


Asp 


Asp 


Ser 


Tcp 


Lys 


Thr 


U4U 


Leu 


Thr 


Asn 


Ala 










245 










250 










255 




Gly 


Tyr 


Thr 


Val 


Ser 


He 


Asp 


Lys 


Leu 


Asp 


Asn 


Gly 


Asn 


Phe 


Ser 


Ala 








260 










265 










270 






Leu 


Gly tKsp 


He 


Glu 


Glu 


He 


Arg 


Asn 


He 


Trp 


Lsu 


Lys 


His 


Met 


Ly£ 






275 










280 










285 








Ala 


Tht 


Ser 


Ala 


Arg 
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10 
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25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(2) rMFORllATION FOR SEQ ID HO: 4 18 

(l( SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 356 amino acids 
tai TYPE: amino acid 
[D) TOPOLOGY: linear 

(ii) HOLECULE TYPE: protein 

(iii) ItVPOTHETICAL; YES 

tvi) CRIGIHAL SOURCE; 

<A! ORGAJIISH: Po cphyromonas gingival ij£ 

ax} FEATURE: 

(A) HAHE/KEY; misc feature 
(Bl LOCATIOH 1. . .356 



(xij SEQUEtlCE DtSCRIPTIOIl: SEQ ID HQ: 419 



Itet 


Thr 


Sel' 


Val 


Ser 


HI ff- 


Leu 


Arg 


Thr 


He 


S^r 


Val 


Ala 


Gly 


He 


Leu 


1 








5 










10 










IS 




Ala 


Ala 


Leu 


Gly 


Gly 


Ala 


Val 


Leu 


lie 


T.-u 


Phe 


Gly 


Val 


Asn 


Leu 


Phe 


















25 










30 






Leu 


Gly 


Sfer 


Val 


Ala 


Tie 


Pro 


tiet 


Ser 


Glu 


He 


Phe 


Arg 


His 


Leu 


Phe 






35 










40 










45 








S*r 


Asp 


Acq 


Pro 


Glu 


Gly 


Gly 


Glu 


Ala 


Leu 


Val 


His 


Tyr Asn 


He 


Leu 




SO 










55 










60 










Trp 


Lys 


Ser 


Arg 


Leu 


Pro 


Glu 


Ala 


Leu 


Thr 


Ala 


Ala 


Phe 


Ala 


Gly 


Ala 


65 










70 










75 










80 


Gly 


I.eu 


Sec 


Val 


Ser 


Gly 


Leu 


Gin 


Met 


Gin 


Thr 


Val 


Phe 


Arg 


Asn 


Pro 










85 










90 










95 




Leu 


Ala 


Gly 


Pro 


Ser 


Val 


Leu 


Gly 


He 


Ser 


Sec 


Gly 


Ala 


Ser 


Leu 


Gly 








100 










105 










110 






Val 


Ala 


Leu 


Val 


Val 


Leu 


Leu 


Ssc 


Gly 


Ser 


Leu 


Gly 


Gly 


Val 


Ala 


Leu 






115 










120 










125 








Ser 


Ser 


Leu 


Gly 


Tyr 


Met 


Gly 


Glu 


Val 


Ala 


Met 


Asn 


He 


Ala 


Ala 


Aia 




130 










135 










140 










Val 


Gly 


Ser 


Leu 


Ala 


Val 


Met 


Gly 


Leu 


He 


Val 


Phe 


Val 


Ser 


Thr 


Lys 


145 










150 








155 










160 


Val 


Arg 


Ser 


His 


Val 


Thr 


Leu 


Leu 


He 


He 


Gly 


Val 


Hec 


He 


Gly Tyr 










165 










no 










175 




Vaj 


Ala 


Thr 


Ala 


Val 


He 


Gly 


V«3l 


Phe 


Lys 


Phe 


Pne 


Ser 


He 


Glu 


Glu 








1 80 










183 










150 






Asp 


lie 


Arg 


Ala 


Tyr 


Val 


He 


Trp 


Gly 


Leu 


Gly 


Ser 


Phe 


Ser 


Arg 


Ala 






1S5 










200 










205 








Thr 


Affp 


Ssr 


Gin 


Leu 


Ser 


Phe 


Phe 


Ala 


He 


Leu 


r-iet 


Leu 


He 


Phe 


He 




210 










215 










220 










Pro 


Aid 


Gli' 


Met 


Leu 


Leu 


Val 


Lys 


Gin 


Leu 


Acn 


Leu 


Leu 


Leu 


Leu 


Gly 


225 










230 










235 










240 


Clu 


Ser 


Tyr 


Ala 


Arg 


Asn 


Leu 


Gly 


Leu 


Asn 


Thr Arg Arg Ala 


Arg 


Leu 










245 










250 










255 




Leu 


Val 


11- 


Ser 


Ser 


Ala 


Gly 


Leu 


Leu 


lie 


Ala 


Thr 


Val 


Thr 


Ala 


Tyr 








26Q 










265 










270 






Cys 


Gly 


Pro 


lie 


Gly 


Phe 


Leu 


Gly 


Met 


Ala 


Val 


Pro 


His 


Lou 


Ala 


Arg 






275 










280 










285 








Val 


TJa 


Phe 


His 


Thr 


Ser 


A^p 


His 


Arg 


He 


Leu 


r-fet 


Pro 


Aia 


Thr 


Cys 




290 










295 










300 










Leu 


He 


Gly 


Ser 


Ala 


Leu 


Ala 


Leu 


Phe 


Cys 


Asn 


He 


He 


Ala 


Arg Het 


305 










310 










315 










320 


Pro 


Gly 


Phe 


Glu 


Gly 


Ala 


Leu 


Pro 


Va] 


Afn 


Ser 


Val 


Thr 


Ala 


Leu 


Val 










325 










330 










335 




Gly Ala 


Pro 


He 


He 


Val 


Thr 


Val 


Leu 


Phe 


Arg 


Arg 


Arg Arg 


Phe 


Ly5 








340 










345 










350 






Glu 


Glu 


Thr 


Asp 



























{2) INFORMATION FOR SEQ ID MO: 41 9 

(U SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 757 amino acido 
iBl TYPE: amino acid 
(D) TOPOLOGY: linear 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




tP*-!* '^~^» ^S^e rf^'lt 



WO 99/29870 



PCT/AU98/01023 



355/490 



(iil MOLECULE TYPE: protein 

(iii) llVPOTHK-riCAL: YES 

(vi) ORIGIHAL SOURCE: 

(A] ORGANISM: Porphyromonas gingival is 

(ix) rEATURE: 

(A) NAIIE/hXr: mlsc_f eature 
{B) LOCATIOll 1, . ,757 



(:;!> SEQt»F,MCE DESCRIPTiOM: SEQ ID HO: 419 



Met 


Arg 


I'Kr 


Lys 


Thr 


He 


Phe 


Phe 


Ala 


He 


He 


Ser 


Phe 


He 


Ala 


Leu 


1 








5 










10 










15 




Leu 


See 


Ser 


Ser 


Leu 


Sor 


Ala 


Gin 


Ser 


Lys 


Al a 


Val 


Leu 


Thr 


Gly 


Scr 








20 










25 










30 




Val 


Ser 


Asp 


Aia 


Glu 


The 


Glv 


Glu 


Pro 


Leu 


Ala 


Gly 


Ala 


Arg 


He 


Giu 






35 










40 










45 








Val 


.. 


Hi? 


Thr 


Asn 


He 


Val 


Ala 


Glv 


Ala 


As 


Ala 


Gly Gly 


His 


Phe 




50 










55 










60 










Glu 


He 


Lys 


Asn 


Leu 


Pro 


Ala 


Gly 


Gin 


His 


Thr 


He 


He 


Cyt- 


Ser 


L"u 


6£ 










70 










75 










80 


Gly 


Gl 


Tyr 


Gly 


Gin 


Lys 


Glu 


Glu 


Val 


Val 


Ala 


He 


Glu 


Ala 


Gly 


Gin 




















90 










95 




Thr 


Uj s 


Thi" 


2io 


Sor 


Phe 


Al a 


Leu 


Arg 


Leu 


A rrt 


Thr 


Ash 


Asn 


Leu 


Glis 








1 00 










1 05 










110 






Glu 


Val 


Val 


Val 


Thr 


GJ V 

y 


Ttir 




Thr 




1 J r 


Arg 


Leu 


Val 


Asp 


Ala 






115 










120 










125 








Pro 


Val 


Ala 


Thr 


Glu 


Val 


Leu 


Thr 


Ala 




ASp 


He 


Ala 


Ser 


Phe 


Ser 




130 










135 










14 0 










Ala 


Pro 


Thr 


Ser 


Glu 


Ala 


Leu 


Leu 


Gin 


Gly 


Leu 


Ssr 


Pro 


Ser 


Phe 


Asp 


145 










150 










155 










160 


Ph'S 


Gly 


Pro 


Asn 


Leu 


Met 


Gly 


Ser 


Phe 


Met 


Gin 


Leu 


Asn Gly 


Leu 


Ser 










165 










170 










17 5 




Ser 




Tvr 


He 




He 


Leu 


He 


Asp 


Glv 


f.vs 


Ara 


Val 


Tyr 


Gly 


Asp 








160 










185 










190 






Val 


Gly 


Gly 


Gin 


Ala 


Asp 


Leu 


Ser 


Arg 


He 


Ser 


Pro 


Asp 


Gin 


He 


Glu 






195 










200 










205 








Arg 


lie 


Glu 


Leu 


Val 


Lys 


Gly 


Ala 


Ser 


Ser 


Ser 


Lsu 


Tyr 


Gly Ser 


Asp 




210 










215 










220 










Ala 


He 


Ala 


Gly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Lys 


Asn 


Thr 


Asn 


Arg 


22E 










230 










235 










240 


Leu 


Scr 


Ala 


Tyr 


Thr 


Ser 


His 


Arg 


He 


Ser 


Lyff 


Tyr 


Asn Asp Arg 


Gin 










245 










250 










255 




Thr 


Asn 


Thr 


Ser 


Leu 


Asp 


He 


A.«tn 


lis 


Gly 


Lys 


Phe 


Ser 


Ser 


Asn 


Thr 








2 GO 










2*55 










270 






A^n 


Tyr 


Phe 


Phe 


Tyr 


His 


Thr 


Asp 


Gly 


Trp 


Gin 


Asn 


Ser 


Pro 


Phe 


Glu 






275 










280 










285 








lie 


Lys 


Lys 


Lys 


I.ys 


Gly 


Sec 


Gly 


Glu 


Pro 


Val 


Leu 


Glu 


Glu 


Thr 


Tyr 




290 










295 










300 












Lys 


Thr 


Phe 


Arg 


Ala 


Gin 


Glu 


Asn 


Gin 


Gly 


Val 


Ser 


Gin 


Ser 


Leu 


3G5 










310 










315 










320 


5«r 


Tyr 


Tyr 


Ala 


Thr 


Asn 


Asn 


Leu 


Ser 


Phe 


Ser 


Gly 


Asn 


Val 


Gin 


Tyr 










325 










330 










335 




Asn 


Lys 


Arg 


Gin 


lie 


Phe 


Thr 


Pro 


Thr 


Phe 


Ser 


Glu 


Lys 


Lys 


Ala 


Tyr 








34C 










345 










350 






Asp 


M^t 


Arrp 


Tyr 


Arg 


Ala 


Leu 


Thr 


Ala 


Ser 


Lou 


Gly 


Thr 


A.'^n 


Tyr 


Leu 






355 










360 










365 








Ph* 


Pro 


Asn 


Gly 


Leu 


His 


Thr 


Lsu 


Ser 


Phe 


Asp 


Ala 


Val 


Tyr Asp 


Arg 




37 0 










37 5 










3no 










Ph^ 


Aiy 


Phe 


Gly 


Tyr 


Leu 


Tyr 


His 


Asp 


Lye 


Atrp 


Ser 


Ser 


Glu 


Ser 


Leu 


395 










390 










395 










4 00 


11- 


nsn 


Asn 


Gin 


Gly 


Gin 


Thr 


Glu 


Gin 


Pro 


Thr 


Phe 


Phe 


Pro 


Gly 


Gin 










4 05 










410 










415 




Leu 


Arg 


Arn 


Lys 


Asn 


Asp 


Gin 


He 


Arg 


Tyr 


Thr 


Ala 


Glu 


Ala 


Arg 


Gly 








420 










425 










430 






Val 


Phe 


Thr 


Leu 


Pro 


Tyr 


Ala 


Gin 


Lys 


Leu 


Thr 


Gly 


Gly Leu 


Glu 


Tyr 






435 










440 










445 






Phe 


Arg 


Glu 


Glu 


Leu 


He 


Ser 


Pro 


Tyr 


A^n 


Leu 


He 


Thr 


Asp 


Lys 


Ala 




■J50 










455 








460 










Asp Ala 


Ser 


Thr 


I>eu 


Ser 


Ala 


Tyr 


Val 


Gin 


A^p 


Glu 


Trp Lys 


Pro 


Leu 


465 










470 




















480 


Asp 


rrp 


Phe 


Asn 


Met 


The 


Ala 


Gly 


Phe 


Arg 


Leu 


Val 


His 


His 


Gin 


Glu 



SXJBSTrrUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 

356/490 



fhe Gly Thr Arg Met Thr Pro Lys Val Ser , 

^ , 500 "-^^ Leu Al3 Lys Tyr Gly 

Pro Leu A^n Phe Arg Ala Thr Tvr . 510 ^ 

S1.S Ala A.n Gly Tyr Ly. Thr Pro Thr 

Leu Lys Glu Leu Phe Ala Aro 1 ^25 

53P Ala Arg Asn Glu Leu Thr Thr Met Gly s-r HI. 

A.-n Leu Tyr Leu Glv Asn - ^ 

^IS ^'-^ r|,° net Ser A.p Tyr 

Ma Leu Gly Le. clu Tyr A.n Gin Gly P^^ xie s.r Phe Ser Ifa 
Thr Val Tyr A.p A.n Glu Leu Arg Asn L-u <= 

Sao ''''^ I-^e Ser Phe Met Asp He 

- s.. P„ I- P e 

Tyr Ala A,„ He Gly x.ys Ai, Arc S « 

H :: :r - .^^^ - - 
z „ : - ^''^ - -V A. 

7 - Ala A.„ ..1 - ^ 

T/r Arg Leu Gly Val Ser L^u Ph- cit a 670 ^ 

Tyr Lye Arp Cly A.-n Ala Pro t ^ 685 ^ ^ 

S-r . ' To^S 

S-sr His Arg Phe Ala c- nu « ^^^^^ 

Gl^v II « ' "''^ 7^^^ '^■^ '^^^ 

Gii lie A«p A.»n Leu yh. Asp Tyr Val As. 1^' . 

7?S ^ ~P '^•■'P '^rg Pro Met Gly Val 

IS" Tyr Ala Thr Va l tk „ ^^0 

lie AT. : " ^" Ala 



10 



15 



20 



25 



30 



55 



60 



65 



70 



75 



^ . lie Arg Phe Acn Asn 750 
755 

(2) INFORI^TION FOR SEQ ID „O:420 

40 SEQUENCE CHARACTERISTICS- 

A LENGTH: 331 amino acid. 
JBJ TYPE: amino acid 
(D) TOPOLOGY: linear 

45 (li) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 
(vi) ORIGINAL SOURCE • 
50 'A, 0RGA;.ISM: Porphyromona. glngivalis 

(ix) FEATURE: 

(A) NAHE/KEY: ml„ f„tur« 

(B) LOCATION 1...331 

SEQUENCE DESCRlPTIOt.: SEQ ,D „O:420 
p X.r A.P .y. ^„ 
- - Gly val 11. .^^^ 
Arg rro Ala Glu Glu Ho Tl^ ^? 30 

Arg Val Ser Ser Lvs Val Pr„ , <S 

S„ Val Pro Gly Ar, rie ty. Glu I^u Arg Val Ser 

Glu Gly «i„ sin Val Gin Ala Gly , t° 

. 70 ^ '•^^ Ala Val He Glu Ala 

1 A? aT - '^^ 

; " '^^^ ... Ala Gin Ala ^I? Val Ala 

- T..r ... T.r g1 ^„ 

" - - - - A.. Glu Ala T.r itl G. Ar, A.. Al 



PCT/AU98/01023 



'STTTUTE SHEET (Rule 26) (RO/AU) 



iP'-'h ir«ii JH-'™ iF=lt 

lUi "t'ji^ 1:3 



& Bi 'iuii o id Q %ai 



wo 99/29870 PCT/AU98/01023 

357/ 490 



35 



45 



50 



55 



60 



65 



70 



75 



145 










150 










155 










160 


Ala 


Tie 


Ala 


Thr 


Gin 


Lys 


Ala 


Ala 


Glu 


Ala 


Gin 


Tyr 


Af n 


Met 


Ala 


Arg 










] 65 










no 










175 




Ash 


Gly 


Ai- 


Glu 


Arg 


Glu 


A5Jp 


Lys 


Leu 


Ala 


Ala 


Ser 


Ala 


Leu 


Vol 


A^Fp 








180 










IBS 










190 






Arg 


Ala 


Arg 


G]y 


Ala 


Val 


Ala 


Glu 


Val 


Glu 


Ser 


Tyr 


He 


Asn 


Glu 


Thr 




195 










200 










205 








Tyr 


Leu 


IlE 


Ala 


Pro Arg 


Ala 


Gly 


Glu 


Val 


Ser 


Glu 


He 


Phe 


Pro 


Ly.s 




210 










215 










220 










Ala 


Cly 


Glu 


Leu 


Val 


Gly 


Thr 


Gly 


Ala 


Pro 


He 


Met 


Asn 


He 


Ala 


Glu 


225 










230 










235 










240 


Met 


Gly 


Afp 


Mot 


Trp 


Ala 


Ser 


Phe 


Ala 


Val 


Arg 


GTu 


A5P 


Phe 


Leu 


Ser 










215 










250 










255 




Scr 


Met 


Thr 


H*t 


Gly 


Ala 


Val 


Leu 


Glu 


Thr 


Val 


Val 


Pro 


Ala 


Leu 


Asn 


















265 










2-fO 






Glu 


Glu 


Lys 


Vsl 


Arg 


Phe 


Lys 


lie 


Thr 


Phe 


He 


Lyr 


Asn 


net 


Gly 


Thr 






2T5 










280 










285 








Tyr 


Ala 


Ala 


Trp 


Lys 


Ala 


Thr 


Lys 


Thr 


Thr 


Gly 


Gin 


Tyr 


A^p 


Leu 


Lys 




290 




















300 










Thr 


Phe 


Glu 


Val 


Lys 


Ala 


Thr 


Leu 


Ala 


Afp 


Lyf 


Aep 


Lys 


Ala 


Gin 


Lys 


305 










310 










315 










320 


Leu 


Arg 


Tro 


Gly Met 


Ser 


Val 


Tie 


lie 


Arg 


Lyff 




















325 










330 















10 



15 



20 



25 12) III FORI iATION FOR SF.Q ID MO: 421 

ti) SEQUEMCE CHARACTERISTICS: 

(A} LEIIGTH: 261 amino atlds 
IE) TYPE: amino acid 
30 tD) TOPOLOGY: linear 

tii) MOLECULE TYPE: protein 

{iii> HYPOTHETICAL: YES 



<vil ORIGIHAL SOURCE: 

(AJ ORGAtiia-'f: Po rphyromonas gingival is 



(ixj FEATURE: 
40 (AJ IWIE/KEY: miac feature 

(BJ LOCATION 1. . .267 



(xU SEQUENCE DESCRIPTIOM: SEQ ID NO: 421 



Met 


Arq 


He 


Val 


Ser 


Asn 


Phe 


Leu 


Phe 


Val 


Ser 


Phe 


Ser 


Val 


Leu 


Leu 


1 


















10 










15 




Phe 


Ala 


Ser 


Cys 


Arg 


Ser 


Gin 


Arg 


Glu 


Lys 


Val 


Val 


Tyr 


Leu 


Gin 


Asp 








20 










25 










30 






He 


Gin 


Thr 


Phe 


Asn 


Arg 


Glu 


He 


He 


Ala 


Lyr 


Pro 


Tyr 


Asp 


Val 


Lys 






35 










40 










45 








He 


Glu 


Lys 


A5p 


Asp 


Val 


Leu 


Asn 


He 


Leu 


Val 


Ser 


Scr Arg 


Asp 


Pro 




50 










55 










60 










Glu 


Leu 


Ser 


Thr 


Pro 


Tyr 


Asn 


Gin 


Val 


Leu 


Thr 


Thr 


Arg Ala 


Leu 


Ala 


65 










70 










75 










SO 


Arg 


Asn 


Gly 


Tyr 


Gly 


Thr 


Asn 


Ser 


Asn 


Glu 


Gly 


Phe 


Leu 


Vul 


Asp 


Ser 










85 










90 










95 




Lys 


Gly 


Tyr 


He 


Asn 


Tyr 


Pro 


He 


Leu 


Gly 


Gin 


He 


Tyr Val 


Glu 


Gly 








100 










105 










110 






Leu 


Thr 


Arg 


Thr 


Glu 


Leu 


Glu 


Lys 


Glu 


He 


Gin 


Lyis' 


Arg 


He 


He 


£*r 






1 15 










120 










125 








Ser 


Gly 


Phe 


He 


Lys 


Asp 


Pro 


Thr 


Val 


Thr 


Val 


Gin 


Leu 


Gin 


Asn 


Fhe 




130 










135 










140 










Lys 


Val 


Ser 


Val 


Leu 


Gly 


Glu 


Val 


Asn 


His 


Pro 


Gly 


Ser 


Met 


Ser 


Val 


145 










150 










155 










160 


Ly^ 


Gly 


Glu 


Arn 


He 


Thr 


Leu 


Leu 


Glu 


Ala 


He 


Gly 


Met 


Ala 


Gly 


Aj*p 










165 










170 










175 




Leu 


Thr 


He 


Tyr 


Gly 


Arg 


Arg 


Asp 


Arg 


Val 


Phe 


Val 


He 


Arg 


Glu 


Thr 








180 










185 










190 






Asp 


Gly 




Arg 


Glu 


Val 


Phe 


Gin 


Thr 


Asp 


Leu 


Arg 


Lys 


Ala 


Asp 


Leu 






195 










200 










205 








Leu 


Ala 


Ser 


Pro 


Val 


Tyr 


Tyr 


Leu 


His 


Gin 


Ai-n 


Asp 


Val 


He 


Tyr 


Val 




210 










215 










220 










Glu 


Pro 


Asn 


AJp 


Ly5 


Lys 


Thr 


Gin 


Met 


Ser 


Glu 


He 


Asn 


Gin 


Asn 




225 










230 










235 










2 4 J 


Asn 


Val 


Asn 


Val 


Trp 


f.CLI 


Ser 


Val 


Thr 


Scr 


Thr 


Leu 


Val 


Ser 


He 


Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



na-. rfi ir^ii -iF^n tJ''^ 




wo 99/29870 



PCT/AU98/01023 



358/490 



245 250 
Thr Leu Thr lis Thr lie He Asp Lys Thr Ly^ 
260 265 



255 



12} ItlFOFUlATION FOR SEQ ID tiO:422 

(i) SEQUEMCE CHARACTERISTICS: 

(Ai LENGTH: 569 amino acids 
IB) TiVt: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECU'LE TYPE: protein 

(lU) HYPOTHLTICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAKTSt!: Porphycomonas gingival! s 

(ix) FEATURE: 

(A) IWIE/KEY: misc feature 

(B) LOCATIOH 1. , .569 

SEQL'EHCE DtrSCRIPTIOH: SEQ ID NO: 422 



net 


Lys 




Thr 


Asn 


Lcii 


Phe 


Leu 


Ser 


Leu 


Leu 


Val 


He 


Phe 


He 


Thr 


1 








5 










10 










15 




'3Ly 


S?.Z 


Fhe 


Met 


Thr 


Ala 


Cys 


Ala 


Gin 


Lys 


Ser 


Lys 


Thr 


Asn 


LyS 


Lsu 








20 










25 








30 






Thr 


Glu 


Glu 
3^ 


Afp 


Arg 


Scr 


Arg 


Aflfn 
40 


Glu 


Tyr 


Val 


Gin 


Ser 
45 


Het 


Asp 


Val 


Leu 


Ser 


Asn 


11- 


He 


Gly 


A^n 


Va 1 


Arg 


Leu 


Tyr 


Phe 


Val 


Asp Tht 


He 




50 










55 










60 










Ser 


He 


Lys 


Hi£ 


Hst 


Thr 


Arg 


Arg 


Gly 


lie Asp 


Ala 


Met 


Leu Gly Gly 












70 










75 










flO 


Leu 


Asp 


Pro 


Tyr 


Thr 

85 


Glu 


Tyr 


He 


Pro 


Tyr 
90 


Glu 


Glu 


Het 


Asp 


Glu 
95 


Leu 


Lys 


Leu 


tiet 


Thr 


Thr 


Gly 


Glu 


Tyr 


Ala 


Gly Vol 


Gly Ala 


He 


He 


Ser 








100 










105 










110 






Gin 


Arg 


Pro 


Asp 


Ser 


Ala 


Val 


He 


He 


Gin 


Arq 


Pro 


Met 


Glu 


Gly Het 






lis 










120 










125 








Pro 


Ala 
130 


Asp 


Glu 


Ala 


Gly 


Leu 

135 


He 


Ala 


Gly 


Asp 


Arg 
140 


He 


Leu 


Thr 


He 


Asp Gly Lys 


Asp 


Phe 


Arg 


Lys 


Scr 


Thr 


Thr 


Pro 


Lys 


Val 


Ser 


Gin 


Ala 


145 










150 










155 










160 


Leu 


Lys 


Gly 


He 


AJa 
165 


Gly 


Thr 


Val 


Ala 


tys 
170 


Val 


Thr 


Val 


Met 


Arg 
175 


Tyr 


Gly Glu 


Thr 


Lys 


Pro 


Arg 


Thr 


Phs 


Ser 


Val 


)>ys 


Arg 


Gin 


Lys 


Val 


l\n 








190 










185 










190 






(let 


Af n 


Ser 


Vol 


Thr 


Tyr 


Ser 


Gly 


Met 


Leu Asp 


Gly 


Sei: 


He 


Gly 


Tyr 
















2 GO 










205 






lie Arg 


Leu 


Asn 


Asn 


Phe 


Thr 


Asp 


Lys 


Ser 


Ala 


Glu 


Glu 


Val 


Arg 


Thr 




210 










215 










220 








Ala 


Leu 


Leu 


Asp 


Leu 


Arg 


Asp 


Lys 


Gin 


Giy Ala 


Lyc 


Gly 


Leu 


He 


Leu 


225 










230 










235 










240 


Asp 


Leu 


Arg 


Gly 


Asn 
24 5 


Gly 


Gly 


Gly 


Leu 


Het 
250 


Gin 


Ala 


Ala 


He 


Glu 
255 


He 


Val 


Asn 


L&u 


Phe 


Val 


Pro 


Lys 


Giy 


Lys 
265 


Glu 


val 


Val 


Thr 


Thr 


Lys 


Giy 








260 


















270 






Arn 


He 


Ala 

275 


Glu 


Ser 


Ala 


Ssir 


Val 
280 


Phe 


Arg 


Thr 


Leu 


Thr 

285 


Glu 


Pro 


He 


Ai-p Thr 


h'/s 


l.f*U 


Pro 


He 


Val 


Val 


Leu 


Ilc- 


Asp 


Gly 


Gin 


Ser 


Ala 


Ser 




2 90 










295 










300 










Ser 


Ser 


Glu 


He 


Val 


Ala 


Gly 


Ala 


Leu 


Gln 


Asp 


Met 


Aap 


Arg 


Ala 


Val 


30S 










310 










315 










320 


Leu 


Met 


Gly 


Gin 


Lys 


Ser 


Tyr 


Gly 


Lys 


Gly 


Leu 


Val 


Gin 


Thr 


Thr 


Arg 










325 










330 










335 


Gin 


Lsu 


Pro 


Tyr 


Asn 


Gly 


Vol 


He 


Lys 


Leu 


Thr 


Thr 


Ala 


Lys 


Tyr Tyr 








3^0 










345 










350 






He 


Pro 


Ser 


Gly 


Arg 


Cys 


He 


Gin 


Arg 


Leu 


A^p 


Tyr Sec 


Arg 


Thr 


Asn 






355 










360 










365 








Arg 


Thr 


Gly 


Met 


Ala 


Thr 


Ala 


He 


Pro 


Asp 


Ser 


L«u 


His 


Lys 


He 


Phe 




370 










375 








380 








Tyr 


Thr 


Ala 


Ala 


Gly 


Arg 


Arg 


Val 


Glu 


Asp 


Ala 


Gly 


Gly 


He 


Leu 


Pro 


385 










390 










395 










400 


Asp 


He 


Glu 


Val 


Lyff 


Gin 




Thr 


Ala 


Ala 


The 


Leu 


Leu 


Tyr 


Tyr 


Met 
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405 










410 










4 15 




Al a 


lie 


Am 


Asn 


Asp 


Vai 


Phc 


Asp 


Phe 


Val 


Thr 


Gly 


Tyr 


Val 


Leu 


Lys 








420 










425 










430 






Hi£ 


hys 


TJir 


lie 


Ala 


Lys 


Pro 


Glu 


Asp 


Pile 


Ser 


lie 


Thr 


Ann 


Ciu 


Asp 






435 










440 










445 








T\'r 


Ala 


Ala 


Phe 


Cys 


Lys 


Het 


Met 


Glu 


Glu 


Lys 


Lys 


Phe Asp Tyr 


Asp 




450 










455 










460 










Arg 


(slu 


Sec 


Gly 


Lys 


Met 


L«u 


Asp 


Lys 


Leu 


Glu 


Glu 


Leu 


Ala 


l.ys 


lie 


465 










470 










475 










480 


Glu 


Gly 


Tyr 


Leu 


Pro 


Glu 


Ala 


Asn 


Ser 


Glu 


Leu 


Lys 


Ala 


Leu 


Arg 


Glu 










4fl5 










4 90 










495 




Lys 


Leu 


Lys 


Pro 


Af n 


Lsu 


Ser 


Arg 


Asp 


Leu 


Leu 


Ary 


Phe 


Lys 


l.ys 


Glu 








500 










505 










510 






lie 


Thr 


Asn 


ryr 


Leu 


Asn 


Asn 


Glu 


lie 


Val 


Thr 


Arg 


Tyr 


Tyr 


Tyr 


Glu 






515 










520 










525 








Arg 


Gly 


Sor 


Tie 


Arq 


Gin 


Ser 


Leu 


Pro 


Glu 


A.<fp 


Lys 


Val 


Val 


Lys 


Glu 




530 










535 










540 










Ala 


lie 


Ly5 


Leu 


Leu 


Lys 


Arp 


His 


Pro 


Glu 


Gin 


He Arg 


Gin 


lie 


I.-u 


S45 










530 










555 










560 


Ala 


Ala 


Pro 


Lys 


Ala 


Glu 


Asn 


Lys 


Gly 






















565 
























IirroRMATION 


FOR 


SEQ 


ID lJO:42 


:3 

















U! SEQ<JEUC£ CHARACTERISTICS: 

(A) LENGTH: 981 amino acids 
(R) TYPE: amino acid 
tD) TOPOLOGY: linear 



(li) MOLECULE TTt'PE: protein 

(iii) JTiTFOTHETICAL: YES 

<V1 ) ORIGINAL SOURCE: 

(A) ORGAHISM: Porphyromonas gingival Is 

<ix| FEATURE: 

(AJ HAHE/KEY: nvisc^f eature 
tB) LOCATIO^^ 1. . . 981 



(Xi) SEQUEHCE DESCRIPTION: SEQ ID NO: 4 23 



tlet 


Gin 


Asn 


Lys 


Gly 


Phe 


Val 


He 


Val 


He 


Thr 


Ser 


Ala Leu 


Ala 


He 


1 








5 










10 








15 




lie 


Cys 


Ala 


Phe 


Tyr 


Leu 


Ser 


Phe 


Ser 


Phe 


Val 


Thr 


Asn Arg Tyr 


Glu 








20 










25 








30 






Lys 


Lys 


Ala 


Lys 


Ala 


net 


Gly 


Asp 


Val 


Ala 


Gly 


Het 


Ala Tyr 


Lsu 


Asp 






35 










40 










45 






Ser 


Itet 


Sec 


Asn 


Giu 


Lys 


Val 


Trp 


Phe 


Gly 


Tyr 


The 


Leu Lys 


Glu 


Ala 




50 










55 










60 








Gin 


Ala 


Gin 


Gin 


He 


Gly 


Leu 


Gly 


Leu 


Asp 


Leu 


Lys 


Gly Gly 


Het 


Asn 


€5 










70 










7S 








BO 


Val 


lie 


Leu 


Lys 


Leu 


Asn 


Ala 


Ser 


Asp 


Leu 


Leu 


Arg 


Asn Leu 


Ser 


Asn 










85 










90 








95 




Lys 


Ser 


Leu 


Asp 


Pro 


Asn 


Phe 


Asn 


Lys 


Ala 


Leu 


Giu 


Asn Ala 


Ala 


Lys 








100 










105 








110 






Ser 


Thr 


Glu 


Gin 


Ser 


Asp 


Phe 


He 


Asp 


He 


Phe 


Val 


Lys Glu 


Tyr 


Arg 






115 










120 










125 






Lys 


l.-u 


Asp 


Pro 


Asn 


Gly 


Arg 


Leu 


Ala 


Val 


He 


Phe 


Gly Ser 


Gly 


Asp 




130 










135 










140 








Ltu 


Arg 


Asp 


Gin 


He 


Thr 


Ala 


Lys 


Ser 


The 


Asp 


Ala 


Asp Val 


Val 


Arg 


145 










150 










155 








160 


Thsu 


Leu 


Lys 


Glu 


Lys 


Tyr 


T\f?.n 


Ser 


Ala 


Val 


Glu 


Ala 


Scr Phe 


Asn 


Val 










165 










170 








175 




Leu 


Arg 


Ala 


Arg 


He 


Asp 


Ala 


Phe 


Gly 


Val 


Val 


Ala 


Pro Asn 


Leu 


Gin 








180 










185 








190 






Arg 


Leu 


Glu 


Gly 


Gin 


Gly 


Arg 


He 


Leu 


Val 


Glu 


Leu 


Pro Gly Val 


Lys 






195 










200 










205 






Asp 


Pro 


Glu 


Arg 


Val 


Arg 


Thx 


Leu 


Leu 


Cln 


Arg 


Ser 


Ala Asn 


Leu 


Gin 




210 










215 








220 








Phe 


Trp 


Arg 


Thr 


Tyr 


Lys 


Phe 


Glu 


Glu 


Val 


Ser 


Gly 


Asp Leu 


He 


Ala 


225 










230 










235 








240 


Ala 


Asn 


Asp 


Arg 


Leu 


Ser 


Glu 


Leu 


Ala 


net 


Asn 


Asn 


Thr Asp Ala 


Thr 










245 










250 








255 




Pro 


Glu 


Thr 


Glu 


Pro 


Ala 


Thr 


Thr 


App 


Ser 


Val 


Ala 


Ala Thr 


Ala 


Arp 
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260 



265 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 





Ala 


Ala 


Va 1 


Gin 


Ala 


Val 


Ala 


Ann 


S<* r 


Ala 


The 


Val 


Ala 


Gin 








275 










28 0 










285 










Ala 


Lys 


Af p 


Ala 


Thr 


Arg 


Lys 


Asp 


Ala 


Leu 


Phe 


Ser 


Leu 


r e ( 
.eu 


Thr 














2 95 










300 










Pco 


a I 


Ajso 


Arg 


fzi -if 

tjiy 


Giy 


Ala 


Va 1 


Val 


Gl y 


Vai 


Ala 


Arg Arg 


rvia 


Asn 


305 










310 










315 










320 


Met 


Ala 




Tie 




Glu 




Leu 


Gin 




Ala 


His 


A^p 


Leu 


Lys 


Val 










325 




















335 






Arq 


t»lu 


n 


Val 


Leu 


Fhe 




Trp 


Gly 


Ala 


Lys 


Ala 


He 


Glu 


Asp 








340 










34 5 










350 






Pro 


Glu 


Thr 


T.ys 


Lys 


Glu 


Thr 


Asp 


Leu 


Tyr 


Glu 


Leu Tyr Ala 


He 


Arg 






355 










360 










365 








The 


Asn 


Arg 


Thr 


Gly 


Asp 


Pro 


T^p 


Leu 


Gly 


Gly 


Asp 


Val 


Val 


Thr 


Scr 




37 0 










375 










380 










Ala 


Lys 


Sor 


Asp 


He 


Gin 


Asn 


Asp 


Phe 


Gly 


Arg 


Ser 


Glu 


Pro 


He 


Val 


385 










390 










395 










4 00 


Sftr 


He t 


Thr 


( le t 


Asn 


Glu 


Glu 


Gly 


Al =» 


Arg 


Lys 


Trp 


Ala 


Arg 


He 


Thr 










4 OS 










410 










415 




Lys 


Asp 


Aen 


Va 1 


Gly 


Arg 


Ala 


He 


Ala 


He 


Val 


Leu Asp Gly 


Vai 


Val 








420 










425 










430 






Tyr 


Ser 


Ala 


Pro 


Asn 


Viil 


Asn 


Asp 


Glu 


He 


Thr 


Gly 


Gly 


Arg 


Ser 


Gl n 






4 35 










4 4 0 










445 








Tie. 

lie 




Gly 




Phe 


Thr 




Glu 


Glu 


■ 

Alci 


Gly 


TKsp 


Leu 


Ala 


Asn 


Val 
























460 












Asn 


Ser 


Gly 


Lys 




Asp 


Aia 


Thr 






He 


Glu 


Gin 


Glu 


Asn 


4 65 










4 ' 0 










47 5 










4 80 


Val 


He 


Gly 


Pro 


Thr 


Leu 


Gly 


Ala 


Glu 


Sec 


He 


Lys 


Ala 


Gly 


Phe 


LeU 










4 85 








4 90 










495 




5er 


Phe 


Leu 


Leu 


Ala 


Leu 


Val 


Tift 


Leu 


Het 


Cys 


Tyr 


Met 


Cys 


Leu 


Ala 








500 










505 










510 






Tyr 


Gly 


Phc 


Leu 


Pro 


Gly 


Leu 


He 


Ala 


Asn 


Gly 


Ala 


Leu 


He 


Val 


Asn 






515 










520 










525 








Ssr 


Phe 


Phe 


Thr 


Leu 


Gly 


Vai 


Leu 


Ala 


Ser 


Phe 


His 


Ala 


Vai 


Leu 


Thr 




530 










535 










540 










Leu 


Ser 


Gly 


He 


Ala 


Gly 


Leu 


Val 


Leu 


Thr 


Leu 


Gly 


Met 


Ala 


Val 


Asp 


545 










550 










555 










560 


Ala 


Asn 


Val 


Leu 


He 


Phe 


Glu 


Arg 


He 


Lys 


Glu 


Glu 


Leu Arg Ala 


Gly 










565 










570 










575 




Lys 


Thr 


Pro 


Tie 


Arq 


Al a 


Val 


Thr 


Asp 


Gly 


Tyr 


Gly Asn Ala 


Phe 


Ser 








580 










585 










590 






Ala 


He 


Phe 


Asp 


Ser 


Asn 


Va 1 


Thx 


Thr 


He 


He 


Thr 


Gly He 


He 


Leu 






S?5 










600 










605 








Phe 


I.su 


Tyr 


Gly 


Thr 


Gl y 


Pro 


He 


Arg 


Gly 


Phe 


Ala 


Thr 


Thr 


Leu 


He 




610 










615 










620 










lie 


Gly 


Leu 


lis 


Ala 


Ser 


Phe 


He 


Thr 


Ala 


Val 


Phe 


Leu 


Thr 


Arg 


He 


62 5 










630 










635 










64 0 


Val 




Glu 


Lys 


Leu 


Ala 


Lys 


Lys 


•Gl y 


Arg 


Leu 


Asp 


Lys 


He 


Thr 


Phe 










64 5 










650 










6S5 




Thr 




er 


He 


Thx 




fln 


Leu 


Leu 


Val 




Pro 


Ser 


Tyr 


Asn 


H * 








660 










665 










67 0 






Leu 


Gly 


hys 


Arg 


Lys 


Thr 


Giy 


Phe 


He 


He 


Pro 


Val 


He 


He 


He 


Val 






675 










680 










685 








Leu 


Gly 


Leu 


He 


Ala 


Ser 


Phe 


Thr 


He 


Gly 


Leu 


Asn 


Arg 


Gly 


He 


Glu 




690 










655 










700 










Phe 


Ser 


Gly 


Gly 


Arg 


Asn 


Tyr 


Val 


Val 


Lys 


Phe 


A.9P 


Gin 


Pro 


Val 


Ser 


70ij 










710 










715 










720 


Ser 


Glu 


Ala 


Val 


Arg 


Ser 


Ala 


Leu 


Ser 


Ser 


Pro 


Leu 


Gin 


Glu 


Lys 


Val 










725 










7 30 










735 




Leu 


Val 


Thr 


Ser 


He 


Gly 


Thr 


Glu 


Gly 


Thr 


Glu 


Vai 


Arq 


He 


Ser 


Thr 








740 










745 










750 






Asn 


•fyr 


Lyfc 


He 


Gin 


Glu 


Glu 


Ser 


Glu 


Glu 


TlM 


Glu 


Ala 


Glu 


He 


Thr 






755 










760 










7 65 








Asp 


Ly^ 


Leu 


Tyr 


Gin 


Ser 


Leu 


Lys 


Gly 


Phe 


Tyr 


Thr 


Gin 


Gin 


Pro 


Thr 




770 










77ii 










780 










Ala 


Asp 


Gin 


Phe 


Leu 


A5p 


Asn 


He 


He 


Ser 


Ser 


Gin 


Lys 


Val 


Ser 


Pro 


785 










790 










795 










800 


Ser 


Met 


Ser 


Ser 


Asp 


He 


Thr 


Arg 


Gly 


Ala 


He 


Trp 


Ala 


Val 


Leu 


Leu 










BOS 










810 










815 




Ser 


Met 


He 


Phe 


Met 


Ala 


He 


Tyr 


He 


Leu 


He 


Arg 


Phe 


Arg Asp 


He 








820 










825 










830 






S-r 


Phe 


Ser 


Ala 


Gly 


Val 


Phe 


Val 


Ser 


Val 


Ala 


Ala 


Thr 


Thr 


Phe 


Cys 






835 








840 










845 








He 


He 


Ala 


Leu 


Tyr 


Ala 


Leu 


Leu 


Trp 


Lys 


He 


Leu 


Pro 


Phe 


Thr 


Met 



850 855 860 
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lie 


Asp 


Gin 


Asn Phe Tie 


Ala 


Ala 


lie 


Leu 


Ala 


He 


Ho Gly Tyr 


865 








870 








875 






880 


Ser 


Lou 


A£n 




Thr Val Val 


Val 


Phe 


PiSp 


Arg 


He 


Acq 


Glu Thr Het 










ess 






990 








995 


Lys 


Leu 


Tyr 


Pro 


Asn Arg Asp 


Arq 


Tyr 


Gin 


Val 


Il« 


Asn Asp Ala Leu 








900 






905 










910 


Asn 


Ser 


Thr 


Leu 


Gly Arg Thr 


Leu 




Thr 


Ser 


Leu 


Thr 


Thr Phe He 






915 






920 










925 




Val 


Met 


Leu 


Val 


lie Phe lie 


Phe 


Gly 


Gly Ala 


Thr 


Het 


Arg Ser Phe 




930 






93S 










940 




The 


Phe 


Ser 


lie 


Leu Leu Gly lie Val 


He 


Gly 


Thr 


Tyr 


Ser Thr Leu 


945 








950 








955 






960 


Phe 


Val 


Ala 


Thr 


Pro Leu Ala 


Tyr 


Glu 


He 


Gin 


Lys 


Arg 


Lys Leu Asn 










965 






97 0 








97 5 


tys 


Ala 


Ala 




Ly£ 

















980 



(2) lUl-ORHATIOH FOR SEQ ID MO: 4 2-1 

(1) SEQUENCE CHARACTERISTICS: 

(Ay LENGTH: 1017 amino acid^ 
(Bl TYPE; amino acid 
(0) TOPOLOGY: linear 

Ui) IJOLECULE TYPE: protein 

(ill) Hi POTHETICAL : YES 

(vl) ORIGIHAL SOURCE: 

(A) ORGAUISII: Porphyromona^ gingiva iia 

(ix) FEATURE: 

(A) HAHE/KEY: ml5C_f eature 

(B) LOCATIOIf 1.,.1017 

txl} SEQUENCE DESCRIPTIOH: SEQ ID MO; 424 



Met 


Lys 


Arg 


Met 


Thr 


Leu 


Phe 


Phe 


Leu 


Cys 


Leu 


Leu 


Thr 


Ser He 


Gly 


1 








5 










10 








IS 




Trp Ala 


Het 


Ala 


Gin 


Asn 


Arg 


Thr 


Vai 


Lya 


Gly 


Thr 


Val 


He Ser 


Ser 








20 










25 










30 




Glu 


Asp 


Asn 


Glu 


Pro 


Leu 


He 


Gly 


Ala 


Asn 


Val 


Val 


Val 


Val Gly Asn 






35 










40 










45 






Thr 


Thr 
50 


He 


Gly 


Ala 


Ala 


Thr 
55 


Asp 


Leu 


A,«p 


Gly 


Asn 
60 


Phe 


Thr Leu 


Ser 


Val 


Pro 


Ala 


Asn 


Ala 


Lya 


Met 


Leu 


Arg 


Val 


Ser 


Tyc 


Ser Gly Met 


Thr 


65 










70 










75 








80 


Thr 


Lys 


Glu 


Val 


Ala 
85 


He 


Ala 


Asn 


Val 


Met 
90 


Lys 


He 


Val 


Leu Afp 

95 


Pro 


A^p 


Ser 


Lys 


Val 


Leu 


GJ u 


Gin 


Val 


Val 


Val 


Leu 


Gly 


Tyr Gly Thr 


oly 








100 










105 










HO 




Gin 


Ly^ 


Leu 
115 


Ser 


Thr 


Val 


Ser 


Gly 
120 


Ser 


Val 


Ala 


Lys 


Val 
125 


Ser Scr 


Glu 


Lys 


Leu 


Ala 


Glu 


Lys 


Pro 


Val 


Ala 


Asn 


He 


Het 


Asp 


Ala 


Leu Gin Gly 




130 










135 










140 








Gin 


Val 


Ala 


Gly 


Met 


Gin 


Val 


Met 


Thr 


Thr 


Ser 


Gly 


Asp 


Pro Thr 


Ala 


145 










150 










155 








160 


Val 


Ala 


Ser 


Val 


Glu 


He 


His 


Gly 


Thr 


Gly 


Ser 


LC2U 


Gly Ala Ser 


ser 










165 










170 








175 




Ala 


Pro 


Lsu 


Tyr 


He 


Val 


Asp 


Gly 


Met 
185 


Gin 


Thr 


Ser 


Leu 


AsD Val 
190 


Val 


Ala 


Thr 


Met 
195 


Asn 


Pro 


Asn 


Asp 


Phe 
200 


Glu 


Ser 


Met 


Ser 


VaJ 
205 


Leu Lys 


Asp 


Ala 


Scr 


Ala 


Thr 


Ser 


He 


Tyr 


Gly 


AlA 


Arg 


Ala 


Ala 


Asn 


Gly Val 


Val 




210 










215 








220 








Phe 


He 


Gin 


Thr 


Lys 


Lys 


Gly 


Lys 


Met 


Ser 


Glu 


Arg 


Gly 


Arg He 


Thr 


225 










230 










235 








2^0 


Phe 


Asn 


Ala 


Ser 


Tyr 
245 


Gly 


He 


S-er 


Gin 


He 

250 


Leu 


Asn 


Thr 


Lyr Pro 
255 


Leu 


Asp Aen 


Het 


Met 


Thr 


Gly 


Asp 


Glu 


Leu 


Leu 


Asp 


Phe 


Gin 


Val Lys 


Ala 








260 










265 










27 G 




Gly 


Phe 


Trp 


Gly 


Asn 


Asn 


Gin 


Thr 


Val 


Gin 


Lys 


Val 


Lyff Asp Het 


He 






2V5 










280 








285 






Leu 


Ala 
290 


Gly 


Ala 


Glu 


Asp 


Leu 
295 


Tyr 


Gly 


Asn 


Tyr 


Asp 
300 


Ser 


Leu Lys 


Asp 
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Olu Tyr r.Xy Lys Thr Leu Phe Pro Val Asp Phe Asn His Asp Ala A^p 
305 210 315 320 

Trp Leu Lyt' Ala Leu Phe Lys Thr Ala ?ro Thr Scr Gin Gl y Asp lie 

325 330 335 

Ser Ph!r ser Gly Gly Ser Gin Gly Thr Ser Tyr Tyr Ala Scr lie Gly 

340 34S 350 

Tyr Phe Asp Gin Glu Gly Het Ala Arg Glu Pro Ala Asn Phe Lys Ary 

355 360 365 

Tyr Ser Gly Arg Leu Asn Phe Glu Ser Arg lie Arn Glu Trp Lftu Lys 

370 375 380 

Vsl Gly Ala Ajsn Leu Ser Gly Ala lie Ala Asn Arg Arg Ser Ala Asp 
385 390 39S 400 

Tyr Phe Gly Lya- Tyr Tyr Met Gly Ser Glv Thr Phe Gly Val Leu Thr 

405 410 415 

l-Wl Pro Arg Tyr Tyr Asn Pro Pho Asp Val Asn Gly Asp Leu Ala Aep 

420 425 430 

Val Tyr Tyr Met Tyr Gly Ala Thr Arg Pro Set HeL Thr C] ii Pro Tyr 

435 440 4^15 

Phe Ala Lys Met Arg Pro Phe Ser Ser Glu Ser His Gin Ala Asn Val 

450 455 4€0 

hsn Gly Phe Ala Gin lie Thr Pro lie Lys Gly Leu Thr Leu I.y.-? Ala 
465 470 475 480 

Gin Ala Gly Val Acp lie Thr A«n Thr Arg Thr Ser Ssr Lys Arg Het 

4 85 4 50 4 95 

Pro Asn Asn Pro- Tyr Asp Ser Thr Pro Leu Gly Glu Ara Arg Glu Arg 

500 i,Q5 ' 510 

Ala Tyr Ary Asp Val Ser Ly^ Ser Phe Thr Asn Thr Ala Glu Tyr Lyf 

515 520 525 

Pho Ser lie Asp Glu Lye His Asp Leu Thr Ala Leu Met Gly His Glu 

530 535 540 

Tyr lie Glu Tyr Glu Gly Asp Val lie Gly Ala Ser Ser Lys Gly Phe 
545 560 555 560 

Glu Ser Asp Lys Leu Met Leu Leu Ser Gin Gly Ly^ Thr Gly Asn Ser 

565 570 575 

Leu Ser Lsu Pro Glu His Arg Val Ala Glu Tyr Ala Tyr Leu Ser Phe 

580 565 590 

Phe Ser Arg Phe Asn Tyr Gly Phe Asp Lys Trp Met Tyr lie Asp Phe 

595 600 605 

Sfcr Val Arg Asn Asp Gin Ser Scr Arg Phe Gly Ser Asn A&n Arg Ser 

610 615 620 

Ala Trp Phe Tyr Ser Val Gly Gly Met Phe Asp He Tyr Asn Lys Phe 
625 630 535 640 

lie Gin Glu Ser Asn Trp Leu Ser Asp Leu Arg Leu Lys Het Ser Tyr 

64S 650 655 

Gly Thr Thr Gly Asn Ser Glu He Gly Asn Tyr Asn His Gin Ala Leu 

660 665 670 

Val Thr Val Asn Asn Tyr Thr Glu A5:p Ala Met Gly Leu Ser He Sor 

675 680 685 

Thr Ala Gly Asn Pro Aep Leu Ser Tcp Glu Lys GVn Ser Gin Phe Asn 

690 695 700 

Phe Gly Leu Ala Ala Gly Ala Phe Asn Ajn Arg Leu Ser Ala Glu Val 
706 710 ^15 720 

Asp Phe Tyr Viil Arg Thr Thr Asn Asp Het Lou He A?p Val Pro Het 

725 730 735 

Pro Tyr He Ser Gly Phe Phe S«r Gin Tyr Gin Asn Val Gly Ser Met 

740 745 7b0 

Lys Asn Thr Gly Val Asp Leu Ser Leu Lys Gly Thr He Tyr Gin Asn 

755 760 765 

Lys Asp Trp Asn Val Tyr Ala Ser Ala Asr. Phe Asn Tyr Acn Arg Gin 

770 775 700 

Glu He Thr Lys Leu Phe Phe Gly Leu Asn Lyn Tyr Met Leu Pro Asn 
785 790 795 800 

The Gly Thr He Trp Glu He Gly Tyr frc Asn Ser Pht* Tyr Met Ala 

805 810 815 

Glu Tyr Ala Gly He Asp Lys Lys Thr Gly Lys Gin Leu Trp Tyr Val 

820 825 830 

Pro Gly Gin Val Asp Ala Asp Gly Asn Lys Val Thr The Ser Gin Tyr 

835 840 845 

Set Ala Asp Leu Glu Thr Arg He Ajjp Lys Ser Val Thr Pro Pro He 

850 855 860 

Thr Gly Gly Phe Ser Leu Gly Ala Ser Trp L/C Gly Leu Ser Leu Asp 
865 870 875 08O 

Ala Asp Phe Ala Tyr He Val Gly Lys Trp Met He Asn A^n Asp Arg 

885 890 895 

Tyr Phe Thr Glu Asn Ala Gly Gly Leu llet Gin Leu Asn Lys A?p Lys 
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900 



20 



Met Leu T,.„ A..n Ala Trp Thr Glu Asp. A:.n lys Glu Thr A.^p Val Pro 

' 920 

Ly. Leu aiy Gin Sex Pro Gin Phe A.p Thr Ml. L.u Leu Glu Asn Ala 

Ser Phc. Leu Arg Leu Ly, A^n Leu Lys L«u Thr lyr Val L,u Pro A-^n 



Ser Leu Phe Ala Gly ^In A:=n Val He Gly Gl" Ala Arg Val Tyr Leu 
Met Ala Aro A.n Le^ Leu Thr Val Thr Lys Tyr Ly. Gly Phe Pro 



lU Met Ala Arn Asn Leu Leu Thr Val Thr * ~ 

Glu Ala Gly Gl'y As.. Val Gly ,^^tl' Gin Tvr Pro A-.„ le'r Ly. Oln 

Tyr Val Al.i Gly n- cin L-u Ser Ph- ^^'^^ 
lO 1010 1015 



(2) IHrORIlATIOH FOR SKQ ID MOM 2 5 

U) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1014 amino acid.^ 

(B) TYFt:: amino acid 
(D) TOPOLOGY: lin-ar 



25 MOLECULE TYPE: protein 

(iii.) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
30 ORGANISM: Pcrphyromonas gingivallc 

( ir. ) FEATURE : 

{A) NAIIE/KEY: mxsc feature 
(B) LOCATIOH 1. . .1014 

'J^i* SEQUENCE DESCRIPTIOH: SEQ ID HO: 425 
Met Thr Leu Phe Phe Leu Cy.. Leu Leu Thr Ser He Gly Trp Ala Het 
^ Ala Gin A^n Arg Thr Val Lys Gly Thr vll He Ser Ser Glu ^^p Asn 
Glu Pro Leu He Gly Ala A.n Val vfl Val Val Gly A.n Thr Thr He 
Gly Ala Ala Thr A.p Leu Asp Gly Asn Phe Thr Leu Ser Val Pro Ala 
45 Ala Ly. H«t Leu Arg Val Ser Tyr Ser Gly Met Thr Thr Ly^ Glu 

Val Ala He Ala A.n Val Het Lys He Val l2u A.-p Pro A-.p Ser Ly. 
val Leu «lu Gin Val Val Val r.u Cly Tyr Gly Thr Gly Gin £y. Leu 



ser Thr V.l Ser Gly ser Val Ala Ly^ V.l Ser S.r Glu ?,y° Leu Ala 



Glu Ly. Pro Val Ala Asn lie He? A.p Ala Leu Gin Gl" Gin Val Ala 

55 Gly Met Gin Val ,..«t Thr Thr Ser Gly Asp Pro T^r Ala Val Ala ser 

155 1 
Val Glu He His Gly Thr Gly Ser Leu Gly Ala Ser Ser Ala Pro ^eu 

Tyr He Val Asp Gly Het Gin Thr Ser tl° Asp Val Val Ala Thr Het 

Asn Pro Asn Asp Phe Glu Ser Met Ser Val Leu Lys Asp Wa Ser Ma 



Thr ser He -.yr Gly Ala Arg Ma" Ala Asn Gly Val Val Phe He Gl„ 



65 Thr Lv:; Lvs G^ r.ir* n-i- fl- - _ 

Thr Phe Asn 

« Tyr Glv He Ser ^In He Leu Asn Thr ^ys' Pro Leu Asp As.. M^t 



Thr Ly. Lys Gly Lys Het Ser Glu Arg Gly Arg He Thr Phe Asn Ala 

230 



250 

Gin Val Lys Ala Gly phe 
Gly Asn Asn Gin Thr Val Gin Lys v" Lys Asp Het He Leu Ala Gly 

- ^ 280 285 



net Thr Gly A=p Glu Leu Leu Asp Ph, Gin Val Lys Ala Gly If^l Trp 



Ala Glu Asp Leu Tyr Gly A,n Tyr Asp Ser Leu Lys Asp Glu Tyr Gly 

75 Ly. Thr Leu Phe Pro Val A.p Phe A-^n His A.p Ala A.p Trp Leu Ly. 
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30 b 










310 










315 










320 


Ala 


Leu 


Phe 


Lys 


Thr 


Ala 


Pro 


Thr 


Ser 


Gin 


Gly 


Acp 


He 


Sec 


Phe 


Ser 










325 










330 










335 




Gly 


Gly 


Ser 


Gin 


Gly Thr 


Ser 


Tyr 


Tyr 


Ala 


Ser 


He 


Gly Tyr 


Phe 


Asp 








340 










345 










JbO 






61 n 


Glu 


Gly 


H«t 


Ala 


Arg 


Glu 


Pro 


Ala 


Asn 


Phe 


I.ys 


Arg 


Tyr 


Ser 


Gly 






355 










360 










365 








Arq 


Leu 


Asn 


Phe 


Glu 


Ser 


Arg 


He 


Asn 


Glu 


Trp 


Leu 


Lys 


Val 


Gly Ala 




370 










37b 










380 










Asn 


Leu 


Sm£ 


Gly Ala 


He 


Ala 


Asn 


Axy 


Arg 


Ser 


Al a 


Asp 


Tyr 


Phe 


Gly 


385 










390 










395 










400 


Lys 


Tyr 


Tyr 


Met 


Gly 


Ser 


Gly 


Thr 


Phe 


Gly 


Val 


Leu 


Thr 


Met 


Pro 


Arg 










405 










•JIO 










415 




Tyr 


Tyr 


Asn 


Pro 


Phe 


Asp 


Val 


Asn 


Gly Asp 


Leu 


Alii Asp val 


Tyr 


Tyr 








420 










425 










430 






(let 


Tyr 


Gly 


Ala 


Thr 


Arg 


Pro 


Ser 


Met 


Thr 


Glu 


Pro 


Tyr 


Phe 


Ala 


Lys 






435 










440 










445 








M*t 


Arg 


Pro 


Ph« 


Ser 


Ser 


Glu 


Ser 


His 


Gin 


Ala 


Asn 


Val 


Asn 


Gly 


Phe 




450 










455 










460 










Ala 


Gin 


He 


Thr 


Pro 


He 


Lys 


Gly 


Leu 


Thr 


Leu 


Lys 


Ala 


Gin 


Ala 


Gly 


46S 










4 70 








4 75 










480 


Vai 


Asp 


He 


The 


Aen 


Thr 


Arg 


Thr 


Ser 


Set 


Lys 


Arg 


Het 


Pro 


Asn 


Asi\ 










485 










4 90 










4 95 




Pro 


Tyr 


Asp 


Ser 


Thr 


Pro 


Leu 


Gly Glu Arg Arg 


Glu 


Arg 


Ala 


Tyr 


Arg 








500 










505 










510 






Asp 


Val 


Ser 


Lys 


Ser 


Pho 


Thr 


Asn 


Thr 


Ala 


Glu 


Tyr 


Lys 


Phe 


Ser 


He 






515 










520 










525 








Asp 


Glu 


Lys 


His 


Asp 


Leu 


Thr 


Ala 


Lc\i 


Met 


Gly 


His 


Glu 


Tyr 


He 


Glu 




530 










535 










540 










Tyc 


Glu 


Gly 


Asp 


Val 


He 


Gly 


Ala 


Ser 


Ser 


Lye 


Gly 


Phe 


Glu 


Ser 


Asp 


54 5 










550 










555 










560 


Lys 


Leu 


Met 


Leu 


Leu 


Ser 


Gin 


Gly 


Lys 


Thr 


Gly 


Asn 


Ser 


Leu 


Ser 


Leu 








565 










570 










575 




Pro 


Glu 




Arg 


Val 


Ala 


Glu 


Tyr 


Ala 


Tyr 


Leu 


Ser 


Phe 


Phe 


Ser 


Arg 








580 










585 










590 






Ph= 


Asn 


Tyr 


Gly 


Phe 


Asp 


Lys 


Trp Met 


Tyr 


He 


Asp 


Phe 


Ser 


Val 


Arg 






595 










600 










605 








Asn 


Asp 


Gin 


Ser 


Ser 


Arg 


Phe 


Gly 


S^ir 


Acn 


Asn 


Arg 


Ser 


Ala 


Trp 


Phe 




610 










615 










620 










Tyr 


Ser 


Val 


Gly Gly Het 


Phe 


Asp 


He 


Tyr 


Asm 


Lys 


Phe 


He 


Gin 


Glu 


625 










630 










635 










640 


Sec 


Asn 


Trp 


Lau 


Ser 


Asp 


Leu 


Arg 


Leu 


Lys 


l-ten 


Ser 


Tyr Gly 


Thr 


Thr 










645 










650 










655 




Gly 


Ast\ 


5er 


Glu 


He 


Gly 


Asn 


Tyr 


Asn 


His 


Gin 


Ala 


Leu 


Val 


Thr 


Val 








660 










665 










67 0 






Asn 


Asn 


Tyr 


Thr 


Glu 


Asp 


Ala 


Het 


Gly 


Leu 


Ser 


He 


Ser 


Thr 


Ala 


Gly 






67 5 










680 










685 








Asn 


Pro 


Asp 


Leu 


Ser 


Trp 


Glu 


Lys 


Gin 


Ser 


Gin 


Phe 


Asn 


Phe 


Gly 


Leu 




650 










695 










'00 










Ala 


Ala 


Gly 


Ala 


Phe 


Asn 


Asn 


Arg 


Leu 


Ser 


Ala 


Giu 


Val 


Asp 


Phe 


Tyr 


7 0S 










710 










715 










720 


Val 


Arg 


Thr 


Thr 


Asn Asp 


Met 


Leu 


He 


Asp 


Val 


Pro 


M«t 


Pro 


Tyr 


He 










725 










730 










735 




Ser 


Gly 


Phe 


Phe 


Ser 


Gin 


Tyc 


Gin 


Asn 


Val 


Gly 


Ser 


Het 


Lys 


Asn 


Thr 








V40 










745 










750 






Gly 


Val 


A^p 


Leu 


Ser 


Leu 


Lys 


Gly Thr 


He 


Tyr 


Gin 


Asn 


Lys 


A-sp 


Trp 






755 










760 










765 








Asn 


Val 


Tyr 


Ala 


Ser 


Ala 


Arn 


Phe 


Asn 


Tyr 


Asn 


Arg 


Gin 


Glu 


He 


Thr 




770 










775 










780 










Lys 


Leu 


Phe 


Phe 


Gly 


Lsu 


Arn 


Lys 


Tyr Met 


Leu 


Pro 


Asn 


Thr 


Gly 


Thr 


795 










7 90 










7 95 










800 


He 


Trp 


Glu 


Tie 


Gly 


Tyr 


Pro 


Asn 


Ser 


Phe 


Tyr 


Het 


Ala 


Glu 


Tyr 


Ala 










805 










810 










815 




Gly 


He 


Asp 


Lys 


Lys 


Thr 


Gly 


Lys 


Gin 


Leu Tro 


Tyr Val 


Pro 


Gly 


Gin 








920 










825 










830 






Val 


Asp 


Ala 


Asp 


Gly 


Asn 


Lys 


Val 


Thr 


Thr 


Ser 


Gin 


Tyr 


Ssr 


Ala 


A-yp 






835 










840 










045 








Leu 


Glu 


Thr 


Arg 


He 


Asp 


Lys 


Ser 


Val 


Thr 


Pro 


Pro 


He 


Thr 


Gly 


Gly 




850 










855 










860^ 










Phe 


Ser 


Leu 


Gly 


Ala 


Sec 


Trp 


Lys 


Gly 


Leu 


Ser 


Leu 


Asp 


Ala 


Asp 


Phe 












a70 










07 5 










880 


Ala 


Tyr 


lit 


Val 


Gly Lys 


Trp 


Met 


He 


Aan 


Asn 


Asp Arg 


Tyc 


Phe 


Thr 










885 










890 










895 




Glu 


Asn 


Ala 


Gly 


Gly 


Leu 


Met 


Gin 


Leu 


Asn 


Lyii 


Asp 


Lys 


Met 


Leu 


Leu 








900 










905 










910 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Asn 


Al- 


Trp 


Thr 


Glu 


Af p 


Asn 


Lys 


Glu 


The 


Asp 


Val 


Pro 


Lys 


Leu 


Gly 






?15 










920 










925 






Gin 


b'er 


Pro 


Gin 


Phe 


Asp 


Thr 


His 


Leu 


Leu 


Glu 


A^n 


Ala 


Ser 


Phe 


Leu 




930 










935 




















Arg 


Leu 




Asn 


Leu 


bys 


Leu 


Thr 


Tyr 


Val 


Leu 


Pro 


A^n 


Ser 


Leu 


Phe 


94 5 




















9SS 










960 


Ala 


•Sly 


Gin 


Asr. 


Vol 


lie Gly Gly 


Ala 


Arg 


Val 


Tyr 


Leu 


Uet 


Ala 


Arg 










5*65 










97 0 










57 5 




Asm 


Leu 


Leo 


Thr 


Val 




hys 


Tyr 


Lys 


Gly 


Phe 


Asp 


Pro 


Glu 


Ala 


Gly 








980 










985 










9S0 






Gly 


Am 


Val 


Gly 


Lys 


Asn 


Gin 


ryr 


Pro 


Aafl 


Ser 


L/f 


Gin 


Tyr 


Val 


Ala 






995 










1000 








1005 






Giy 


He 


Gin 


Leu 




Phe 























12) IMFORItATXON FOR SEQ ID MO: 426 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 amino acids 
tP) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) JJOLECULE riPE: protein 

{iiit iriPOTHETICAL: YES 

( V i ) OR I G3 1 lAL SO'JRCE : 

(A) 0RGA1-JISH: Porphyromonas gingivalii* 

(iyj FEATURR: 

(A) HATIE/KF.Y: nvisc_f eaturt 
|B» LOCATIOt4 1. . .921 



(xi) SEQURNCE DESCRlPTIOt>f : SEQ ID MO: 4 26 



(l2f 


Lys 


Lys 


Lys 


Asa 


Phe 


Leu 


Leu 


Leu 


Gly 


He 


Phe 


Val 


Ala 


Leu 


Leu 


1 








5 










10 










15 




Thr 


PhSf 


lie 


Gly Ser Met 


Gin 


Ala 


Gin 


Gin 


Ala 


Lys 


Asp Tyr 


Phe 


Asn 








20 










25 










30 






Phe 


Asp 


Glu 


Arg Gly Glu 


Ala 


Tyr 


Phe 


Ser 


Phe 


Lys 


Vai 


Pro 


Asp 


Arg 






35 










40 










45 








Aia 


Val 


Leu 


Gin 


Glu 


Leu 


Aia 


Leu 


He 


Hat 


Ser 


He 


Asp 


Glu 


Phe Asp 




50 










55 










60 










Pro 


Val 


Thr 


Asn 


Glu 


Ala 


lie 


Ala 


Tyr 


Ala 


Ser 


Glu 


Glu 


Glu 


Phe 


Glu 


65 










70 










75 










80 


Ala 


Phe 


Leu 


Arg 


Tyr 


Gly 


Leu 


Lys 


Pro 


Thr 


Phe 


Leu 


Thr 


Pro 


Pro 


Ser 










85 




















95 




tlo.t. 


Gin 


Arg 


Ala 


Val 


Glu 


Met 


Phe 


Asp 


Tyr 


Arg 


Ssr 


Gly 


Glu 


Lys 


Tyr 








LOO 










105 










110 






Glu Trp Asn 


Ala 


Tyr 


Pro 


Thr 


Tyr 


Glu 


Ala 


Tyr 


He 


Ser 


Met 




Glu 






115 










120 










125 








Glu 


Phe 


Gin 


Thr 


Lys 


Tyr 


Pro 


Ser 


Leu 


Cys 


Thr 


Thr 


Ser 


Val 


He 


Gly 




130 










135 










140 










Lye 


Ser 


Val 


Lys Asp Arg 


Lys 


Leu 


Het 


He 


Cys 


Lys 


Leu 


Thr 


Ser 


Ser 


145 










150 










155 










160 


Ala 


Asn 


Thr 


Gly Lys 


Lys 


Pro 


Arg 


Val 


Leu 


Tyr 


Thr 


Ser 


Thr 


Met 


His 










165 










no 










175 




Gly 


Asp 


Glu 


Thr 


Thr 


Gly 


Tyr 


Val 


Val 


Leu 


Leu 


Arg 


Leu 


He 


Asp 


His 








180 










185 










190 






t.cu 


Leu 


Ser 


Asn 


Tyr 


Glu 


Ser 


Asp 


Pro 


Arg 


He 


r.y,- 


Asn 


He 


Leu 


Asp 






195 










200 










205 








Lys 


Thr 


Giu 


Val 


Trp 


lie 


Cys 


Pro 


Leu 


Thr 


Asn 


Pro Asp Gly Ala 


Tyr 




210 










21b 










220 










Arg 


Ala 


Gly 


Aen 


His 


Thr 


Val 


Gin 


Gly 


Ala 


Thr 


Arg 




Asn 


Ala 


Asn 


225 










230 










235 










240 




Vsl 


A-p 


L-u 


Asn 


Arg 


Asn 


Phe 


Lys 


Asp 


Asp 


Val 


Ala 


Giy 


Asp 


His 










245 










250 














Pro 


A5P 


Gly 


Lys 


Pro 


Trp 


Gin 


Pro 


Glu 


7x1 a 


Thr 


Aia 


Phe 


list 


Asp 


Leu 








260 










265 










270 




Glu 


Gly 


Ajn 


Thr 


Ser 


Phe 


Val 


Leu 


Gly 


Ala 


AiJl 


He 


His 


Gly 


Gly Thr 






275 










260 










285 








Glu 


Val 


Vai 


Asn 


Tyr 


Pro 


Trp 


Asp 


Asn 


Lys 


Lys 


Glu 


Arg 


His 


Ala 


Asp 




290 










295 










300 








Ajp 


Glu 


Trp 


Tyr 


Lys 


Leu 


He 


Ser 


Arg 


Asn 


Tyr 


Ala 


Ala 


Ala 


Cye 


Gin 


305 










310 










315 








320 
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Ser 


lie 


Sec 


Al a 


Ser 


Tyr 


Met 


Thr 


Ser 


Glu 


Thr 


Asn 


Ser 


Gly 


He 


Ii« 










325 










330 










335 




A?n 


Gly 


Ser 


Asp Trp Tyr 


Val 


He 


Arg 


Gly 


Ser 


Arg 


Gin 


Asp 


Asn 


Ala 








340 










34 5 










350 






Afn 


Tyr 


Phe 


His 


Arg 


Leu 


Arq 


GXu 


He 


Thr 


Leu 


Glu 


He 


Ser 


Asn 


Thi- 






355 










360 










365 








I.yp 


L-u 


Val 


Pro 


Ala 


Ser 


Gin 


Lou 


Pro 


Lys 


Tyr 


Trp 


Asn 


Leu 


Asn 


Lys 




370 










375 










300 










Glu 


Ser 


Leu 


I.ftU 


Ala 


Leu 


He 


Glu 


Glu 


Ser 


Leu 


Tyr Gly He 


His 


Gly 












390 










395 










40C 


Thr 


Val 


Thr 


Ser 


Ala 


Ala 


Asn 


Gly 


Gin 


Pro 


Leu 


Lvs 


Cys 


Gin 


He 


Leu 










405 










410 










415 




Ii« 


Glu 


Asn 


Hi;; 


Asp 


Lys Arg 


Asn 


Ser 


Aerp 


Val 


Tyr 


Ser 


Asp 


Ala 


Thr 


















425 










430 






Thr 


Gly 


Tyr 


Tyr 


Val 


Arg 


Pro 


He 


Lys Ala 


Gly 


Thr 


Tyr 


Thr 


Val 


Lys 






4 35. 










440 










445 








Tyr 


hys 


Ala 


Glu 


Gly Tyr 


Pro 


Glu 


Ala 


Thr 


Arg 


Thr 


He 


Thr 


He 


Lys 














4 55 










•160 










Arp 


hys 


Glu 


Thr 


Val 


lie 


Met 


A£p 


Tl« 


Ala 


Leu 


Gly 


Asn 


Sec 


Val 


Pro 


46S 








470 








475 










480 


Leu 


Pto 


Val 


Pro 


Asp 


Phe 


Thr 


Ala 


Ser 


Pro 


J tot 


Thr 


He 


Ser 


Val 


31 •• 










485 










4 90 










495 




Gl Ll 


Ssr 


Val 




Phe 


Gin 


A^p 


Gin 


Thr 


Thr 


hsn 


Aen 


Pro 


Thr 


A-^n 


Tro 
















505 










510 






Olu 


* ^f- 


Thr 


Ph'*- 


Glu 


Gly 


Gly 


Gin 


Pru 


Aia 


Met 


Ser 


Thr 


Glu 


Gin 


A^'n 






bl5 










520 










525 








Pro 


L«vi 


Val 


Ser 


Tyr 


Ser 


His 


Pro 


Gly Gin 


Tvr 


Asp 


Val 


Thr 


how 


L-" 




530 










535 










540 










Val 


Trp 


Acn 


Ala 


Ssr 


Gly 


Ser 


Asn 


Thr 


He 


Thr 




Glu 


Lys 


Phe 


He 


S45 










650 










555 










560 


Thr 


Val 


Asn 


AXa 


Val 


Met 


Pro 


Val 


Ala 


Glu 


Phe 


Val 


Gly Thr 


Pro 


Thr 










565 










57 0 










575 




Glu 


He 


Glu 


Glu 


Gly Gin Thr Val 


Scr 


Phe 


Gin 


Asn 


61n 


Ser 


Thr 


Asn 








580 










585 










590 






Ala 


Thr 


Asn 


Tyr 


Val Trp 


He 


Phs 


Asp 


Gly 


Gly 


Thr 


Pro 


Ala 


Thr 


er 






S95 








600 










605 








Glu 


A'p 


Gl u 


A*n 


Pro 


Thr 


Val 


Leu 


Tyr 


Ser 


Lys 


Ala 


Gly Gin 


Tyr 






610 










615 










620 










Val 


'i'hc 


Leu 


Lys 


Ala 


He 


Ser 


Ala 


Ser 


Gly 


Glu 


Thr 


Val 


Lys 


Thr 


Lys 


6? 5 










630 










635 










640 


Glu 


hys 


Tyr 


Tie 


Thr 


Val 


Lys 


Lys 


Ala 


Pro 


Val 


Pro 


Ala 


Pro 


Val 


Ala 








645 










650 










655 




K-p 


Phe 


Glu 


Gly 


Thr 


Pro 


Arg 


Lys 


Val 


Ly5 


Lys 


Gly Glu Thr Val 


Thr 






660 








665 










67 0 






Phe 


Lys 


Asp 


Leu 


Ser 


Thr 


Asn 


Asn 


Pro 


Thr 


Ser 


Trp Leu 


Trp 


Val 


Phe 






6-7S 










680 










685 








Glu 


Gly 


oly 


Ser 


Fio 


Ala 


Thr 


Ser 


Thr 


Glu 


Gin 


Asn 


Pro 


Val 


VaL 


Thr 




690 










6S5 










700 










Tyr 


Asn 


Glu 


Thr 


Gly 


Ly^ 


Tyr 


Asp 


Val 


Gin 


7.CU 


Thr 


Ala 


Thr 


Asn 


Glu 


705 










710 










715 










720 


Gly 


Gly 


Ser 


Asn VaX Lys l-ys 


Ala Glu As^p Tyr 


He 


Glu 


Val 


He 


Leu 










725 










730 










735 




Asp 


Asp 


S-r 


Val 


Glu Asp He Val 


Ala 


Gin 


Thr 


Gly 


He 


Val 


He 


Arg 








740 










745 










750 






Pro 


Gin 


Asn 


Gly Thr Lys Gin 


He 


Leu 


He 


Glu 


Ala 


Asn 


Ala 


Ala 


He 






75S 










760 










765 








Lys 


Al a 


He 


Val 


Leu 


Tyr Afp 


lie 


Asn 


Gly Arg 


Val 


Val 


Leu 


Lys 


Thr 




110 










775 










780 










Thr 


Pro 


Ast\ 


Gin 


Leu 


Arq 


Ser 


Thr 


V^l 


Asp 


Lau 


Ser 


He 


Leu 


Pro 


Glu 


785 










790 










7 96 










800 


Gly 


life 


Tyr 


Thr 


He 


Asn 


He 


Lys 


Thr 


Glu 


Lys 


Ser 


Ala 


/org 


Thr 


Glu 










005 










810 










815 




Lys 


He 


Hl£ 


He 


Gly 

























{2) IJIE-ORJJATIOJ) FOR SEQ ID MO: 427 

(11 SEQUEMCE CHARACTERISTICS: 

tA) LENGTH: 76 amino acids 
(B) TfPE: amino acid 
(01 TOPOLOGY: linear 

(li) IJOLEOJLE TYPE: protein 

till) IJypOTHETICAL: YES 



820 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



a^t ^v°h n«» ip^h «ii iT^fc ,tS! iHl iH; 

lUi '^-li^ ^1 »iUiii^O ^iOi m ffc J .iOi .i^ U nuJ 



WO 99/29870 PCT/AU98/01023 

367/490 



(vl) ORIGIUAL SOURCE: 

{AI ORGANISM: Porphyromonas gingtvaiis 

(liU FEATURb: 

(Bl LOCATION 1. . .16 



SEQUENCE DESCRIPTIOtI : SEQ TD tlO: 4 2? 



Met 


Axg 


Leu lie 


Lys 


Ala 


Ph» Leu Val 


Gin 


L«u 


Lea 


Leu 


L«u 


Pro 


lie 


L 






5 






10 










15 




Phc 


Phs 


Tyc Lys 


Arg 


Phe 


Ila Ser Pro 


Leu 


Thr 


Pro 


Pco 


Sec 


Cys 


Arg 






20 






25 










30 






Phe 


The 


Pro Ser 


Cys 


Sec 


Sec Tyr Ala 


11% 


Glu 


Aid 


Leu Arg 


Lys 


Tyr 






35 






40 








45 








Gly 


Pco 


Gly Lys 


Gly 


Leu 


Leu Leu Ser 


Tie 


Lys 


Acq 


lie 


Leu 


Arg 


Cys 




50 








55 






60 










His 


Pro 


Trp Gly 


Gly 


Ser 


Gly Tyr Asp 


Pro 


Val 


Pro 










65 








70 






75 












(2) 


I » FORI fATIOH 


FOR 




ID lJO:428 

















(il SEOUENCS. CHARACTERISTICS: 

(AI LENGTH: 859 amino actd? 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



<ii) MOLECULE TYPE: proteJn 

fiix) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 

(I:'.! FEATURE: 

(A) IIAIIE/KEY: mi?c_feature 

(B) LOCATIOf^ 1. . .859 



SEQUENCE DESCRTPTIOM: SEQ ID IJO:J28 



Met 


A] a 


Tyr 


Asp 


Phe 


Thr 


Gin 


Thr 


Pho 


Arg 


Asn 


Ser 


Leu 


Glu 


T;yr 


Ser 


1 








5 










10 










15 




Tyr 


Gin 


Glu 


Als 


Thr 


Arg 


Leu 


Gly 


Val 


Val 


Ala 


Val 


Ttir Gin Asp 


Het 








20 










25 










30 






Leu 


Val 


Leu 


Gly 


lie 


lie 


Arg 


Aep 


Gly 


Asp 


Asn 


Gly 


Ala 


He 


Asp 


He 






35 










40 










45 








net 


Arg 


Hii- 


Tyr 


Gly 


He 


A^n 


Leu 


Tyr 


Glu 


Leu 


Lys 


Arg 


Leu 


He 


Glu 




50 










55 










60 








Leu 


Glu 


Ala 


lie 


Ala 


Glu 


Ser 


Leu 


Pro 


Ala 


Ser 


Pro 


Glu 


Gly 


Ser 


Pro 


65 










70 










75 








00 


lie 


Phe 


Thr 


Pro 


Ser 


Ala 


Arg 


Glu 


Ala 


lie 


Asp 


A^p 


Ala 


Thr Asp 


He 










65 










90 










95 




Cys 


Ala 


Asp 


Met 


Glu 


Asp 


Glu 


Ala 


Val 


Ser 


Pro 


Val 


His 


Leu 


Leu 


Leu 








100 










1 05 










110 






Ser 


lie 


Leu 


Asn 


Ser 


Thr 


Gin 


Glu 


Ser 


Leu 


Val 


Gin 


Ly3 


He 


Fhe 


Het 






lis 










120 










125 








Lys 


Gin 


Gly 


lie 


Lys 


Tyr 


Asp 


Thr 


lie 


Leu 


Ser 


Asp 


Tyr 


Phe Gly 


Gin 




130 










135 










14 0 










Arg 


Arn 


Pro 


Ser 


Glu 


Gly 


Lys 


Ser 


Pro 


Ser 


Glu 


Met 


Glu 


He 


Leu 


Asp 


145 










150 










155 










160 


Gly 


Tyr 


Gin 


Asp 


PiSTk 


Asp 


Phe 


Aap 


Asp 


Glu 


Glu 


Asp 


Glu 


Ser 


Ser 


Pro 










165 










170 










IVb 




Pro 


Ser 


Gly 


Asn 


See 


Gly 


'i'hr 


Gly 


Gly 




Scr 


Gly 


Asp Ala 


Pro 


Glu 








180 










185 










19C 






Gin 


Aan 


Thr 


Gly 


Gly 


Oiy 


Asp 


Thr 


Thr 


Thr 


■ihr 


Thr 


Axg 


Ser 


Gly Gly 






155 










200 










205 








Asp 


Thr 


Pro 


Ala 


Leu 


Asp 


Thr 


Phe 


Gly 


Thr 


As^p 


He 


Thr 


Ala 


Met 


Ala 




210 










215 










22-0 










Ala 


Ala 


Gly 


Lys 


L-^u 


Asp 


Pco 


Val 


Val 


Giy 


Arg 


Glu 


Gin 


Glu 


T1 e 


Glu 


225 










230 










235 










240 


Arg 


Val 


lie 


Gin 


He 


Leu 


Ser 


Arg 


Acq 


Lys 


Lys 


Asn 


Asn 


Pro 


Val 


Leu 










245 










250 










255 




lie 


Gly 


Glu 


Pro 


Gly 


Val 


Gly 


Lys 


Ser 


Ala 


Tl- 


Val 


Glu 


Gly 


Leu 


Ala 



260 26b 7.70 



SUBSTmrrE sheet (Rule 26) (RCVAU) 




^: tP^ --^t i|2S' iHl TT'" 

rcp o i« ^H„y o iini.'; ii^-ii 



wo 99amo 



PCT/AU98/01023 



368/ 



490 



Glu 






Va 1 






Lys 


Val 


Ser 


Arg 


Ht 


Leu 


Phe 


Asp 


Lys 


Arg 
















280 
























eu 


sp 


eu 


Al a 


Gin 


net 


Val 


Ala 


Gl y 


Thr 




Tyr 


rg 














295 










300 










Gly 


in 




Glu 


Glu 


Arg 


I.ca 


Lys 


Ala 


Val 


Leu 


Asp 


Glu 


eu 




Uj ^- 


305 










31 0 










315 










320 


Asn 




'.-.J n 




H* 


Leu 


■ke 


He 


Asp Glu 


He 


HI s 


The 


He 


Val 


'Sly 










32 5 










330 










335 






Giy 




Ala 


Ala 


fZ\ \j 
faiy 


Ser 


Met 


Asp 


Thr 


Ala 


Asn 


Met 


Leu 


Lys 


Pro 








340 










345 










350 






TVla 


Leu 


Al a 


Arg 


Gly 


Gin 


val 


Gin 


Cys 


He 


Gly 


Al s 


Thr 


Thr 


Leu 


Asp 






355 










360 










365 








Glu 


Tyr 


Ary 


Lys 


Af n 


He 


Glu 


Lys Asp Gly Ala 


Leu 


Glu 


Arq 


Arg 


Phe 




370 










37 5 










380 






■ 




Gin 


Lys 


Val 


Pro 


He 


Al a 


Pro 


Ser 


Thr 


Ala 


Glu 


Gl u 


Thr 


Leu 


Thr 


H e 


385 










3 90 










395 










4 00 


Leu 


Gin 


Ab'r) 


He 


Lys 


Glu 


Lys 


Tyr 


Glu 


Asp 


Tyr 


His 


Gly 


Val 


Arg 


Tyr 










405 










410 










415 




Thr 


A?p 


Glu 


Ala 


He 


Lys 


Ala 


Ala 


Val 


Glu 


Leu 


Thr 


Asp 


Arg 


Tyr 


Val 








420 










425 










430 






Sfet 




Arg 


Phe 


Phe 


Pro 


Asp 


Lys 


Ala 


He 


Asp 


Ala 


Het 


Asp 


Glu 


Al a 






435 










440 










4 4b 








Gly 


Ala 


tier 


Val 


Hif 


He 


Thr 


Aen 


Val 


Val 


Ala 


Pro 


Lys 


Glu 


He 


Glu 




450 










455 










4 60 










He 


L°u 


Glu 


Ala 


Glu 


Leu 


Ala 


Ser 


Val 


Arg 


Glu 


Asfi 


Lys 


T,fiU 


Ser 


Ala 


4 55 










470 




















480 


Val 


LV5 


Ala 


Gin 


?isn 


Tyr 


Glu 


Leu 


Ala 


Ala 


Ser 


Phe 


Arg 


Asp 


Gin 


Glu 










465 










490 










4 95 




Axg 


Arg 


Thr 


Gin 


Gl n 


Gin 


He 


Ala 


Glu 


Glu 


Lys 


Lys 


Lys 


Trp 


Gl u 


Glu 








500 










505 










510 






Gin 


Met 


ser 


Lys 


His 


Arg 


Glu 


Thr 


Val 


Asp 


Glu 


Asn 


Val 


Val 


Ala 


His 






515 










520 










525 








Val 


Val 


Ala 


Leu 


Met 


Thr 


Gly Val 


Pro 


Ala 


Giu 


Ary 


Leu 


Ser 


Thr 


Gl y 




530 










535 










54 0 










Glu 


Gly 


Glu Arg 


Leu 


Arg 


Thr 


Met 


Ala 


Asp 


Asp 


Leu 


Lys 


Thr 


Lys 


Val 


545 










550 










555 










560 


V^l 


Gly 


Gin 


Asp 


Thr 


Al a 


He 


Glu 


Lys 


Het 


Val 


His 


Ala 


lie 


Gin 


Arg 








565 










570 










57 5 




Asn 


Arq 


Leu 


Gly 


Leu 


Arg 


Asn 


Glu 


Lys 


Lys 


Pro 


He 


Gly 


Ser 


Phe 


Leu 








580 










585 










590 






Phe 


Leu 


Gly 


Pro 


Thr 


Gly Val 


Gly Lys 


Thr 


Tyr 


Leu 


Ala 


Lys 


Lys 


Leu 






595 










600 










600 








Ala 


Glu 


Tyr 


Leu 


Phe 


Glu 


Asp 


Glu 


Asn 


Ala 


Met 


He 


Arg 


Val 


Asp 


Met 




610 










615 










620 










Ser 


Glu 


Tyr 


Met 


Glu 


Lys 


Phe 


Ser 


Val 


Ser 


Arg 


Leu 


Val 


Gly 


Ala 


Pro 


625 










€30 










635 










64 0 


Pro 


Gly 


Tyr 


Val 


Gly 


Tyr 


Glu 


Glu 


Gly 


Gly Gin 


Leu 


Thr 


Glu 


Arg 


Val 










645 










6S0 










655 




Arg 


Atg 


Lys 


Pro 


Tyr 


Ser 


Val 


Val 


Leu 


Leu 


Asp 


Giu 


lie 


Giu 


Lys 


Al a 








660 










665 










670 






His 


Ala 


Asp 


Vai 


Phe 


Asn 


Leu 


Leu 


Leu 


Gin 


Val 


Met 


Asp 


Glu 


Gly 


Gin 






67 5 










680 










635 








Leu 


Thr 


Adp 


Ser 


Leu 


Gly Arg Arg 


Val 


Aen 


Phe 


Lys 


Asn 


Thr 


Val 


He 




690 










695 










700 










He 


He 


Thr 


Ser 


Asn 


Val 


Gly 


Thr Arg 


Gin 


Leu 


Lys 


Asp 


Phe 


Gly 


Gin 


705 










710 










715 










720 


Gly 


He 


Gly 


Phe Arg 


Ser 


Glu 


Lys Asp Glu Glu 


Ala 


Asn 


Lys 


Glu 


His 










725 










730 










735 




Ser 


Arg 


Ser 


Val 


He 


Gin 


Lys 


Ala 


Leu 


Asn 


Lys 


Thr 


Phe 


Ser 


Pro 


Glu 








740 










745 










750 






Phe 


Leu 




Arg 


Leu 


Asp 


Asp 


He 


He 


Leu 


Phe 


Asp 


Gin 


Leu 


Gly 


Lys 






755 










7 60 










765 








Thr 


Glu 


He 


Ary 


Arg 


Met 


Val 


Asp 


He 


Glu 


Leu 


Lys 


Ala 


Val 


Leu 


Ala 




770 










775 










780 










Arg 


Tie 


Hi^ 


Arg 


Ala 


Gly Tyr Asp 


Leu 


Val 


Leu 


Thr 


Asp 


Glu 


Ala 


Lys 












730 










7 95 










000 


Asp 


Val 


He 


Ala 


Thr 


Lys 


Gly 


Tyr 


Asp 


Leu 


Gin 


Tyr 


Gly 


Ala 


Arg 


Fro 










605 










810 










815 




Leu 


Lys 


Arg 


Thr 


Leu 


Gin 


Asn 


Glu 


Val 


Glu 


Asp 


Arg 


Leu 


Thr 


Asp 


Leu 








820 










825 










830 






He 


Leu 


Ser 


Gly 


Gin 


He 


Glu 


Lys 


Gly 


Gin 


Thr 


Leu 


Thr 


Leu 


Ser 


Ala 






835 










640 










845 








Arg 


Asp 


Gly 


Glu 


He 


He 


Val 


Gin 


Glu 


Gin 


Ala 














05O 










855 





















SUBSTrrUTE SHEET (Rule 26) (RCVAU) 




wo 99/29870 PCT/AU98/01023 

369/ 490 



{2> I H FORI (AT rot I TOR SLQ 10 110:429 

U) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 417 amino acids 
5 fP) TYPE: amino acid 

(0) TOPOLOGY: linear 

(Ul UOLtCULE TVPE; protein 

10 (iiil HYPOTHETICAL: YES 

ivl) ORIGIIIAL SOURCE: 

(A) ORGATUSH: Porphyromonae <gingivali5 

15 a:-:l FEATURE: 

(AJ n;iME/KEY: misc feature 
IB) LOCATION 1. . ..117 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



SEQUEIICE DESCRIPTION: SEQ ID MO: 12 9 



list 


A.»n 


T-,-r 


Leu 


Tyr 


He 


Leu 


He 


Thr 


Leu 


Leu 


Leu 


Ser 


Gly 


Phe 


Phe 


1 








5 










10 










15 




Ssr 


Gly Ala 


Glu 


Tie 


Ala 


Phe 


Leu 


Ser 


Ser 




Lys 


Leu 


Arg 


Leu 


Glu 








20 










25 








30 






Leu 


A?p 


Arg 


Asn 


Arg 


Gly 


Asp 


Leu 


Thr 


Gly 


Arg 


Ala 


Leu 


Asn 


Leu 


Lsu 






35 










4C 










45 








Tyr Arg 


His 


Pro 


Asp 


Gin 


Leu 


Val 


Thr 


Thr 


Leu 


Leu 


Val 


Gly 


Asn 


Asn 




to 










55 




















lie 


Val 


Leu 


Val 


Vol 


Tyr 


Gly 


Leu 


Leu 


Met 


Ala 


Gly 


Leu 


Leu 


Ala 


Ala 


65 










70 










75 










80 


Pro 


Lttu 


Ala 


Gin 


Trp 


He 


Asp 


Asn 


Asp 


Ala 


tlet 


tie 


Val 


Val 


Leu 


Gill 










85 










90 










95 




Ser 


Val 


Leu 


Ser 


Thr 


11% 


He 


He 


Leu 


Phe 


Thr 


Giy Glu 


Phe 


Leu 


Pro 








1 OC 










105 










110 






Lys 


Ala 


11* 


Ph« 


hys 


Thr 


Asn 


Ala 


Asn 


Met 


Met 


Met 


Arg 


Val 


Phe 


Ala 






115 










120 










125 








Leu 


Pro 


Tie 


Val 


Ala 


He 


Tyr 


Tyr 


Leu 


Leu 


Ti'r 


Pro 


Leu 


Ser 


Lis 


Leu 




130 










135 










140 










Vht 


Thr 


Gly 


Leu 


Ser 


Arg 


Ssr 


Phe 


He 


Arg 


Leu 


Val 


Asp 


Lys 


Asn 


Tyr 


145 










150 










155 










160 


V3l 


Pro 


Thr 


Thr 


Val 


Gly 


Leu 


Gly 


Arq 


Val 


Asp 


Leu 


Asp 


His 


Tyr Leu 










155 










170 










175 




Ala 


Glu 




Met 


Ser 


Gly 


Glu 


Asn 


Glu 


Gin 


Asn 


Asp 


Leu 


Thr 


Thr 


Glu 








180 










185 










190 






Val 


LV5 


lie 


He 


Gin 


Asn 


Ala 


Leu 


Asp 


Phe 


Sex- 


Gly 


He 


Gin 


Val 


Arg 






195 










200 










205 








Asp 


CV6- 


tlet 


lie 


Pro 


Arq 


Asn 


Glu 


Met 


lis 


Ala 


Cys 


Glu 


Leu 


Gin 


Thr 




21CJ 










2LS 










220 










Asp 


lie 


Glu 


Vol 


Leu 


Lys 


Thr 


Thr 


Phe 


He 


Asp 


Thr 


Gly 


Leu 


Ser 


Lys 


225 










230 










235 










240 


lie 


lie 


lie 


Tyr 


Arg 


Gin 


Asn 


He 


Asp 


Asp 


Val 


Val 


Gly Tyr 


lie 


His 










245 










250 










255 




Ser 


Ser 


Glu 


Met 


Phe 


Arg 


Gly 


Gin 


Ajp 


Trp 


Gin 


Lys 


Arg 


He 


Asn 


Thr 








260 










265 










270 






Tlir 


Val 


Php 


Val 


Pro 


Glu 


Ser 


Met 


Tyr 


Ala 


Asn 


Lys 


Leu 


Met 


Arg 


Leu 






275 










280 










285 








Leu 


list 


Gin 


Arg 


Lys 


Lys 


Ser 


He 


Ala 


He 


Val 


He 


Asp 


Clu 


Leu Gly 




2 90 










295 










300 










Gly Thr 


Ala 


Gly 


Met 


Val 


Thr 


Leu 


Glu 


Asp 


Leu 


Val 


Glu 


Glu 


He 


Phe 


305 










310 










315 










320 


Gly 


Asp 


lie 


Glu 


A^p 


Glu 


His 


Asp 


Thr 


Arg 


Lys 


He 


He 


Ala 


Lys 


Gin 










325 










330 










335 




Leu 


Gly 


Pro 


His 


Thr 


Tyr 


Leu 


Val 


Ser 


Gly 


Arq 


Met 


Glu 


He 


Asp Asp 








340 










345 










350 






Val 


Aan 


Glu 


Ary 


?he 


Gly 


Leu 


Ser 


Leu 


Pro 


Glu 


Ser 


Asp Asp 


Tyr 


Leu 






3S5 










3S0 










365 








Tlir 


Val 


Ala 


Gly 


Phe 


He 


Leu 


Asn 


Ser 


His 


Gin 


Asn 


He 


Pro 


Gin 


TUa 




370 










375 










380 










Ajn 


Glu 


Val 


Val 


Glu 


He 


Ala 


Pro 


Tyr 


Thr 


Phe 


Thr 


He 


Leu 


Arq 


Ser 


38 £ 










390 










395 










400 


Ser 


Scr 


Thr 


Lys 


Xle 


Glu 


Leu 


Val 


Lys 


;tet 


Sec 


He 


Asp 


Asp 


Gin 


Ser 










405 








410 










415 




Asn 

































SUBSTITUTE SHEET (Rule 26) (KO/AU) 



wo 99/29870 PCT/AU98/0m3 



370/490 



{P i IVrOHSUKTXOti FOR SEQ XD NO: 430 

(i) SEQUEUCP CHARACTERISTICS: 

(A) LENGTH: 293 amino acids 
IB) rrPE: amino ncid 

(n) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

( i i 1 ) H /POTHETI CAL : YES 

tvi) ORIGIItAL SOURCE: 

(Ai ORGANISM: Porphycomonas gingival is 

(ix) FEATURE: 

{A J HAIfE/KEV: mi sc feature 

(B) LOCATIOM 1. . .293 





IrA ) 


SEQUENCE DESCRIPTION: SEC : 


rO NO: 430 










Met 


Lys 


Gin 




Tyr 


Ph- 


Lys 


Arg 


Val 


Cys 


Ser 


Leu 


Leu Trp 


r.eu 


Val 


1 








S 










10 










15 




Leu 


Pro 


Met 


Leu 


He 


H«t 


Pro 


Leu 


Glu 


Val 


Ala 


Ala 


Gin 


Glu 


He 


He 








20 










25 










30 






Pro 


Asn 


Glu 


Glu 


Val 


T.«u 


Glu 


Ser 


Leu 


Thr 


Fhfc 


Val 


/Ma 


Pro 


Val 


Glu 






35 










40 










•15 








GLu 


Thr 


Affp 


Ala 


He 


Glu 


Ala 


Glu 


Val 


Glu 


Ala 


Leu 


Gin 


Glu 


He 


Val 




St' 










55 










60 










Ala 


Thr 


Glu 


Glu 


He 


Ala 


Glu 


Gin 


Al^ 


Vol 


Arg 


Ser 


Tyr 


Thr 


Tyr 


Thr 


65 










70 










75 










80 


Val 


T/r 


Arg 


Affp 


Gly 


Val 


Lys 


lis 


Ala 


Ser 


Gly 


Lsu 


Thr 


Glu 


Pro 


Thr 


















90 










S5 




Phe 


Leu 


Asp 


Glu 


Asp 


Val 


Pro 


Ala 


Gly 


Glu 


His 


Thr 


Tyr 


cys 


Val 


Glu 








100 










105. 










110 






Val 


GIm 


Tyr 


GVn 


Gly 


Gly 


Val 


Ser 


Asp 


Lys 


Val 


Cys 


Val 


Asp 


Val 


Glu 






115 










120 










125 








Val 


Lys 


Asp 


Phe 


Lys 


Pro 


Val 


Thr 


A5n 


Leu 


Thr 


Gly 


Thr 


Ala 


Sor 


Asn 




130 










135 










140 










A£p 


GIu 


Val 


Ser 


Leu 


Affp 


Trp 


Asp 


Gly 


Val 


Glu 


Glu 


Lys Ala 


Glu 


Glu 


14S 










ISO 










155 










160 


Pro 


Ala 


Ser 


Asp 


Lys 


Ala 


Val 


Ser 


Tyr 


Asn 


Val 


Tyr 


Lys 


Asn 


Gly Thr 










165 










170 










175 




Leu 


lie 


Gly 


Asn 


Thr 


Ala 


Glu 


Thr 


His 


Tyr 


Val 


Glu 


Thr 


Gly 


Val 


Ala 








180 










185 










19G 






Afn 


Gly 


Thr 


Tyr 


He 


Tyr 


Glu 


Val 


Glu 


Val 


Lyi- 


Tyr 


Pro Asp 


Gly 


Val 






19S 










200 










205 








Ssr 


Pro 


Lys 


Val 


Ala 


Val 


Thr 


Val 


Thr 


Val 


Thr 


Asn 


Ser 


Ser 


Leu 


Ser 




210 










215 










220 










Arn 


Val 


Asp 


Gly 


Gin 


Ala 


Pro 


Tyr 


Thr 


Leu 


Am 


Val 


Glu Gly 


Lys 


Lys 


225 










230 










235 










240 


lift 


lie 


Ala 


Glu 


Ala 


Hif 


Gly 


Met 


H« 


Thr 


Leu 


Tyr 


Ae-p 


Ho 


Asn 


Gly 










245 










250 










255 




Arg 


Thr 


Val 


Ala 


Val 


Ala 


Pro 


Asn 


Arg 


Leu 


Glu 


Tyr Met 


Ala 


Gin 


Thr 








260 










26S 










27 0 






Gly 


Phs 


Tyr 


Ala 


Val 


Arg 


Phe 


Asp 


Val 


Gly Asn 


Lys 


HiE 


His 


Val 


Ser 






275 










280 










285 








Lys 


He 


Gin 


Val 


Arg 


























290 































12) IllFORIIATIOM FOR SEQ ID NO: 4 31 

U) SEQUENCE CHARACTERISTICS: 

(A) LEMGTH: 312 amino acids 
(81 TYPE: amino acid 
(D) TOPOI/OGY: linear 

(ii) riOLECULE TYPE; protein 

(iiiJ HVPOTHF.TICAL: YES 

tvi) ORIGIMAL SOURCE: 

(A) ORGAIIISH: Pcrphyromonas gingivaiis 

lix) FEATURE: 

(A) NAldE/KEY: roi3C_featurt 



SUBSTtTUTE SHEET (Riile 26) (RO/AU) 




ir«h tJ=J> r^il 

iL3 C3i m U.^|i O O ^ CJ 



WO 99/29870 



PCT/AU98/01023 



371/ 490 



(B) LOCATiai 1. . .312 



Ui) SEQUENCE DBSCRI PTI0t4 : SEQ ID MOM 31 



Met 


lie 


Pro 


Leu 


Ser 


Glu 


Ser 


Phe 


Glu 


Ser 


Gly 


He 


Pro 


Ala 


He 


Trp 


1 








5 










10 










15 




Lys 


Thi- 


He 


Asp 


Ala 


Affp 


Gly 


Asp 


Gly 


Tyr 


Asn 


Trp 


Met 


His 


Leu 


Thr 






20 










25 










30 






Afn 


Phe 


Thr 


Gly 


Gin 


Ser 


Gly 


Leu 


eye 


Val 


Ser 


Ser 


Ala 


Ser Tyr 


He 






35 










40 










45 








Gly 


Gly 


Val 


Gly 


Alfl 


Leu 


Thr 


Pro 


Asp Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Giu 




50 










55 










60 










Leu 


Lys 


Leu 


Pro 


Thr 


Asp 


Ala 


Leu 


Val 


Glu 


He 


He 


Tyr Trp Val 


Cy5 


S5 








70 










75 










ao 


Thr 


Gin 


Asp 


7.eii 


Thr 


Ala 


Pro 


Ser 


Glu 


His 


Tyr Ala Val Tyr Ser 


Ser 










85 










90 










95 




Ser 


Thr 


Gly 


Afi'n 


Asn 


Ala 


Ala 


Asp 


Phe 


Val 


Asn 


Leu 


Leu 


Tyr 


Glu 


Giu 








100 










105 










110 






Thr 


Leu 


Thr 


Ala 


Lys 


Azq 


He 


Gin 


Ser 


Pro 


Glu 


Leu 


He 


Arg 


Gly 


Asn 






115 










120 










125 








Arg 


Thr 


Gin 


Gly 


Val 


Trp 


Tyr 


Gin 


Arg 


Lys 


V^l 


Val 


Leu 


Pro 


Asn 


Asp 


130 










135 










140 










The 


Lys 


Tyc 


Val 


Ala 


Phe 


Arq 


His 


Phe 


Asn 


Sor 


Thr Asp Asn 


Phe 


Trp 


115 










ISO 










155 










150 


Leu 


A5n 


Leu 


A5P 


Giu 


Val 


Ser 


Hs 


Leu 


Tyr 


Thr 


Pro 


Leu 


Pro 


Arg 


Arg 










165 










170 










175 




Ala 


Pro 


Cya 


Pro 


His- 


Pro 


Gly 


Gly 


Tyr 


Thr 


Tyr 


Ser 


Val 


Phe 


Arg 


Arp 








180 










185 










190 






Gly 


'31 n 


hys 


He 


Ala 


Ser 


Gly 


Leu 


Ser 


Ala 


Leu 


Ala 


Tyr 


He 


Asp 


Thr 






195 










200 










205 








A^p 


Val 


Pro 


Tyr 


Gly 


Thr 


Gin 


Asp 


Tyr 


Cys 


Val 


Gin 


Vbl 


Asn Tyr 


Leu 




210 










215 










220 










Gin 


Gly 


Asp 


Ser 


Tyc 


Lys 


Val 


Cys 


Lys 


Asn 


Tl» 


Val 


Val 


Ala 


Asn 


Ser 


2?.5 










230 










235 










240 


Ala 


Arrv 


He 


Tyr 


Gly 


Ala 


Asp 


Lys 


Pro 


Phe 


Ala 


Leu 


Thr 


Val 


Val 


Gly 










245 










250 










255 




Lys 


Thr 


He 


Val 


Ala 


Ser 


Ala 


Phe 


Lys Gly 


Glu 


He 


Thr 


Leu 


Tyr 


Asp 








260 










265 










270 






He 


Axg 


Gly 


Arg 


I.eu 


He 


Ala 


Ser 


Gly Cys 


Asp 


Thr 


Leu 


Arg 


Tyr 


Lys 






275 










280 










285 








Ala 


Glu 


Asn 


Gly 


Phe 


Tyr 


Leu 


He 


Lys 


He 


Gin 


Val 


Asn 


Gly Thr 


Val 




2 90 










295 










300 










T-^'r 


Thr 


Glu 


l.ys 


He 


Gin 


He 


Gin 


















305 










310 






















(21 


Il-JFORJiATiOH 


FOR 


SEQ 


ID MO: 432 

















(1) SKQUEiiCE CHARACTERISTICS; 

(AJ LEMSTH: 84 3 amino acids 
(B) TYPE: amino acid 
{Dl TOPOLOGY: linear 

till MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas qingivalis 

(1x1 FEATURE: 

(A) MAI IE/ KEY: inisc_f eatur e 
(Bl LOCATION 1. . . 843 



l::l) SEQUENCE DESCRIPTION: SEQ ID HO: 4 32 



Met 


Lys 


Lys 


Ser 


Phe 


Leu 


Leu Ala He 


Val 


Met 


Leu 


Phe Gly He 


Ala 


1 








S 






10 






15 




Met 


Gin 


Gly 


His 


Ser 


Ala 


Pro Val Thr 


Lys 


Glu 


Arg 


Ala Leu Ser 


Leu 








20 






?.5 








30 




Ala 


Arg 


Leu 


Ala 


Leu 


Arg 


Gin Val Ser 


Leu 


Arg 


Met 


Gly Gin Thr 


Ala 






35 








40 








45 




Val 


Ser 


Asp 


Lys 


He 


Sec 


He Asp Tyr 


Val 


Tyr 


Arg 


Gin Gly Asp 


Ala 




50 










55 






60 






Glu 


Ary Gly 


He 


Thr 


Ser 


Gin Glu Glu 


Gly 


Ser 


Pro 


Ala Tyr Phe 


Tyr 


65 










70 






75 






80 



SUBSTITUTE SHEET (Rule 26) <RO/AU) 



tP^Ii tP™ i?"^' 



Ti«'" ^'ii fP*^' 

: '^Up ii:::: 4Ji ium 



wo 99/29870 PCT/AU98/01023 

372/490 



Val 


Ala 


Ao» 


Arg 


Gly hsn 


Asn 


Glu 










85 








her, 


Aro 


T } e 


Pro 


Thr 


He 


Leu 


Ala 








100 










list 


Asp 


Ser 


Met 


Pro 


Ai-p 


Asn 


I.ou 






lis 










120 


n 


Glu 


lis 


Gl y Leu 


lie 


Leu 


Ser 














135 




lis 


Leu 


Arg 


Thr 


Glu 


Gly Val 


Pro 


14 5 










150 






Asn 


Gly 


H .I 5 


Phe 


Ala 


Asn 


Asp 


Pro 










165 








Trp 


Asn 


Asn 


Lys 


Glu 


Pro 


Lgu 


Leu 








1«0 










Gly 


Cys 


Val 


Ala 


Thr 


Ala 


Ala 


Aid 






1 95 










200 


Pro 


Leu 


Gin 


Gly 


G-l u 


Gly Ser 


Phe 




210 










.215 




Gly 


Asn 


Trp 


Ser 


Gly 


Thr 


Phe 


Gly 


225 










230 






Pro 


Gly 


Aen 


Pro Asp 


Leu 


Asp Asn 










245 








Tyr 


Ala 


Thr 


Leu 


Met 


Arg Asp 


Val 








260 










Tyr 


Glu 


Am 


Gly Ser 


Gly 


Thr 


Tyr 
















280 


Arg 


Asn 


Asn 


Phe 


Arg 


Tyr 


Lys 


Arg 




290 










295 




Leu 


Tyr 


Thr 


Ser 


Gin 


Glu 


Trp 


Itis 


305 










310 




Ser 


Gly 


Arg 


Pro 


Val 


Tyr Tyr Ala 










325 








AX a 


Phe 


Va 1 


Cye 


Asp Gly 


Tyr 


Ala 








340 










Trp 


Gly 


Trp 


Gly Gly Val 


Ser 


Asn 






355 










360 


Ser 


Pro 


Thr 


S^ir 


Leu 


Gly 


He 


Gly 




370 










375 


Tyr 


Gin 


Glu 


He 


He 


Thr Gly He 


385 










390 






Ala 


Gly 


Thr 


Af p 


Ala 


Leu 


Pro 


He 










405 








Glu 


Tyr 


Lys 


Ser 


Glu 


Ser 


Gly 


Leu 








420 










Thr 


Gly 


Glu 


Glu 


Gin 


S-r 


Asn 


Leu 






435 










440 


Ala 


Ajp 


Gly 


Giu 


Val 


He 


Glu 


Val 




4 50 










455 




Trp 


Tyr 


Gly 


Tyr 


Gly 


Giu 


His 


Pro 


4 65 










470 






fJln 




Ser 


Gin 


Gly He Asn Thr 




i nr 


Gl u 


Gin 


Trp 


Glu 


Pro 


Val 








500 












Sftr 


lie 


Lys 


Val 


Asn 


Thr 


Thr 






515 










520 


Val 


Asp 


Asn 


Asn 


Glu 


Gly 


Lys 


Leu 




530 










535 




Ala 


Asp 


Leu 


Asn 


Ser Tyr 


Glu 


His 


545 










550 






Ser 


Asp 


Ser 


Pre 


Asp Glu 


He Arg 










565 








Thr 


Gly 


Ali. 


Thr 


Ala 


Asp 


Asp 


Val 








580 










Glu 


Val 


Pro 


Gly 


Gly 


Ser 


Ser 


Asn 






595 










600 


VbI 


L-u 


Thr 


Leu 


Ser 


Glu 


Gly A.-?p 




610 










615 




lie 


Aen 


Asn 


Gin 


Lya 


Asp 


Glu 


Trp 


625 










630 






Lys 


Thr 


Pro 


Thr 


Glu 


Tyr 


Thr 


His 










645 








Gin 


Thr 


Ser 


Thr 


Tyr 


Thr 


Leu 


Asp 








660 








Asp 


Phe 


Thr 


Leu 


Lys 


Asn 


Leu 


Gly 



Gly Tyr 


Ala 


Leu 


Val 


Ala 


Al a 


Mtp 




90 










95 




Tyr 


Ser 


Pro 


He 


Gly Arg 


Phe 




105 










110 






Arg Met 


T. rp 


Leu 


Gin 


He 


ryr 


Asp 










125 








Gly Lys 


Al a 


Gl n 


Leu 


Asn 


li u 










140 










Ala 


Glu 


Val 


Jlis 


Ala 


Leu 


Met 


Asp 






155 










150 


Met 


Arg 


Trp 


Asn 


Gin 


Gly 


Tyr 


Pre 




170 










175 




Pro 


Asn 


Gly 


Asn 


His 


Ala 


Tyr 


Thr 


185 










150 






Gin 


11c 


net 


Arg 


Tyr 


His 


Ser 


Trp 










205 








Asp 


Tyr 


His 


Ala 


Gly Ser 


Leu 


Val 








220 










Glu 


Met 


Tyr 


A.^p 


Trp 


He 


Asn 


Het 






235 










240 


Leu 


Thr 


Gin 


Ssr 


Gin 


Val 


Asp 


Ala 




250 










255 




Ssr 


Ala 


Ser 


Val 


Ser 


Met 


Ser 


Phe 


265 










270 






Ser 


Val 


Tyr 


Val 


Val 


Gly 


Ala 


Leii 










285 








Ssr 


Leu 


Gin 


I.eu 


His 


Val 


Arg 


Ala 








30 C 










Asp Met 


He 


Arg 


Gly Glu 


Leu 


Alo 






315 










320 


Gly Asn Asn 


Gin 


Scr 


He 


Gly 


His 




330 










335 




Ser 


Asp 


Gly 


Thr 


Phe 


His 


Phe 


Asn 


345 










350 






Gly 


Phe 


Tyr 


Lys 


Lou 


Thr 


Leu 


Leu 










365 








Gly Glu 


Gly 


He 


Gly 


Phe 


Thr 


He 








300 










Glu 


Pro 


Ala 


Lys 


Thr 


Pro 


Ala 


Glu 






395 










400 


Leu 


Ala 


Leu 


Lys 


Asp 


Ho 


Glu 


Ala 




410 










415 




Asn 


Val 


Gly 


Tyr 


Ser 


He 


Tyr 


Asn 


425 










4 30 






Asp 


Leu 


Gly 


Tyr 


Arg 


Lc-u 


Asn 


Lys 










445 








hyr. 


Thr 


^jer 


Ser 


He 


Asn 


He 


Ser 








4 60 










Giu 


Ser 


Phe 


Ser 


Lsu 


Ma 


Pro 


Asn 






475 










480 


He 


Thr 


Leu 


Leu 


Tyr Arg Arg 


Thr 




4 90 










495 




Arg 


His 


Ala 


Gin 


Gly Gly Tyr 


Vai 


505 










510 






Asp 


Pro 


Asn 


Asn 


Val 


Val 


Val 


Thr 










525 








Ser 


He 


Val 


Pro 


Asn 


Ser 


Phe 


Val 








540 










Ser 


Thr 


He 


Thr 


Val 


Gin 


Phe 


Asn 






555 










560 


Thr 


Pro 


Val 


Ala 


Phe 


Ala 


Leu 


Ser 




570 










675 




He 


Ser 


Leu 


Gly 


Trp 


Val 


Met 


Ala 


Sfi5 










590 






Tyr 


Pro 


Val 


Val 


Trp 


Ser 


Lys 


Asp 










605 








Tyr 


Thr 


Leu 


•i'rp 


Tyr 


Arg 


Ph% 


Ser 








620 










Lys 


Lye 


lis 


Gly 


Ser 


Val 


Ser 


Val 






635 










640 


Pro 


Leu 


Phe 


Giu 


Val 


Gly 


His 


Asn 




650 










655 




Her 


Ala 


His 


Asn 


Arg 


Val 


Leu 


Pro 


665 










670 






Leu 


Pro 


Ph* 


Asn 


Gly 


Glu 


Leu 


Val 



SUBSTmjriB SHEET (Rule 26) (RCVAU) 



■d^ij ti^ii ri^i -iF^ii tf^-li ir^i- -3=^ 

ii^ iw: 11:31 m ^lui o inn:; 43 u as 



30 



35 



40 



WO 99/29870 



373/490 



67 Ti 



685 



val V.l P,.. A.c, Gin Thr Gin ser s,r 3 = r Gly Ser Leu Trp Ala Al. 

oSb 7 00 

g Gin ni„ The V«l Ki. lie Ly. Gin Gly Glu Thr Ph. Val Tyr Ly. Pro 

val Val 51u Gly Pro lie Pro A.p Gly ser Tyr Arg Ala Thr Leu lit 

Ala Ph, val A-.n Gly Gin Gin Gin L^u Tyr Leu Ly, Gly Ly. Ill f^u 

10 Tyr Thr Val Ly= He Val A..n Gly ^^r Ala Val Glu Ala "e Glu Ser 

ser Glu Gl'u He Ar, Val Phe Pro A.,n Pro Ala Arc I" Tyr Val Glu 

15 780 ^" ^" 790 

Leu Ser Gly Ly.- lie Val Met Lys Asn Ser llu Ser Ala Gly His Gi° 

81P 

Arg net A=p V.l Ser Arg Leu Pro Asn Gly Alo Tyr He Leu V.l 

ZU Asp Gly Tyr Thi: Thr Lyr Tie Asn He Val Hi- 
833 840 

12) IMFORlIATlOt) FOR SF.Q ID KO: 433 

25 t1) .'^EQUEtJCt; CHARACTERISTICS: 

(A) LF.ICTH: 2 90 amino acids 

(B) TiFE: amino ncld 
(D) TOPOLOGY: linear 



(ii) IIOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES 

(vi) ORIGI^fAL SOURCE: 

(A) ORGAtnSM: Porphyromonas gingivali^ 
(Jx) FEATURE: 

(A) HAHE/KEY: mijc teature 

(B) LOCATIOH 1. . .290 

SEQUENCE DESCRIPTlOtl; SEQ TD 110:433 



Met Lyr Lys Leu Phe Leu Ser Leu Thr Ser Leu Val Met Val Phe Ala 
45 Val Ala Ser Cys Asp He He A^p Lys ^^p Gin Thr Leu Leu Pro Ala 
Fro Thr A^n v^l Thr Pro A^p A^n Pro Asp A^p Asn Pro Ser Glu He 

50 ""^^ lo^ ""^^ "^"^ ll'' "^Vr Val Leu Ala Glu Glu Phe 

Thr Gly Gin J.y.. Cys Leu Cys Pro Lyc Gly His Arg Ly^ Leu Ala 

Ala Leu Lys Glu Glr. Tyr Gly I.y. Arg Leu ^'hr Val Val Gly He hL 

55 Ala Gly Pro c:iy Ser Leu Val Pro Pro Leu Phe Arg Thr Glu Ala Gly 
_ , , u 105 110 

Ala Tyr Tyr Set Lye Phe Ala A^n A^n Thr Pro Leu Pre Ala Leu 
120 

60 ill "'^ ''^'^ ^^'^ ser Tyr Val Tyr A.p Lya Ser 

Tyr Lys Thr Trp A.-p Val Pri He Ala Glu Gin nit Glu Gin Lys Ala 

ISS 1 

Lys He Asn He Phe Ala Val Ala Glu Tyr Thr A«p Thr Gin Lya lI: 
65 ,,y, val Thr Val Ly^ Gly Lys Ho Leu gIu Gly A.n Thr Leu Pro Lyc 

„ 185 1 on 

ser net Val Gin Val Tyr Leu Leu Glu Asp Ly^ Leu He Ala Pro Gin 



1?5 200 



70 '''^ Yfi ^'^^ ^" "^^'^ HI-- val Leu 

215 ''20 
Arg Gly Ala Val Asn Gly He Trp Gly Glu Glu Jhe Val A.n Leu Ly. 

A.^p Tyr Leu Tyr Thr Tyr Ala Vai Glu Pro Ser Gly Mer Ser Phe 

75 Val Ala Glu Asn Ty^ S. r He Val Ala The Val Tyr Arc V.l cfn Thr 



PCT/AU98/01023 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ir"*t tTHti id— 



wo 99/29870 PCT/AU98/01023 

374/490 

260 265 270 

Phe Glu V'iil Tyr Asp Val Val His Val L/s lie Acn Pro Gin Ser Asp 
27b 290 285 

Gly Ly5 
5 290 

(21 IMFORIIATIOM FOR SEQ ID NO: 4 34 

(1) SEQUElJCt; CHARACTERISTICS: 
10 fA) LENGTH: 223 amino acids 

(B) TYPE: amino acid 
fDJ TOPOLOG'/: linear 



15 



20 



35 



40 



45 



50 



55 



75 



(ii) MOLECULE Ti-PE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIllAT, SOL'RCE: 

(A) CRGAHISM: Po rphyromona^ gingivaii£r 



il>:) FEATURE: 

(A) NAIIE/KEV; misc_f eature 

(B) LOCATION 1 . . .223 

25 .SEQUENCE DESCRIPTIOM: SEQ ID »0:^3^ 



30 



llet 


Ly^ 


Lys 


Sec 


Sec 


Vol 


Val 


Ala 


Ser 


Val 


Leu 


Ala 


Val 


Ala 


I.eu 


Val 


1 








b 










10 










15 




Phe 


Ala 


Gly 


CVS 


Gly 


Leu 


Asn 


Arn 


Met 


Ala 


Lys 


Gly 


Gly 


Leu 


lie 


Gly 








20 










25 










30 






Ala 


Gl y 


Val 


Gly 


Gly 


Ala 


lie 


Gly 


Ala 


Gly 


Val 


Gly 


Asn 


Val 


Ala 


Gly 






35 










40 










45 










Tbr 


Ala 


Val 


Gly 


Ala 


lie 


Val 


Gly 


Thr 


Ala 


Val 


Gly 


Gly 


Ala 


Ala 




50 










55 










60 










Gly Ala 


Lea 


lie 


Gly 


hyc 


Lys 


Met 


Asp 


Lys 


Gin 


Lys 


Lys 


Glu 


l.fiU 


Glu 


65 










70 










75 










90 


Ala 


Ala 


Val 


Pro 


Asp 


Ala 


Thr 


lie 


Gin 


Thr 


Val 


Asn 


Asp 


Gly Glu 


Ala 










95 




















95 




ile 


Leu 


Val 


Thr 


Phe 


A5p 


Ser 


Gly 


Ile 


Leu 


Phe 


Ala 


Thr 


Asn 


Ser 


Sez 








100 










105 










110 






Thr 


Leu 


Sftr 


Pro 


Asn 


Ser 


Arg 


Thr 


Ala 


LS:U 


Thr 


Lys 


Phe 


Ala 


Ala 


Asn 






115 










120 










125 








tlet 


Asn 


Lyi; 


Asn 


Pro 


Asp 


Thr 


Asp 


lie 


Arg 


lie 


Val 


Gly 


His 


Thr 


Asp 




130 










135 










I'iO 










Asn 


Thr 


Gly 


Ser 


Asp 


Lys 


lie 


Asn 


Asp 


Pro 


Leu 


Ser 


Glu 


Arg 


Arg 


Ala 


145 










150 










155 










160 


Ala 


S^r 


Vol 


Tyr 


Sec 


Phe 


Leu 


Asn 


Ser 


Gin 


Gly 


Val 


Ser 


Met 


Ser 


Arg 










165 










no 










175 




list 


Ala 


Ala 


Glu 


Gly 


Arg 


Gly 


Ser 


His 


Glu 


Pro 


Val 


Al^i 


Aep 


Asn 


Ser 








180 










IBS 










190 






Thr 


V«l 


Ali 


Gly 


Arq 


Ser 


Ala 


Asn 


Arg 


Arg 


Val 


Glu 


Val 


Tyr 


lie 


Leu 






195 










200 










205 








Pro 


Asn 


Ala 


Lys 


Het 


lie 


Glu 


Gin 


Ala 


Gin 


Gin 


Gly 


Thr 


Leu 


Lys 






210 










215 










220 











(2) inFORI-tATION FOR SEQ ID NO: 435 



(i) SEQUENCE CHARACTERISTICS: 

(A) LEIIGTH: 337 amino acids 
60 (8) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

65 (iii) HYPOTHETICAL: YES 

(vl) ORTGIJiAL SOURCE: 

(A) ORGANISE: Porphyromonar glngi*v*alis 

70 (I:-:) FEATURE: 

(A) 2^AJIE/KEY: mific_f ea ture 

(B) LOCATIOJI 1. . .337 



fxi) SEQUEMCE DESCRIPTION: SEQ ID HO: 4 35 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



fn\ o 1^: "ij rpj ^-ii jr'i} ^^"^ ^^-^^ ^-^^ 

luJ «^ iL-, •ii„Ji O Hi ALJi 'O ar: lO iUi 



wo 99/29870 PCT/AU98AH023 

375/490 

t-fet Ser Lys Lys Ser lis Leu Leu Leu Cy^ Cys S-r Leu Cys Ph» H- 
5 10 15 ~ 

Ser Ala Thr Lys Ala Val Thr Pro Vai Ary Asn Val Arg Asn Ser Gin 

20 2S 30 

Val Asa Ser Lys Ala Lys Thr Glu Arg Thr Lys Fro Ser Asp Ser Val 

Arg Tyr He Sec Asn Met He Ala P^p Arg Leu Glu Phe Arg Asn L-s 

55 60 
lie Ser Ser Glu Lys Glu Val Arg Lys Ala Glu Tyr Glu Asn Arg L-u 

''O 75 eo 

Ala Met Glu Ala Leu Asn Tyr Pro Ala He Arp Leu Tyr Gly Glu A-p 

85 90 95 ^ 

Ser Trp Ser Glu Tyr Val A^n Pro Phe Val Gly Ala Gly Thr Asp Val 

110 

Glu Tie Pro A^n Ser Tyr Asp He Ai-p Cys Ser Ser Phe Val Met Pro 

lis 120 125 

val Glu A^p Lys Gin Val Thr .Ser Gin Phe Gly Tyr Arg Arg Ary Phe 

' 135 ijrt 

Gly Arg H°t His Tyr Gly He Asp Leu Ser Val r^n Arg Gly Asp Thr 
ISO 

He Arg Ala Ala Phe A-p Gly Lys Val Arg Vai Arg Ser Tyr Glu Ala 

Arg Gly Tyr Gly Tyr Tyr He Val Leu Arg His Pro Asn Gly lIu Glu 

lys 

Thr val Tyr Gly His Met Ser Arg Gin Leu Val Asd Glu Asn Gin 11^ 

200 ■ 205 

Val Arg Ala Gly Gin Pro He Gly Leu Gly Gly Ser Thr Gly Arg S«r 

^10 215 220 

Thr Gly Pro His Leu His Phe Glu Thr Arg Fhe Met Gly He Pro II- 
230 235 240 

Asn Pro Ser Thr He He Asp Phe Asp Asn Gly Val Pro Leu Arg Asp 

250 255 
He Tyr Thr Phe Lys Arg Gly Ser Asn Ser Arg Tyr Ala hys Ala S-r 

265 270 
Lys Thr Ser Ser Arg Tyr Ala Lys Lys Gly Lys Lys Gly Arg Gla Ala 

275 280 285 

^^'^ ^Ir ^""^ '^^^ '^'''^ Li'^' '^^V Asp Thr Lou Glu Thr 

He Ala Lys Arg His Gly Thr Ser Val Gin l.ys Leu Cys Ala Thr Asn 
" 315 

Gly He Gly Lys Ser Lys He Leu Thr Pro Gly Lys Ala Leu Arg II- 

330 335 

Lys 

(2) INFORMATION FOR SEQ ID NO: 436 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(li) MOLECULE TYPE: protein 

(ill J HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGAJJISM: Porphyromonas gingivalis 

(Iz) FEATURE: 

(A) WAME/KEY: roisc_feature 

(B) LOCATION 1 , . . 151 

IXi) SEQUEHCt: DESCRIPTION: SEQ TH HO: 435 

Met Ala Lys He Asn Phe Tyr Ala Glu Gly Val Ser Leu Pro Arg He 

Arg Arg Arg He Val Gly Lys Trp Ho Ala Glu Val Cys Ser ^rg Tyr 

Gly Lys Ala Val Gly Glu He Ser Tyr Leu Phe Cys Asp Asp Glu Tyr 

He Leu Lys Ala Asn Gin Glu Phe Leu Asp His Asp Tyr Tyr Thr Asp 

55 60 

He He Thr Phe Asp Ser Cys Glu Ala Asp Thr Vai Asn Gly Asp Leu 

'5 80 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 PCT/AU98/01023 

376/ 490 



Lbu He Ser LtfU Asp Thr Val Arg Ser Asn Ala Arrg Ala Leu Aifp Leu 

85 90 95 

Arg Tyr Glu Asp Giu Leu His Arg Vsl lie Tl<! His Gly Il-s L-au His 

100 105 110 

Leu Cys Gly Leu Lys Aep Lys Ser Lys Lys Asp Glu Ala Gin Met Arg 

115 a;?0 125 

Ala Ala Clxi Glu Lys Ala Leu Val Met Leu Arg Glu Thr He Gly Ser 

130 135 140 

Glu Leu Ser Leu Leu His Thr 
145 150 



(21 IHE-OWIATIOH FOR SEQ ID 140:437 

li) SEgUBUCE CHARACTERISTICS: 

(A) LENGTH: 391 amino acids 

(B) TYPE: amino acid 
ID) TOPOLOGY: linear 

(iij IIOLECULE TYPt: protein 

liii) HiPOTHETICAL: YES 



(vi) ORTGTIIAL SOURCE: 

(A) ORGAlJISn: Porphyrortwnas gingivalis 

(ixl FEATURE: 

(A) tlAt-<E/KEY: misc_feature 

(B) LOCATION 1. . . 391 

{::a) SEQUENCE DESCRIPTION: SEQ ID N0:43T 



Met 


Lys 


Val 


Lys 


Tyr 


Leu 


Met 


Leu 


Thr 


Leu 


Val 


Gly 


Ala 


He 


Ala 


Leu 


1 








S 










10 










15 




Asn 


Ala 


Ser 


Ala 


Gin 


Glu 


Asn 


Thr 


Val 


Pro 


Ala 


Thr 


Gly 


Gin 


Leu 


Pro 








20 










25 








30 






Ala 


Lys 


Asn 


Val 


Ala 


Phe 


Ala 


Atg 


Asn 


Lys 


Ala 


Gly 


Ser 


Asn 


Trp 


Phe 






35 










40 










4.5 








Val 


Thr 


Leu 


Gin 


Gly 


Gly 


Val 


Ala 


Ala 


Gin 


Phe 


Leu 


Asn 


Asp 


Asn 


Asn 




50 










55 










60 








Asn 


Lys 


Asp 


Leu 


Het 


Asp 


Arg 


Leu 


Gly 


Ala 


He 


Gly 


Ser 


Leu 


Ser 


Val 


65 










70 










75 










80 


Gly Lys 


Tyr 


His 


Ser 


Pro 


Phe 


Phft 


Ala 


Thr 


Arg 


Leu 


Gin 


He Asn Gly 










35 










90 










95 




Gly Gin 


Ala 


His 


Thr 


Phe 


Leu 


Gly 


Lys 


Asn 


Gly 


Glu 


Gin 


Glu 


He 


Asn 








100 










105 










110 






Thr 


Asn 


Phe 


Gly 


Ala 


Ala 


His 


Phe 


Asp 


Phe 


Met 


Phe 


Asp Val 


Val 


Asn 






lis 










120 










125 








Tyr 


Phe 


Ala 


Pro 


Tyr 


Arq 


Glu 


Asn 


Ary 


Phe 


Phe 


His 


Leu 


He 


Pro 


Trp 




130 










135 










110 










Val 


Gly 


Val 


Gly 


Tyr 


Gin 


His 


Lys 


Phe 


He 


Gly 


Ser 


Glu 


Trp 


Ser 


Lys 


145 










150 










155 










160 


Aep Asn 


Val 


Glu 


Ser 


Leu 


Thr 


Ala 


Asn 


Val 


Gly 


Val 


Met 


Met 


Ala 


Phe 










165 










no 








175 




Arq 


Leu 


Gly 


Lys 


Arg 


Val 


Asp 


Phe 


Val 


He 


Glu 


Ala 


Gin 


Ala 


Ala 


His 








180 










185 










190 






Ser 


Asn 


Leu 


Asn 


Leu 


Ser 


Arg 


Ala 


Tyr 


Asn 


Ala 


Lys 


Lys 


Thr 


Pro 


Val 






195 










200 






205 








Phe 


Glu 


Asp 


Pro 


Ala 


Gly 


Arg 


Tyr 


Tyr 


Asn 


Gly 


Phe 


Gin 


Gly Met 


Ala 




210 










215 










220 










Thr 


Ala 


Gly 


Leu 


Asn 


Phe 


Arg 


Leu 


Gly 


Ala 


Val 


Gly 


Phe 


Asn 


Ala 


lie 


225 










230 








235 








240 


Unfc 


Pro 


HeL 


Asp 


Tyr 


Ala 


Leu 


He 


Asn 


Asp 


Leu 


Asn 


Gly 


Gin 


He 


Asn 










2-15 










250 










255 




Arg 


Leu 


Arg 


Ser 


Glu 


Val 


Glu 


Glu 


Leu 


Ser 


Lys 


Arg 


Pro 


Val 


Ser 


Cys 








260 










265 










270 






Pro 


Glu 


eye 


Pro 


Glu 


Val 


Thr 


Pro 


Val 


Thr 


Lys 


Thr 


Glu 


Asn 


He 


Leu 






275 










280 








285 








Thr 


Glu 


Lys 


Ala 


Val 


Leu 


Phe 


Arg 


Phe 


Asp 


Ser 


His 


Val 


Vai 


Affp 


Lys 




290 










295 










300. 






Asp 


Gin 


Leu 


He 


Asn 


Leu 


Tyr 


Asp 


Val 


Ala 


Gin 


Phe 


Val 


Lys 


Glu 


Thr 


305 










310 










315 








320 


Asn 


Glu 


Pro 


He 


Thr 


Val 


Val 


Gly 


Tyr 


Ala 


Asp 


Pro 


Thr 


Gly Asn 


Thr 










325 










330 










335 




Gin 


Tyr 


Asn 


Glu 


Ly.^ 


Leu 


Ser 


Glu 


Arg 


Arg 


Ata 


Lys 


Ala 


Val 


Val 


Asp 



310 345 350 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



G ibi O J.. = o ib u o ^ o u U3 



WO 99/29870 PCT/AU98/01023 



377/490 



Val Leu Thr Gly Lyn Tyr Giy Val Pro Ssr Glu Leu He Sec Val Glu 

355 350 36h 

Trp Lv'P Gly Asp Ser Thr Gin Pro Phe Ser Lys Lys Ala Trp Asn Ar*^ 

370 37i 380 

Val Val He Val Arg Scr hys 
38S 390 

(21 lUroWtATION FOR SEQ TO HO: 4 38 

(i) SCQVKUCt CHARACTERISTICS: 

(A) r.ENGTH; 305 amino acids 
(E) TYPE: amino acid 
(D) TOPOLOGY: linear 

Uil MOLECULF. TYPE: protein 

(iii) HYPOTHETICAL: YES 

(v1 ) ORIGIIIAL SOURCt:: 

(A) ORGAHISH: Porphyromonas gingival is 

(i::) FEATURE: 

(A) HAllE/KEY: misc_£eature 
(91 T.OCATIOIV 1...3Q5 

(xi) SEQUEtJCE DESCRIPTTOH: SEQ ID HO: 438 





Thr 


Tyr 


Arg 


He 


Met 


Lys 


Ala 


Lys 


Ser 


L^u 


Leu 


Leu 


Ala 


Lsu 


Ala 


1 








5 










10 










15 




GVy 




Al- 


Cys 


Thr 


Phe 


S&r 


Ala 


Thr 


Al a 


Gin 


Glu 


Ala 


Thx 


Thr 


Gin 






20 










25 










30 






Asn 


Lys 


Ala 


Gly 


Met. 


His 


Thr 


Ala 


Phe 


Gin 


Arg 


Asp 


Lye 


Ala 


Ssr 


Asp 






35 










40 










45 








His 


Trp 


Phe 


He 


Asp 


He 


Ala 


Gly 


Gly 


Ala 


Gly 


Met 


Ala 


Leu 


Ser 


Gly 




50 










55 










60 










Trp 


Asn 


A.?n 


Asp 


Val 


Asp 


Phe 


Val 


Acp 


Arg 


Leu 


Ser 


He 


Val 


Pro 


Thr 


65. 










70 










75 










BO 


Phe 


Gly 


He 


Gly 


Lys 


Trp 


His 


Glu 


Pro 


Tyr 


Phe 


Gly 


Thr 


Arg 


Leu 


Gin 










85 










90 










95 




Phe 


Thr 


Gly 


Phe 


Asp 


He 


Tyr 


Gly 


Phe 


Pro 


Gin 


Gly 


Scr 


Lys 


Glu 


Arg 








100 










105 










110 






Asn 


His 


Asn 


Tyr 


Phe 


Gly 


Asn 


Ala 


His 


Lsu 


Asp 


Phe 


Met 


Phe 


Asp 


Leu 






115 








120 










125 








Thr 


Asn 


Tyr 


Phe 


Gly 


Val 


Tyr 


Arg 


Pro 


Asn 


Arg 


Val 


Phe 


His 


He 


He 




130 








135 










140 










Pro 


Trp 


Ala 


Gly 


He 


Gly 


Phe 


Gly 


Tyr 


Lys 


Phe 


His 


Ser 


Glu 


Asn 


Aln 


145 










150 










155 










160 


Asn 


Gly 


■31 u 


Lys 


Val 


Gly 


Ser 


Lys 


Asp 


Asp 


Met 


Thr 


CAy 


Thr 


Val 


Asn 










165 










17 0 










175 




Val 


Gly 


Leu 


Met 


Leu 


Lys 


Phe 


Arg 


LSIJ 


Ser 


Arg 


Val 


Val 


Asp 


Phe 


Asn 








180 










185 
















He 


Glu 


Gly 


Gin 


Ala 


Phe 


Ala 


Gly 


Lys 


Met 


Asn 


Phe 


He 


Gly 


Thr 


Lys 






195 










200 










205 








Arg 


Gly 


Lys 


Ala 


Asp 


Phe 


Pro 


Val 


Met. 


Ala 


Thr 


Ala 


Gly 


Leu 


Thr 


Phe 




210 










215 










220 










Asn 


Leu 


Gly 


Lys 


Thr 


Glu 


Trp 


Thr 


Glu 


He 


Val 


Pro 


Met 


Asp 


Tyr 


Ala 


225 










230 










235 










240 


Lsu 


Val 


Asn 


Asp 


Leu 


Asn 


Asn 


Gin 


He 


Asn 


Ser 


Leu 


Arg 


Gly 


Gin 


Val 










245 










250 










255 




GIu 


Glu 


Leu 


Ser 


Arg 


Arg 


Pro 


Val 


Ser 


cys 


Pro 


Glu 


Cys 


Pro 


Glu 


Pro 








260 










265 










27 0 






Thr 


Gin 


Pro 


Thr 


Val 


Thr 


Arg 


Val 


Val 


Val 


Asp 


Asn 


Val 


Val 


Tyr 


Phe 






27 5 










280 










285 








Arg 


Tie 


Asn 


Ser 


Ala 


Lys 


He 


Asp 


Arg 


Asn 


Gin 


Glu 


He 


Asn 


Val 


Tyr 




2 90 










255 










300 










Asn 


Thr 


Ala 


Glu 


Tyr 


Ala 


Lys 


Thr 


Asn 


Asn 


Ala 


Pro 


He 


Lys 


Val 


Val 


305 










310 










315 










320 


Gly Tyr 


Ala 


Asp 


Glu 


Lys 


Thr 


Gly 


Thr 


Ala 


Ala 


Tyr 


Asn 


Mat 


Lys 


Leu 










32 5 










330 










335 




S«r 


Giu 


Arg 


Arg 


Ala 


Lys 


Ala 


Val 


Ala 


Lys 


Met 


Leu 


Glu 


Ly3 


Tyr 


Gly 








340 










345 










350 






Val 


Ssr 


Ala 


Asp 


Arg 


He 


TJ»r 


He 


Glu 


Trp 


Lys 


Gly 


Ser 


Ser 


Glu 


Gin 






355 










360 










365 








He 


Tyr 


Giu 


Glu 


Afn 


Ala 


Trp 


Asn 


Arg 


He 


Val 


VaJ 


Met 


Thr 


Ala 


Ala 




370 










375 










3fl0 











SUBSTITUTE SHEET (Rale 26) (RO/AU) 



iL.{i '^i* r^i a., in:: Oi lul o ii2 S u J O 



wo 99/29870 



PCT/AU98/01023 



378/490 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Glu 
385 

(2) INFOW-IATIOTJ TOR SEQ ID HO: 4. I? 

(1) SKQUEfJCE CHARACTERISTICS- 

(A) LENGTH: 190 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

Mil) HYPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

fA) ORGAI^lsii: Porphyromonas gingivalis 
<ix) FEATURE: 

(A) HAME/K-EY: misc fcatur- 

(D) LOCATIOH 1 . . .190 

(xi) SEWENCE DESCRTPTIOH: SEQ ID NO:43o 
Met Glu Phe Phe Met Leu Phe lie Ala AJ a V.l Fh. Val 
val Lou Ser Gin Phe Leu Gly cys Pro Phe Leu Gly 

Lys Val A^p Thr Ser He Gly H^t G^y Ala Ala Val Thr 
Ala Leu Ala Thr Leu Val Thr Phe Leu He Gin Ly. Phe 
Arg Fhe Gly Leu Gly Phe Het Gin Thr He Ala Phe He 
Ala Ala Leu Val Gin Met Val Glu He He Leu Lys Lys 

Pro L,u Tyr Gl„ Ala Leu Gly Val Phe f.eu Pro Leu He 

105 

Cy. Cy. val Leu G. y Val Ala lie Leu Val He Gin Ly. 

Leu Leu Gin Ser Phe Val Tyr Ma He 5er Thr Ala iff 

Leu Ala Met Val Thr Phe Ala Gly He Arg Glu G^n Leu 

Asn Leu Pro Lys Ala Mel Lys Gly H« p^o Ser Ala Leu 

Gly He Leu Al. Met Ala Phe Met Gly Phe Ser Gly He 

185 



{2) niFORHATION FOR SEQ ID HO:4.|0 

ii) SEQUENCE CHARACTERISTICS- 

(A) LENGTH: 186 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(111) HYPOTHtTICAL: YES 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonnr. gingivalis 
t i X ) FEATURE : 

(A] NAME/KEY: misc featur- 

(B) LOCATION 1. . .186 

O-U SEQUENCE DESCRlPTIOtl: SEQ ID NO: 4 4 0 
Met Leu Phe He Ala Al. Val Phe Val Asn Asn VaX Val 
Phe Leu Gly H, p,, ^.^^^ 

ser He Gly Met Gly Ma Ala Val Thr Phe Val Leu Ala 
Leu val Thr Phe Leu Il« Gin tya Phe Val Leu A.p Arg 



Asn A^n Val 
1 5 

Val Ser Lyc 
30 

Phe Val Leu 

Val Leu Asp 

Leu Val He 
80 

Val Ser Pro 
95 

Thr Thr Asn 
110 

Asp Tyr Thr 

Gly Phe Thr 

Asp Met Thr 
160 

Leu Ala Ala 
175 

Ala 
190 



Leu Ser Gin 
15 

Val Asp Thr 
3C 

Leu Ala Thr 
Phe Gly Leu 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



„rBJj, -iiira, -aiuii „J1 -Bsnj, .-fi*' «="i» --.jP ti^Hl ul-'^J iS^^tt 



WO 99/29870 PCr/AU98/01023 

379/490 



50 55 60 



Glv 


Phe 


Met 


Gin 


Thr 


He 


Ala 


Phe 


He 


Lsu 


Val 


Tie 


Ala 


Ala 


Leu 


Val 


65 










70 










75 










80 


Gift 


H«L 


Val 


Glu 


lie 


Dn 


Leu 


Lys 


Lys 


Val 


Ser 


Pro 


Pro 


Leu 


Tyr 
90 


Gin 










85 










90 












Ala 


Leu 


Gly 


Val 
100 


Phe 


Leu 


Pro 


Leu 


He 
105 


Thr 


Thr 


Asn 


Cys 


Cys 
110 


Val 


Leu 


Gly 


Val 


Ala 


llo 


Leu 


VaJ 


He 


Gin 


Lys 


Asp 


Tyr 


Thr 


Leu 


Leu 


Gin 


Ser 




Hi 










120 










125 








Phe 


Val 

130 


Tyr 


Ala 


He 


Ser 


Thr 
135 


Ala 


He 


Gly 


Phe 


Thr 
140 


Leu 


Ala 


Met 


Val 


Thr 


Phfe 


Ai« 


Gly 


Xle 


Arg 


Glu 


Gin 


Leu 


Asp 


net 


Thr 


Asn 


Leu 


Pro 


Lys 


1 J5 








ISO 










155 










160 


A] a 


Met 


Lys 


Gly 


He 


Pro 


Ser 


Ala 


Leu 


Leu 


Ala 


Ala Gly He 


Leu 


Ala 










165 










I/O 










175 




Met 


Ala 


Phe 


Met 


Gly 


Phe 


Ser 


Gly 


He 


Ala 















180 185 



(2) INFORJIATIOi; FOR SEQ ID HO; 441 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 833 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOCT/; linear 

MOLECULE TYPE: protein 

(iii) HVPOTHETICAI*: YES 

(vi) ORIGINAL SOURCE; 

tA) ORGAIJISH: Porphyromonas gingival Is 

(ix) FEATURE: 

lA) UAME/KEY: mlffc feature 
I D ) LOCATIOM 1 . . . 833 



txl) SEQUENCE DESCRIPTION: SEQ ID NO: 4 41 



Met 


Lys 


Gin 


Leu 


Asn 


He 


He 


Ser 


Phe 


He 


He 


Ala 


Phe 


Lsu 


Phe Leu 


1 






5 










10 










15 


Gly 


Thr 


Ser 


Ala 


Ser 


Ala 


Gin 


Gin 


Ser 


Gly 


Gly 


Ser 


Val 


Thr 


Gly Thr 






20 










25 










30 




Val 


Val 


Asp 


Lys 


Ser 


Ser 


Lys 


Glu 


Pro 


He 


Alo 


Tyr 


Val 


Gin 


Val Phs 






35 










40 










45 






Val 


Lys 


Gly 


Thr 


Thr 


Leu 


Gly 


Thr 


Sec 


Thr 


Asp 


Ala 


Asn Gly Asn Tyr 




50 










55 










60 








Ser 


He 


Lys 


Gly 


Tie 


Pro 


Ser 


Gly 


Asn 


Glu 


Thr 


He 


Val 


Ala 


Arg Leu 


65 










70 










75 








90 


Met 


Gly 


Tyr 


Ser 


Thr 


Cyr 


Glu 


Glu 


Lys 


Val 


His 


He 


Glu Lys 


Gly Gly 










85 










90 










95 


Ser 


A eg 


Hta 


Val 


Asp 


Leu 


Tyr 


Leu 


Thr 


Glu 


Glu 


Tie 


Leu 


Ser 


Leu Asp 








100 










105 










110 




Gly 


Val 


Val 


Val 


Ser 


Ala 


Asn 


Arg 


Asn 


Glu 


Thr 


Phe 


Arg Arg 


Gin Ala 






115 










120 










125 






Pro 


Ser 


Leu 


Val 


Thr 


Val 


Leu 


Ser 


Pro 


Glu 


Leu 


Phe 


Leu 


Lys 


The Asn 




130 










135 










140 








Ser 


The 


Asn 


Leu 


Ser 


Gin 


Gly 


Leu 


Lys 


Phe 


Gin 


Pro 


Gly 


Leu 


Arg Val 


145 










ISO 










155 








160 


Glu 


Asp 


Asn 


Cys 


Gin 


Asn 


Cys 


Gly 


Phe 


Asn 


Gin 


Val 


Arg 


He 


Asn Gly 










165 










170 










175 


Leu 


Glu 


Gly 


Ala 


Tyr 


Ser 


Gin 


He 


Leu 


He 


Asp 


Ser 


His 


Pro 


He Phe 








180 










185 










190 




Ser 


ser 


Leu 


Ala 


Gly 


Val 


Tyr 


Gly 


Leu 


Glu 


Gin 


Met 


Pro 


Ala 


Asn Met 






196 










200 










205 






He 


Glu 


Arg 


Val 


Glu 


Val 


He 


Arg 


Gly 


Gly 


Gly 


Ser 


Ala 


Leu 


Phe Gly 




210 










216 










220 








Ser 


Asn 


Ala 


Val 


Gly 


Gly 


Val 


He 


Asn 


Val 


He 


Thr 


Lys 


Glu 


Pro Leu 


225 










230 










235 








240 


Axg 


Asn 


Scr 


Ala 


Glu 


He 


Ser 


His 


Ser 


Thr 


Met 


Thr. 


,Phe 


Affp 


His Ala 










245 










250 










2 55 


Lys 


Gly 


Trp 


Gly 


Ser 


Phe 


Gin 


Asn 


Thr 


Thr 


Gin 


Phe 


Asn 


Gly 


Ser Met 








260 










265 










270 




Leu 


Thr 


Glu 


Asp 


Arg 


Lys 


Ala 


Gly 


Vai 


H«t 


Val 


Phe 


Gly Gin His Asn 






275 










280 










285 






Tyr 


Arg 


Pro< 


. Gly 


Gin 


Asp 


Tie 


Asp 


Gly 


A&'p 


Am 


Phe 


Thr 


Glu 


Leu Pro 
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290 










295 










300 












I.eu 


Arg 


A.«n 


Arg 


Ser 


Leu 


Gly 


Phe 


Arg 


Ser 


Tyr 


Tyr 


Lys 


Thr 


Gly 


305 










310 










315 










320 


hau 


T^r 


Sec 


Lys 


Ala 


Thr 


Leu 


Glu 


Tyr 


His 


Ser 


Met 


Gin 


Glu 


Tyr 


Arg 










325 










330 










335 




Arq 


Gly 


Gly 


Asp 


Acg 


Leu 


Asp 


Asn 


Pro 


Pro 


Phe 


Glu 


Ala 


Gin 


He 


Ala 








340 










345 










350 






Glu 


Tyr 


l,eu 


Gin 


His 


Tyr 


He 


Asn 


Gly 


Gly 


Ser 


Phe 


Lys 


Phe 


Asp 


Gin 




355 










360 










365 








Gly 


Phe 


Ser 


Gly 


Gly 


Lys 


Asp 


Phe- 


Phe 


Ser 


Leu 


Tyr 


Ala 


Ser 


Ala 


Gin 




370 










375 










380 










As-p 


Val 


Gin 


Arg 


Arg 


Ser 


Tyr 


Tyr 


Gly Gly 


Gly Asp 


Tyr 


Thr 


Glu 


Asn 












390 










395 










400 


L4U 


Leu 


Arn 


Gly 


Ala 


Vai 


Gli-i 


Ser 


Gly 


Scr 


Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 










405 










410 










415 




Afn 


Acp 


Ala 


Phe 


Thr 


Ala 


Leu 


Thr 


Sec 


Tyr 


V»X J 


Thr 


Thr 


Lys 


Gly 


Phe 








420 










4 25 










430 






Atfp 


Leu 


Gin 


Gly 


Gly 


Gly 


Met 


Tyr 


Arg 


His 


Th t 


Phe 


Gly 


Glu 


Asn 


Trp 






435 










440 










445 








Asp 


Phe 


Thr 


Gly 


t^ly 


Leu 


Glu 


Tyr 


He 


T-'r 


Gl 


Gin 


Leu 


Asp 


Asp 


Arg 




450 










455 










460 










S-r 


Gly 


Tyr 


Arg 


Pro 


Stt 


Lys 


He 


A.sp 


Gin 


Asn 


Thr 


Ser 


Thr 


Phe 


Ser 












470 










4"5 












Gin 


Tyr 


A:jp Gl n 




Glu 


Tyr 


Ly5 


Thr 


Glu 


Lys 


Leu 


Ser 


Ala 


Leu 


He 










485 










4 ?0 










495 




GI 7 


Ala 


Arcj 


Il^i 


Asp 


Tyr 


Vol 


Leu 


Leu 




Gin 


Asp 


Gly 


Lys 


Arg 


Tyr 








500 










505 










510 






T1.S 


A,'?p 


Pro 


Leu 


Phe 


He 


Fhe 


Ser 


Pro 


Arg 


Ala 


Asn 


Val 


Arg 


Tyr 


Asn 






515 










520 










525 








Pl'O 


Aan 


Lye 


Asn 


L.SO 


SGr 


Phe 


Arg 


Leu 


Ser 


* 


Se r 


Glu 


Gly 


Phe 


Arg 




530 










535 










540 










Alf» 


Pro 


Gin 


T^'r 


Phe 


Asp 


Glu 


ASD 


Leu 


His 


Val 


Glu 


Leu 


Ala 


Gly 


Gly 












550 










555 










560 


Thr 


Pro 


lie 


Ser 


Arg 


Val 


Lcii 


Ser 


Pro 


Asn 


Leu 


Lys 


Glu 


Glu Arg 


Ser 










565 










570 










575 




Arg 


Ser 


He 


Ser 


Ala 


Ser 


Phe 


Asp 


- 

i i r 


T\r 


His 


Ara 


Ala 


Asp 


Glu 


Trp 








580 










505 










590 






Gin 


Phe 


Ash 


He 


Il«t 


Gly GUi 


Ala 


Ph« 


Ser 


Thr 


Phe 


He 


Ser 


Asn 


Gin 






595 










600 










60S 








Phe 


Lys 


Pro 


Ser 


hsp 


Lys 


Val 


Glu 


Thr 


Thr 


Ser 


Asp 


Gly 


Lys 


Glu 


Trp 




610 










515 










620 










lie 


lie 


Arg 


Thr 


He 


Tyr 


Af n 


Asp 


T.yr. 


Asp 


Gly 


Vol 


Ser 


Lys 


Val 


Tyr 


625 










63U 










635 










640 


Gly 


Val 


Asn 


LfrU 


Glu 


Gly 


Arg 


He 


Ala 


Tyr 


Asn 


Lys 


Ser 


Phe 


Asp 


Leu 










645 










650 










655 




Gin 




Gly 


Gly 


Thr 


Trp 


Gin 


Arg 


Ser 


Arg 


Tyr 


Gly 


Ser 


He 


Tyr 


Thr 








660 










665 










670 






Ala 


Val 


Glu 


Ala 


Asp 


Lys 


Thr 


Thr 


Gly 


Gin 


Ala 


Glu 


He 


Ser 


V-1 


Ly? 






675 










680 








6SS 








Asp 


Tyr 


Val 


Arg 


Thr 


Pro 


Asn 


Leu 


Tyr 


Gly 


Tyr 


the 


Val 


Ala 


Thr 


Val 




690 










€95 








700 










Arof 


Pro 


Thr 


Glu 


His 


Phe 


Ala 


He 


Asn 


Leu 


Ser 


Gly 


Thr 


Fhe 


Thr 


Gly 


70S 










710 










715 










720 


Lys 


Met 


Asp Val 


Val 


His 


Glu 


Ala 


Tyr 


Glu 


Giy 


Asp 


He 


Pro 


Ala 


Glu 










725 










730 










735 




Hi5 


He 


Ala 


Pro 


Asp Gly 


Ser 


Phe 


Asp 


Phe 


Glu 


Met 


Asn 


Gly 


Gin 


Gin 








740 










745 










750 






Phe 


Lys 


Gly Leu 


Ala 


Glu 


Gly 


His 


Ala 


Lys 


Leu 


V3l 


Lys 


Thr 


Pro 


Ala 






755 










760 










7 65 








Phe 


Ala 


Asp 


He 


Asp 


Leu 


Lyc 


Leu 


Ser 


His 


Asp 


Phe 


His 


Leu 


Ala 


Ser 




■770 










775 










760 










Thr 


Met 


Thr 


Lsu 


Glu 


Leu 


Asn 


Ala 


Gly 


He 


Gin 


Asn 


He 


Phe 


Asn 


Scr 


785 










7 90 








795 










800 


Tyr 


Gin 


Lys 


Asp 


Thr 


Asp 


Lyt; 


Gly 


Pro 


Gly 


Arg 


Ma 


Ser 


Thr 


Tyr 


Val 










805 










810 










815 




Tyr 


Gly 


Pro 


Met 


Gin 


Pro 


Arg 


Arg 


He 


Phe 


Val 


Gly 


Thr 


Lys 


Hs 


Am 








820 










825 










8J0 






Phe 

































(2 1 lUFOWlATTOtr FOR SEQ XD NO: 44 2 

U) SEQUEIICE CHARACTERISTICS: 

(A) LEIIGTH: 091 amino acids 

(B) TYPE: amino acid 



SUBS Ml U IE SHEET (Rule 26) (RO/AU) 
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{DJ TOPOLOGY: linear 

(11 J M01>EajLt: TVPE: piotein 

5 tiii) HYPOTHETICAL: YES 

(vl ) ORIGINAL SOURCE: 

(A) ORGAfJISII; Pcrphyromoaafi' gingiva lie 

10 (ix) FEATURE: 

(A) MAllE/KEY: misc_feature 
(3) LOCATION 1 . . .891 
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(y.il b'EQUtNCE DE3CRIPTI0H: SEQ ID NO: 442 





Tyr 


Lys 


Lys 


He 


He 


Ala 


Val 


Ala 


Al a 


Leu 


Phe 
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II ** 










5 










10 










15 




oiy 


XI 5 


Leu 


Lys 


Gly Gj.n 


Ser 


Ser 


Asp 


Leu 


Thr 


Pro 


Gin 


Asp 


Thr 


He 








20 










25 










30 






Tyr 


Se c 


Plo 


Glu 


He 


Ser 


1 \ r 


Ala 


Ly5 


PCC7 


He 


His 


Lye 


Thr 


lie 


Al =i 






35 










40 










4 i* 








Ser 


He 


Gl u 


He 


Glu 


Gly 


1 )er 


Arg 


Ser 


Phe 


Asp 


Asp 


Phe 


val 


Leu 


Arg 




50 










55 










f ^ 










A.'jri 




^er 


ij1 y 


Leu 


Ala 


Vdl 


Gly Acp 


'jIu 




i.-U 


He 


Pro 


i»ly 


A^p 


65 










7 0 




















60 


Aia 


Met 


Se f 


Al a 


Ala 


Val 


sn 


Arg 


He 


Met 


Arg 


Gill 


Gly 


Tyr 


Phe 


S<jr 




















90 














A^rn 


Val 


Arg 


11" 


He 


Ala 


Asp 


Lys 


Tyr 


Val 


Gl y 


Asn 


7-ys 


Val 


Tyr 


Leu 








1 00 










105 










110 






Lys 


He 


T 


Val 


Thr 


Glu 


Arg 


Pro 


Arg 


He 


Ser 


Lys 


Val 


Thr 


Phe 


Ser 






115 










120 


















Gly 


Val 


Lys 


uyj? 


Ser 


Glu 


Arg 


Glu 


Affp 


Leu 




H»t 


Lys 




Gly 


Leu 




130 










135 










140 










Arg 


Glu 


Gly 


He 


Gin 


Met 


Thr 


Arg 


Asn 


A«sn 


Glu 


Asp 


Lv 


Val 


Arg 


Gi n 


145 










150 










1 55 










1 60 


He 


Val 


Gin 


Lys 


Tyr 


Phe 




Glu 


Lys 


Gly 


Tyr 


Arg 


Asp 


Al 


Ser 


He 










165 










170 










175 




Acg 


He 


The 


Gin 


Glu 


Pro 


Asp 


r.eu 


Sec 


Lys 


A^p 




Phe 


Val 


Asn 


Val 








100 










185 










190 






Leu 


He 


Ser 


He 


Glu 


Lys 


Lys 


Ser 


Lys 


Thr 


Lys 


Val 


Asn 


Glu 


He 


Tyr 






155 










200 










205 








Phe 


Ser 


Gly 


Asn 


Lys Ala 


Leu 


Ser 


Asn 


His 


Lys 


Leu 


Arg 


Met 


Ala 


Met 




210 










215 










220 










Lye 


Asn 


Thr 


Asn 


Ala 


Lyf 


Phe 


Ser 


Leu 


Arg 


Lys 


Hii? 


He 


Arg 


Ser 


Ser 


225 










230 










235 










240 


Phe 


Leu 


Lys 


Leu 


Phe 


Ser 


Thr 


His 


Lys 


Phe 


Val 


Glu 


Glu 


Ser 


Tyr 


Arg 










245 










250 










255 




Glu 


Asp 


Leu 


Val 


Arg 


Leu 


He 


Glu 


Lys 


Tyr 


Sin 


Glu 


Tyr 


Gly 


Tyr 


Arg 








260 










265 










270 






Asp 


Ala 


Glu 


He 


Leu 


Thr 


A£p 


Ser 


Val 


Val 


Lys 


Ala 


Pro 


Asp 


Gly 


Lys 






275 










280 










285 








Arg 


Val 


Asp 


He 


Tyr 


Leu 


Asn 


He 


Glu 


Glu 


Gly 


Gin 


Lys 


Tyr 


Tyr 


He 




290 










295 










300 










Lys 


Asp 


Val 


Asn 


Phe 


Val 


Gly 


Asn 


Sor 


Gin 


Tyr 


Pro 


Ser 


Glu 


Tyr 


Lsii 


305 










310 










315 










320 


Glu 


Arg 


Val 


Leu 


Gly 


He 


L\-s 


Ser 


Gly 


Asp 


Val 


Tyr 


Asn 


Gin 


Arg 


Arg 










325 










330 










335 




Leu 


Aia 


Lys 


Arg 


Leu 


Asn 


Glu 


Asp 


Glu 


Asp 


Ala 


Val 


Gly 


Asn 


Leu 


T^-r 








340 










345 










350 






Tyr Asn 


Asn 


Gly 


Tyr 


He 


Phe 


Ala 


Trp 


Val 


Asp 


Pro 


Val 


Glu 


Thr 


Asn 






355 










360 










365 








Val 


Val 


Gly 


A=p 


Sec 


Val 


Sec 


Leu 


Asp 


He 


Arg 


He 


Ala 


Glu 


Gly 


Lys 




370 










375 










380 










Gin 


Ala 


Aan 


He 


Asn 


Ly^ 


Val 


He 


He 


Lys 


Gly 


Asn 


Thr 


Val 


Val 


Tyr 


385 










390 








395 










400 


Glu 


Afp 


Val 


Val 


Arg 


Arg 


Glu 


Leu 


Tyr 


Thr 


Lys 


Pro 


Gly 


Gin 


Leu 


Phe 










4 05 










410 










415 




Ser 


Arq 


Glu Asp 


He 


He 


As-n 


Ser 


He 


Arg 


Leu 


He 


Asn 


Gin 


Leu 


Gly 








420 










425 










4 30 






His 


Phe 


Asp 


Ma 


Glu 


Lys 


Ser 


He 


Pro 


Aiy 


Pro 


He 


Pro 


Asn 


Pro 


Glu 
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440 










445 








Thr 


Gly Thr 


Val 


Asp 


lis 


Glu 


Tyr Asp 


Leu 


Val 


Pro 


Arg 


Ssr 


Ser 






450 










455 










4fiO 










Gin 


Leu 


Glu 


Leu 


i*er 


Val 


Gly 


Trp 


Ser 




Ser 


Gly 


Leu 


Leu 


Phe 


Arg 


465 










470 










475 










480 
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Gly 


Ala 


lie 


Lys 


E»he 


Thr 


Asra 


Phe 


Se r 


Va 1 


Gly 


Asn 


Leu 


Leu 


His 


Pro 
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490 










4 95 




Ser 


Met 


Tyr 


Lys 


Lys 


Gly 


rie 


lie 


Pro 


Gin 


Gly 


Asp 


Gly 


Gin 


Thr 


Leu 








500 










505 










510 






S*r 


Leu 


Ser 


Ala 


Gin 


Thr 


Asn 


Gly 


Lj s 


Tyr 


Tyr 


Gin 


Gin 


Tyr 


Ser 


Val 






515 










520 










525 








Thr 


Phe 


Met 


Asp 


Pro 


Trp 


Phe 


Gly 


Gly 


Lys 


Arg 


Pro 


Asp Met 


Phe 


Ser 




530 










535 










540 










Phs 


Ser 


Ala 


fhs 


Tyr 


Ser 


Lys 


Thr 


Thr 


Ala 


He 


Asp 


Ser 


Lys 


Phe 


Tyr 


545 










550 










555 










560 




Sei: 


A«n 


Ala 


Gly 




Tyr 


Tyr 


Asn 


Ala 


T^^r 


Tyr 


Asn 


Ser 


Tyr 


Tyr 










S&5 








57 0 










575 




Aen 


A^n 


■* J * 


Asn 


Ser 


- 

1 i r 


Tyr Asn 


Gly 


Met 


Ser 


Acp 


Tyr 


Thr 


ray 










500 










585 










590 








1 yr 


Thr 


Gin 


Ala 


Ser 


A.-P 


Pro 


Arp 


A r -T 




Leu 


Gin 


Met 


Leu 


Gly 






595 










600 










605 








Th r 




He 


Gly Tyr 


Gly 


Lys 


Arg 


Leu 


Ttir 


T rn 

1 rp 


Pro 


A:rp 


Asn 


Trp 


Phe 




610 










615 










620 










Gin 




Ti'r 


Thr 


Ssr 


Leu 


A?n 


Tyr 


Thr 


Tyr 


T--r 


Arg 


Leu 


Arg 


Asn 


Trp 
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630 
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610 


Ser 
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Thr 


Phe 
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Phe 
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Gly 


Ser 


Ala 


Asn 


Asp 


Leu 
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Arq 
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Ser 


Arg 


Thr 




lie 


Asp 
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Pro 
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Tyr 
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665 










670 








rg 


Ssr 


Gly 


Ser 


Asp 


Phe 


Met 


Vsl 


Co r 


V?i 1 


Ala 


Ala 


Thr 


Leu 


Pro 






'^75 










680 










685 








TyE 


Scr 


Leu 


Trp 


Asp 


Asn 


1(13 


Asp 


Tyr 


Al a 


Sfer 


Gin 


Asn 


Leu 


Ser 


Vsl 




690 




















700 










Ssr 


Asp 


rg 


Tyr 


Arg 


Phe 


lie 


Glu 


Tyr 


HI s 


Lys 


Trp 


Lys 


Phe 


Arg 


Gly 


705 










710 










715 










720 


Arg 


V3l 


Phe 


Thr 


Pro 


Lsu 


Leu 


Asn 


Pro 


Ala 


Thr 


His 


Lyr 


Tyr Thr 


Pro 










725 










730 










735 




Val 


Leu 


Met 


Ser 


Acg 


Val 


Glu Gly Ala 


Val 


Leu 


Gly 


Ser 


Tyr 


Asn 


Ser 








740 










7-15 










750 






Asn 


Lyff 


Lys 


Ser 


Pro 


Phe 


Gly 


Thr 


Phe 


Tyr 


Met 


Gly Gly Asp Gly Met 
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760 
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S*r 


Ser 


Tyr 


Tyr 


Gly 


Giy Tyr Met 


A£:n 


Glu 


Thr 


He 


Gly 


Leu 


Arg 


Gly 
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775 
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Tyr 


Lys 


Asn 


Gly 


Ser 


I] e 


Ala 


Giy Asn 


Asn 


Tyr 
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Tyr Ala 


Tyr Ala 
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Arg 
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Met 


Glu 


Leu Arg 


Phe 
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He 


Leu 


Phe 


Glu 


Asn 
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810 
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Sec 


Phe 


Asn 


Ala 


Trp 


Leu 


Leu 


Ala 


Fhe 


Ala 


Glu 


Ala 


Gly 


Asn 


Ala 


Trp 
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830 






Arg 


Ser 
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Asp 


Asn 


Tyr Asn 


Pro 


Phe 


Asn 


Leu 


Lys 


Arg 


Ser 


Ala 


Gly 






835 
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Leu 
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12) IMFORJ'IATIOI) FOR SEO ID NO: 443 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEUGTH: 17 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPF.: protein 

(111) H-fPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAIJISIU Porphyron-tonas qinqivalis 

dx] FEATURE: 

(A! MAItE/KE:-: misc_feature 
{B) LOCATlOfl I. . - 174 

(Xil SEQUENCE DES'JRIPTIOH: SEQ ID HO: 443 

Met Asn Gly Asp Met Lys Arq Phe Leu He Lfeu lie Gly Phe Ala L«u 
1 5 IC 15 
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Arg 


Ser 


Asp 


Leu 


Met 


Lys 


fro 


He 


Gin 


Asp 


Glu 






115 










120 










125 








He 


Trp 


Asn 


Ala 


He 


Lys 


Glu 


He 


Ala 


Lys 


Arg 


A£n 


Asn 


Tyr 


Gin 


Met 




130 










135 










140 










Vdl 


Leu 


Asp 


Arg 


Gly 


Thr 


Ser 


Gly 


He 


lie 


Fhe 


Ala 


Ser 


pro 


Ser 


He 


14b 










150 










155 










160 


As-p 


He 


Ser 


Asp 


Leu 


Val 


Leu 


Ser 


Lys 


Met 


Gly 


Pho 


Scr 


Lys 







165 170 



{2) lUFOWIATIW FOR SEQ ID HO: 44 4 

(i) SEQUENCE CHAH;>.CTERISTICS: 

(A) LENGTH: 17 0 amino acid* 
(B> Tl'PE: amino acid 
fD) TOPOLOGY: linear 

(11 » KfOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vl) ORIGIHAr. SOURCE: 

(A) ORGAKISH: Porphyromonaj gingivalis 

iix) FEATURE : 

(A) NAME/KEY: mi3C_feature 
IB) LOCATiaJ 1 . . .170 



(Xi) SEQUENCE DESCRIPTION: SEQ ID HOi't'l'l 



Het 


Lys 


Ary 


Phe 


Leu 


He 


Leu 


He 


Gly 


Phe 


Ala 


Leu 


Ala 


Val Ala Phe 


1 








5 










10 








15 


Ser 


Gly 


Phe 


Ser 


Gin 


Lys 


Phe 


Ala 


Leu 


Val 


Asp 


not 


Glu 


Tyr He Leu 








20 










25 










30 


Ary 


Asn 


He 


Pro 


Asp 


Tyr 


Glu 


Met 


Het 


Asn 


Glu 


Gin 


Leu 


Glu Gin Val 






35 










40 










45 




Ser 


Lys 


Lys 


Trp 


Gin 


Asn 


Glu 


He 


Glu 


Ala 


Leu 


Glxi 


Asn 


Glu Ala Gin 




50 




















60 






Ser 


Het 


Tyr 


Lys 


Lys 


Tyr 


Gin 


Ser 


Asp 


Leu 


Val 


Phe 


Leu 


Ser Ala Ala 


65 










70 










75 






80 


Gin 


Lys 


Lys 


Thr 


Gin 


Glu 


Glu 


Ala 


He 


Val 


Lys 


Lys 


Glu 


Gin Gin Ala 










85 


















95 


Ser 


Glu 


Leu 


t.ys 


Arg 


Lys 


Tyr 


Phe 


Gly 


Pro 


Glu 


Gly 


Glu 


Leu Tyr Lys 








100 










105 










110 


Lys 


Arg 


Ser 


Asp 


Leu 


Het 


Lys 


Pro 


He 


Gin 


Asp 


Glu 


He 


Trp Asn Ala 






115 










120 










125 




He 


Lys 


Glu 


He 


Ala 


Lys 


Arg 


Asn 


Asn 


Tyr 


Gin 


Met 


Val 


Leu Asp Arg 




130 










135 










140 






Gly 


Thr 


Sec 


Gly 


He 


He 


Phe 


Ala 


Ser 


Pro 


Ser 


rie 


Asp 


He Ser Asp 


145 










150 










155 






160 


Leu 


Val 


Leu 


Ser 


Lys 


Met 


Gly 


Phe 


Ser 


Lys 


















165 










170 










(21 


IIIFOR1IA7ION 


FOR 


SEQ 


ID NO: 445 













(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 163 amino acids 

(B) TYPE: amino acid 
(11) TOPOLOGY: linear 



(ii) nOLECULE TYPE: protein 
(iil) H-^-POTHETlCAL: iTS 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




,-s»P ti?"i\ 



;il to 'IT™ O 'ii«ai 



WO 99/29870 



PCT/AU98/01023 



384/ 490 



(VH ORiaillAL SOURCE: 

(A) ORGAJilSM: Porphyromonas ginglvalis 

(lK> FEATURF.: 

(A) IIAIIE/KEY: misc_£eature 
fB) LOCATIOM 1. . .163 

txi) SF.QUF.IJCE DESCRimW: SEQ ID NO: '145 





Lys 


Lys 


Phs 


Phfe 


Leu 


Met 


Leu 




Met Ala 


Leu 


Pro 


Leu 


Ser 


Leu 


1 








5 










10 








15 




Lou 


Ala 


Gin 


Lye 


Val 


Ala 


Val 


Val 


Asn 


Thr Glu 


Glu 


He 


lie 


Ser 


L\'s 








20 










25 








30 






net 


Pro 


GIu 


Glii 


Val 


Ala 


AJ;* 


Thr 


Lys 


Gin Leu 


Asn 


Glu 


Lsu 


Ala 


Giu 






35 










40 








AS 








Lye 


Tyr 


Arg 


Leu 


Asp 


Leu 


Lys 


Ser 


Met 


Pisp Asp 


Glu 


Phe 


Ala 


Lys 


Lys 




50 










S5 








€0 










Thr 


GJu 


Glu 


Phe 


Val 


Lys 


Glu 


Lys 


Asp 


Ser Leu 


Leu 


Glu 


A^n 


lis 


Arg 


65 










70 








75 










80 


Asn 


Arg 


Arg 


Gin 


Gin 


Glu 


Leu 


Glr\ 


Asp 


lie Gin 


Thr 


Arg 


Tyr 


Gin 


Gin 










85 










90 












Ser 


T/r 


Gill 


Thr 


Met 


Gin 


Glu 


Asp 


Leu 


Gin Lys 


Arg 


Gin 


Gin 


Gin 


Leu 








lOQ 










105 








110 






Fha 


Ala 


rro 


lie 


Gin 


Gin 


Lys 


Val 


Ala 


Asp Alo 


He 


hyr 


Ly^ 


Val 


Gly 






115 










120 








125 








Asp 


Glu 


Glu 


Asn 


Cys 


Alo 


Tyr 


Tl» 


net 


Glu Ala 


Gly 


(let 




Leu 


Tyr 




130 










135 








IJO 










Thr 


Gly 


Al a 


Thr 


Ala 


11- 


Asp 


Leu 


Thr 


Aim Lys 


Vol 


Ly^ 


Ala 


Lys 


Leu 


115 










1 HO 








155 










160 


Gly 


He 


Lys 



























(2) INFOWIATION FOR SEQ ID J fO: 44 6 

(i) SEQUE)1CE CRARACTERl STICS: 

(A J LENGTH: 82V aiaino acids 
tB) TYPF.: amino acid , 
(D) TOPOLOGY: linear 

tli) MOLECULE TYPE: protein 

(tii) HYPOTHETIC7VL: YES 

(vl) ORIGIHAL SOURCE: 

{A> ORGAJJISH: Porphyromonas glngivalis 

{ir.l FEATURE: 

I A} HAME/KEY: misc Leature 
IB) LOCATIOIJ 1- . , 827 

(Xi) SEQUENCE DESCRIPTTOH: SEQ ID NO: 446 

HeL Lye Glu Ala lie Pro Arg Lys Asn Lys Tyr He Lyc Leu Asn Gly 

1 5 10 15 

He Tyr Arg Leu Ser Phe He Leu Leu Cys Cys Leu Leu Cys Scr GJn 

20 25 30 

Ala Ala M-t Ala Gin Gly Val Arg Val. Ser Gly Tyr Val Leu A^p Arg 

35 40 45- 

Gly Glu hys Pro He Pro Phe Ala Gly Val Lys Val Arg Gly Thr Gly 

50 55 60 

Thr Gly Ala Thr Thr Asn Leu Lys Gly Tyr Tyr Glu Phe Arg Met Lys 
65 70 75 80 

Ala Thr Thr Asp Ser He Thr He Glu Phe Ser Ser llet Gly Tyr Gin 

85 90 95 

Gly Val Ser Arg iier Phe Pro Ser Leu Thr Ly^ Asp Thr Arg Leu Asn 

100 105 no 

Val Arg Leu Ala Glu Ala Glu Hftt Glu Leu Ser Ser Val Thr Val Gin 

lis 120 125 

Ala Thr Uyf* Arg Arq Leu Am Thr Met Glu Arg Val-.Asn Thr Arg Asp 

130 135 140 

Leu Arg Val Asn Ala Gly Pro Thr Gly Gly Val Glu Ser Leu He Ser 
145 150 155 160 

Thr Tyr Ala Gly Val Thr Gin Asn Asn Glu Leu Ser Ser Gin Tyr Ser 

165 170 ]75 

Val Arg Gly Gly Sex Tyr Asp Glu Asn Met Val Tyr Val Asn Gly Val 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 
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180 










185 










190 






Glu 


Val 


Tyr 


Arg 


Fro 


Leu 


Leu 


Va 1 


Arg 


Ser 


Ai a 


Gin 


Gin 


Glu 


Gly 


Leu 






195 










200 










205 








Ser 


Phe 


Val 


Asn 


Pro 


Asp 


Leu 


Thr 


Gin 


Ser 


Val 


GJ n 


Phe 


Ser 


Al™ 


Gly 




210 










215 










220 










Gly 


Phe 


Thr 


Ala 


Asp 


Tyr 


Gly 


hsp 


Lys 


Het 


Sar 


Ser 


Val 


Leu 


A^p 


He 


225 










230 










235 










240 


Arg 


Tyr 


Lys 


Gin 


Pro 


Gin 


Glu 


Lys 


Glu 


Gly 


Ala 


Val 


Leu 


Leu 


Gly Het 










245 










250 










255 




Leu 


Gin 


Ser 


Ser 


Al a 


Tyr 


Tyr 


'^-y 


Ser 


Ser 


Ala 


Gly 


Ala 


Phe 


Ser 


Gin 








260 










265 










27 0 






lie 


Thr 


Gly 


Val 


Arg 


Tyr 


Lys 


Ser 


Ala 


hys 


Ser 


Leu 


Leu 


Gly Thr 


Thr 






275 










280 










285 








Anp 


Thr 


Lys 


Ala 


Glu 


Tyr 


Afp 


Pro 


He 


Tyr 


Ala 


Asp Gly 


Gin 


The 


Phe 




290 










295 










300 










Het 


Thr 


Tyr 


Ary 


Phe 


Ser 


Pro 


Lys 


Leu 


Ser 


Val 


Ser 


Phe 


Leu 


Gly 


Asn 


305 










310 










315 










320 


Tift 


Ser 


Gin 


Thr 


Arg 


Tyr 


Lys 


Phe 


Val 


Pro 


Gin 


Thr Arg 


Glu 


Thr 


Ser 










325 










330 










335 




Phe 


Gly 


Thr 


Leu 


Ser 


Asp 


Ala 


Ly5 


LVff 


Leu 


Lys 


He 


Phe 


Phe 


Asp 


Gly 








34 0 










345 










350 






Gin 


Glu 


CI n 


A.'sp 


Arg 


Phe 


Leu 


Thr 


Tyr 


Phe 


Gly 


Ala 


Phie 


Ser 


I let 


Asn 






355 










36Q 










365 










Val 


Pro 


Asp 


Aep 


Lys 


Gill 


Arg 


His 


Thr 


Val 


Thr 


Leu 


Ser 


Ala 


Phe 




370 










375 










380 










Afn 


Ser 


Acn 


Glu 


Arg 


C\ij 


Thr 


Tyr 


Asp 


He 


Gin 


Gly 


Glu 


Tyr 


Phe 


Leu 












390 










395 










4 00 


Asn 


Asp 


Val 


Gin 


Leu 


Gly 


Aid 


.i-^p 


Gly 


Thr 


Ala 


Ser 


Met 


Ala 


Ser 


Giy 










4 05 










dlO 










415 




Sfei 


Glu 


Arn 


Ser 


Asn 


Gly 


Leu 


Gly 


Il3 


Gly 


Axg 


Asn 


His 


Glu 


Hii- 


Ala 








420 










425 










430 






Arg 


Asn 


Arg 


Leu 


Ser 


Tyr 


Axg 


Val 


Leu 


Asn 


(let 


Gly 


Tyr 


Arg Gly 


Glu 






435 










440 










445 








Met 


hys 


Leu 


Asn 


Glu 


Lys 


His 


Arg 


Leu 


Gin 


Ala 


Gly 


Val 


Ser 


Ala 


Gin 




450 










455 










460 










Met 


Glu 


Lys 


lie 


Ala 


Asp 


His 


lie 


Ser 


Gl u 


Trp 


Glu 


Arg 


Arg Asp 


Ser 


465 










4 70 










475 










480 


Val 


Gly 


Tyr 


Asn 


Leu 


Pro 


His 


Ser 


Glu 


Thr 


Val 


Leu 


Leu 


Met 


Tyr Asn 










485 










4 90 










4 95 




Asn 


Leu 


Tyr 


Ala 


Asp 


Thr 


Gin 


net 


Arg 


Gly 


Thr 


Arg 


Leu 


Ser 


Ala 


Phe 








500 










505 










510 






Val 


Gin 


Aj?p 


Arg 


Phe 


Asn 


Phe 


Ser 


Het 


Gly 


Gly 


Gly Thr 


Phe 


Ser 


Lou 






515 










520 








525 








lie 


Pro 


Gly 


lie 


Arg 


Ala 


Ser 


Trp 


Trp 


Ser 


Phe 


Asn 


Lys 


Glu 


Leu 


Lsu 




530 










535 










540 










Val 


Ser 


Pro 


Arg 


lie 


Ser 


Val 


Gly 


Tyr 


Ser 


Pro 


Glu 


Ser 


Asn 


Pro 


Ala 


545 










550 










555 










560 


Leu 


Vai 


L«u 


Arg 


Al a 


Ala 


Ala 


'^ly 


Leu 


Tyr 


Tyr 


Gin 


Ala 


Pro 


Phe 


Tyr 










565 










570 










57 5 




l.ya 


Glu 


Leu 


Arg 


Gin 


Thr 


His 


Lys 


Aep 


Ala 


Glu 


Gly Aen 


Afln 


Val 


Val 








580 










585 










590 






Val 


Leu 


Af n 


Glu 


Lys 


He 


Arg 


Ser 


Gin 


Gly 


Ala 


Phe 


His 


He 


Leu 


Ala 






595 








600 










605 








Gly 


Al- 


A.?p 


Tyr 


Thr 


Phe 


Glu 




Gly 


Gly 


Arg 




Tyr 


Lys 


Phe 


Thr 




610 










615 










620 










Ala 


Glu 


Ala 


Tyr 


Tyr 


Lys 


Ser 


Leu 


Phe 


Asn 


He 


Asn 


Pro 


Tyr 


He 


He 












€30 










635 










6-tO 


Glu 


Asn 


Val 


Lys 


He 


Arg 


Tyr 


Leu 


Gly 


Glu 


Asn 


He 


Gly 


Ser 


CAy 


Tyr 










645 








650 








655 


Ala 


Ala 


Gly 


lie 


Aep 


Leu 




Leu 


Phe 


Gly 


Glu 


Leu 


Val 


Pro 


Gly 


Val 








660 










665 










670 






Affp 


Ser 


Trp 


Leu 


Thr 


Ala 


Ser 


He 


He 


Ly- 


Ala 


Arg 


Gin 


Lys 


Leu 


Asp 






675 










680 










685 








Gly 


Tyr 


Gly 


Ser 


T*cu 


Fro 


Leu 


Met 


Asn 


Aia 


Pro 


Thr 


Tyr 


Asn 


Phe 


Ser 




6?0 










695 










700 










Phe 


Phe 


Leu 


Gin 


Glu 


Tyr 


Val 


Pro 


Gly 


A.sn 


uys 


Arg 


He 


Thr 


Ala 


Thr 


705 










710 










715 










720 


Leu 


Arg 


Ala 


Ala 


Leu 


Ser 


Gly 


Gly 


Leu 


Pro 


Gin 


Leu 


Asn 


Pro 


Ser 


Lya 










725 










730 










735 




Cly 


Leu 


Ser 


Ser 


Pro 


Ala 


Phe 


Thr 


Ala 


Pro 


Ala 


Tyr 


Lys 


Arg 


Val 


Asp 








740 










745 










750 






Leu 


Gly 


Val 


Het 


Tyr 


Lys 


Trp 


Leu 


Aep 


Pro 


Asp 


Aep 


Scr 


Phe 


Ala 


Gly 






7S5. 




















765 








Pu-g 


Ser 


Lyff 


Trp 


L-u 


Het 


Gly 


Val 


hys 


Gly 


Ala 


Tyr 


He 


Gly 


Ala 


Asp 




770 










775 








7S0 











SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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Leu Phe Am Leu fhe Asp Met Thr Asa V^i fKsn Ser Tyr Tyr Trp Val 
7B5 790 795 800 

Sec Asp Ali Tyr Gin Gin Gin Tyr Ala Val Fro As-n Tyr Leu Thr Arg 

(305 810 815 

Arg Gin Phe Asn Leu Arg Leu Leu Val Giu Phe 
B20 625 

(21 rUFOWlATIOM FOR SEQ ID NO: 4 47 

\i) SEQUE14CE CHARACTERISTICS: 

(A) LENGTH: ^17. amino acids 
tB} TYPE: amino acid 
(D) TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

till) HYPOTHETICAL: YES 

(vi) OftlGIMAL SOURCE: 

(A) ORGANISM: Porphyromonas glngWall;? 

(ix) FEATURE 1 

(A) miE/KEY: misc feature 

(B) LOCATION 1. . . 672 





0:1) 


SEQLIEWCF. DESCRIPTION: SEy ID 1*0:4 41 










(let 


Tyr 


Ser 


Gly 


His 


His Lys 


lie 


His 


Tyr 


Pro 


Phe 


Leu 


Tie 


Leu 


Leu 


1 








5 








10 










15 




Val 


Cys 


Leu 


Ala 


Phe 


Ala Ala 


cys 


Lys 


Ser 


Val 


Lys 


Leu 


Lys 


Asp Ala 








20 








25 










30 






Glu 


Lys 


Ala 


His 


Asp 


Arg Gin 


Glu 


Tyc 


Thr 


Lys 


A1 a 


Alfl 


Asp Met 


Tyr 






35 








40 










45 








Am 


Thr 


Leu 


Tyr 


Arg 


Arg Thr 


Arg 


Arg 


Lys 


Gin 


Val 


Glu 




Lys 


Ala 




50 








55 










60 










rye 


Thr 


Ala 


Phe Arg 


Ser Gly 


Glu 


Asn 


Tyr 


Arg 


Ala 


Ala 


Gly Arg 


Gin 


65 










70 








75 










80 


Ala 


Lys 


Ala 


Leu 


Arg 


Gly Tyr 


Leu 


Asn 


Ala 


Arg 


Arg 


Tyr Gly Tyr 


Pro 










85 








90 










95 




Asp 


Ser 


Val 


Val 


Leu 


Leu Arg 


Leu 


Ala 


Gin 


Thr 


Tyr 


Gin 


Gin Gly Gly 








100 








105 










110 






Asn 


Tyr 


hys 


Glu 


Ala 


Glu Val 


Leu 


Phe 


Arg 


Gly 


Tyr 


Leu 


Glu 


Ala 


Tyr 






115 








120 










125 








Pro 


Lys 


Ssr 


Tyr 


Phe 


Ala Ala 


He 


Gly 


Leu 


Glu 


Gly 


Cys 


Leu 


Phe 


Ala 




130 






135 










140 










Arg 


Gin 


Gin 


Lys 


Glu 


Tyr Pro 


Thr 


Arg 


Tyv 


Ar<j 


Ho 


Arg 


Arg 


Ala 


Ala 


145 










150 








155 










3 60 


Glu 


Trp Asn 


Ser 


Als 


Arg Gly 


A£p 


Phe 


Gly 


Pro 


Ala 


Tyr 


Ala 


Pro 


Asp 










165 








17C 










175 




Ala 


Ser 


Ala 


Leu 


Tyr 


Phe Thr 


Ser 


Syr 


Arg 


Ser 


Lys 


Asp 


Asp 


G] y 


L^u 








180 








105 










190 






Asp 


Am 


Ser 


Ser 


lie 


Thr Gly 


Lsu 


Lys 


Pro 


Asn 


Asp 


He 


Tyr 


He 


He 






195 








200 








205 








Lys 


Arg 


Asp 


Ala 


Gin 


Gly Arg 


Trp 


Gly 


Arg 


Pro 


Asp 


Ser 


Val 


Ser 


Gly 




210 








215 










220 










Gly 


lie 


Asn 


Thr 


Pro 


Trp Asp 


Glu 


Gly 


Val 


Pro 


Thr 


He 


Thr 


Pro 


Asp 


225 










230 








235 










240 


Gly Ser 


Thr 


lie 


Tyr 


Tyr Thr 


Leu 


Ala 


Gin 


Gin 


Gly 


Ala 


Asp Tyr 


Asp 










245 








250 










255 




Arg 


Thr 


Val 


Gin 


lie 


Tyr Sfir 


Ala 


Ala 


Arg 


Ser 


Gly 


Glu 


Gly Gly 


Trp 








260 








265 










270 






ser 


A^n 


Gly 


Ser 


Leu 


Val Asp 


He 


Met 


Arg 


Asp 


Ser 


Leu 


Arg 


Met 


Ala 






275 








280 










285 








Aid 


His 


Pro 


Ser 




Ser Ala 


Ser 


Gly 


Asp 


Tyr 


Leu 


Tyr 


Phe 


Val 


Ser 




2 90 








295 








300 










Asn 


lie 


Gly 


Gly 


Ser 


Tyr Gly 


Gly 


Lys 


Asp 


He 


Tyr 


Arg Val 


Lys 


Val 


305 










310 








315 










320 


Ser 


JKsp Arg 


Ser 


Tyr Gly Ser 


Pro 


Glu 


Asn 


Leu 


Gly 


Pro 


Asp 


He 


Asn 










325 








330 










335 




Thr 


Pro 


Gly 


Asp 


Glu 


Met Pha 


Pro 


Phe 


He 


Asp 


Gly 


Asp 


Ser 


Thr 


Leu 








3<10 








34 5 










350 






Phe 


Phe 


Ala 


Ser Asp Gly His 


Ala 


Gly 


Leu 


Gly 


Gly 


Leu 


Asp 


He 


Ph© 






355 








360 










365 








Lys 


Ala 


Thr 


Leu 


Asp 


Ser Thr 


Gly 


Gin 


Trp 


His 


Val 


Val 


Asn 


Met 


Gly 




370 








375 








380 
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Gin 


Pro 


Val 


JKsn 


Ser 


Ser 


Ala 


Aap 


Asp 


Phe 


Gly 


Leu 


Ala 


Val 


Glu 


Pro 


385 










390 










395 










400 


Lys 


Gly 


hys 


Asn 


Lys 


Glu 


Glu 


Ala 


Leu 


Pro 


Asp 


Asn 


Gly 


Val 


Lys 


Gly 






4 05 










410 










415 




Val 


Fha 


Cys 


Sec 
420 


Asn 


Acg 


Gly 


Asp 


Ala 
425 


Arg 


Gly 


Trp 


Pro 


nis 

430 


Leu 


Phe 


Ills 


Phe 


Glu 
435 


Leu 


Pro 


Ala 


lie 


Tyr 
440 


Thr 


Glu 


lie 


Gin 


Gly 
445 


Tyr 


Vol 


Met 


/Vsp 


Acg 
450 


Glu 


Glu 


Asn 


Fro 


Tie 
455 


Ala 


Gly 


Ala 


Thr 


Val 

460 


Arg 


He 


Val 


Gly 


Glu 


Arg 


Gly 


Pro 


Vai 


Gly 


Gin 


Gly 


Phe 


VaI 


Thr 


Thr 


Arg 


Asp Asp Gly 


465 










470 










475 










480 


Ser 


Tyr 


Lys 


MeL 


Ser 


Val 


Gin 


Gly 


A.'jp 


Thr 


Arg 


Tyr 


Val 


Met 


Leu 


Ala 








495 










490 










4 95 




Gly 


Ala 


Ser 


Gly 


Tyr 


Leu 


Asn 


Gin 


Tyr 


Val 


Glu 


Leu 


Lys 


Thr 


Asp 


Thr 






500 










505 










510 






Ala 


Ly*.- 


GXn 


Sec 


Glu 


Thr 


Tyr 


Tyr 


Val 


Asp 


Phe 


Phe 


Leu 


Ala 


Ser 


Arg 




51L 










520 










525 








Glu 


L/.' 
530 


Ala 


Glu 


Gly 


Leu 


Gin 
535 


Asn 


lie 


Phe 


Tyr 


Asp 
540 


Phe 


Asp 


Lys 


Ala 


Thr 


Leu 


Ac9 


Pro 


Glu 


Ser 


Met 


Lys 


Ser 


Leu 


Asp 


Glu 


Leu 


He 


Arg 


He 


545 










550 










555 










560 


Leu 


Thr 


Asp 


Asn 


Pro 
565 


Asp 


He 


Arg 


He 


Glu 
570 


Leu 


Gly 


Ser 


His 


Ala 
575 


Asp 


Arg 


Lys 


Gly 


Pro 

590 


Asp 


Ala 


Tyr 


Asn 


Leu 
585 


Gly 


Leu 


Ser 


Ai?p 


Arg 
590 


Arg 


Ala 


Lys 


Ser 


Val 


Val 


Asp 


Tyr 


Leu 


Thr 


Ser 


Arg 


Gly 


lie 


Ala 


Ala 


Asp Arg 






595 










600 










605 








Leu 


Thr 


Trp 


Lys 


Gly 


Tyr 


Gly 


Lys 


Ser 


Val 


Pro 




Thr 


Val 


Thr 


Ala 




610 






615 










630 










Lys 


He 


Ala 


Glu 


Arg 


His 


Af p 


Phe 


Leu 


Lys 


Glu 


Gly 


Asp 


Val 


Lsu 


Thr 


625 










630 










635 










640 


Glu 


Glu 


Phe 


Val 


Ala 
645 


Pro 


Leu 


Thr 


Glu 


Glu 
€50 


Gin 


Gin 


Sec 


Val 


Cys 
655 


Asp 


Gin 


Leu 


Asn 


AC9 
660 


Arg 


Thr 


Glu 


Phe 


Arg 
665 


Val 


lie 


Glu 


Glu 


Glu 
670 


Leu 


Arg 



(2) INFOWIATIW FOR SEQ ID MO: 449 

(1) SEQUENCE aiARACTERISTICS: 

(A) LENGTH: 708 araino acids 

(B] TYPE: amino acid 
(D) TOPOLOGY: linear 

on noLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

<vl) ORIGi:iAI. SOURCE: 

(A) ORGAtlISM: Porphyromonas gingivalis 

flKl FEATURE: 

(A) riAJiE/KEY: misc^f eature 

(B) LOCATION l,..70e 

(Xl) SEQUENCE DESCKIPTIOM: SEQ ID MO:448 



net 


Lys 


Lys 


Phe 


Phe 


Phe 


Ala 


Leu 


Leu 


Ser 


He 


Gly 


He 


Ser 


Ala 


Gin 


1 








5 










10 










15 




Ala 




Ala 


Lys 


TKr 


Asp 


Asn 


Val 


Pro 


Thr 


Asp 


Sec 


Leu 


Arg 


Val 


His 








20 










25 










30 






Asn 


Leu 


Gin 


Thr 


Val 


Thr 


Val 


Tyr 


Ser 


Thr 


Arg 


Thr 


Ala 


Val 


Pro 


Leu 






35 










40 










45 








Lys 


Lys 


He 


Pro 


Ala 


Lys 


Met 


Glu 


Leu 


He 


Ser 


Scr 


Arg 


A^n 


He 


Lys 




SO 










55 










60 










Gin 


Ser 


Gly 


Phe 


Asn 


Asn 


Met 


Thr 


Asp 


He 


Leu 


Lys 


Thr 


Gin 


Ser 


Ser 


65 










70 










75 










80 


Leu 


Asp 


Val 


He 


Gin 


Tyr 


Pre 


Gly 


Phe 


Ser 


Ser 


Asn 


He 


Gly 


He 


Arg 










85 










90 










95 




Gly 


Fhe 


Lys 


Pro 


Ser 


Gly 


Lys 


Tyr 


Val 


Thr 


Val 


Leu 


Val 


Asn 


Gly 


He 








100 










105 










110 






Pro 


Ala 


Gly 


Thr 


TVsp hsn 


He 


Ser 


Thr 


Leu 


Asn 


Thr 


Ser 


Asn 


lie 


Glu 






115 










120 










125 








Gin 


He 


Glu 


He 


Leu 


Lys 


Gly 


Pro 


Phe 


Ser 


Ser 


He 


Tyr 


Gly Thr 


Asn 




13C 










135 










140 
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Ala 


lt€ t 


Gly 


Gly 


Val 


Val 


Asn 


He 


He 


Thr 


His 




Ser 


Lys Asp 


Lys 








150 










155 








160 


lie 


Hi? 


'Jly 


Asn 


Val 


Ser 


Leu 


Phe Gly 


Gly 


S^r 


Tyr 


Gin 


Thr llet 


Ala 










165 










170 








175 




ill 


S**r 


Ph- 


Afsn 


Leu 


Gly Gly Arg 


Phe 


Glu 


Asp 


He 


Phe 


Ser Phe 


Asp 








180 










185 










190 








L«;U 


Gly 


Leu 


Asp 


Lys 


Gin 


Aen 


Lys 


Asp 


Tyr 


Lye 


Thr Gly 


Ser 






195 










200 










205 






Asn 




Ph- 


Leu 


Ser 


Leu 


Ser 


Lys 


Leu 


Glu 


Glu 


Ala 


He 


Val Asp 


Val 




21 










215 










A^^ 








Asr 


Ala 


Thr 


Lys 


A^n 


Lys 


Lyf 


Het 


Lys 


Gly Scr 


sp 


Tyr Thr Val Ala 


22-" 










230 










235 








240 


Thr 




Acq 


Leu 


Arq 


Phe Gly 


He 


Asp 


Phe 


Thr 


Pro 


Glu 


Trp Ser 


Leu 










245 










25C 








255 








Tyr 


Gin 


Asn 


Val 


Phe 


Leu 


Gly 


Asp 


Ala 


Tie 


Pro 


Val Gly 


Gly 








260 










26S 










270 




Ser 


lis 


Trp Rly 


Val 


Tyr 


Gly 


Glu 


Ser 


Lys 


Lys 


Asn 


Leu 


Asn Arg 


Ser 






27S 










280 










285 






Ser 




S-sr 


fhe 


Glu 


Leu 


Leu Gly 


Lys 


His 


Gly 


-ys 


KiE 


Thr Leu 


Gin 




290 










295 










300 








rnS 


Ser 


Pro 


Tyr 


Phe 


Asn 


He 


Glu 


Lys 


Ser 


Glu 


Asn 


Tyr 


Asn Asn 


Ala 


3C5 










310 










315 








320 


Asp 


Pre 


Thr 


Gly 


Phe 


He 


Aen Tyr 


Lyc 


Ser 


Aep 




Tyr Thr Tyr Gly 










325 










330 








335 




Ai. a 




Leu 


Gin 


Acp 


Lys 


lit 


Ser 


Phe 


Gly 


Gly 




Asn 


He Val 


Leu 


















34 5 










350 








Asp 


b'er 


Arg Asn 


Met 


Thr 


Met 


Gl u 


Ser 


ri 


Arg 


Phe Glu 


Gin 






355 










360 










365 






rtla 


'j1 y 


Val 


A."?n 


Thr 


Lys 


Pro 


Tyr 


Asn 


Pre 


Gly 




Ala 


Thr Asn 


Asn 




370 










375 










if 








lie 


Gly 


Leu 


Phe 


Gly 


Gin 


Alo 


Asn 


Phe 


Tyr 


Leu 




Asn 


Asp Ala 


Leu 


385 










390 










395 








400 


£tr 


He 


Ser 


Ala 


Gly Ala 


Arg 


Ala 


Asp 


Phe 


Met 


Phe 


Phe Asp Leu Lys 










405 










410 








415 




Al a 


Asn 


Glu 


Tyr 


Leu 


Aen 


Asn 


Glu 


Ala 


Lys 


Gin 


Glu 


Thr 


Ills Asn 


Val 








420 










425 










430 




lie 


Asn 


Fro 


Asn 


Val 


Gly 


He 


Lys 


Tyr 


Glu 


Phe 


Val 


Lys 


Gly Leu 


Thr 






435 










440 










445 






Ala 


His 


Gly Thr 


Phe 


Gly 


Ser 


Ala 


Phe 


Ser 


Ala 


Pro 


Asp Ala Phe 


Gin 




4 50 










455 










460 








Lys 


Al a 


Gly 


Gin 


Tyr Val 


Gly 


Pro 


Phe 


Gly Thr 


Thr 


He 


Gly Asn 


Pro 


•165 










470 










475 








480 


Afp 




Lyt- 


Pro 


Glu 


Lys 


Ser 


Met 


Thr 


Trp 


Ai'p 


Phe 


Gly 


He Gly Tyr 










485 










4 90 








495 




Ssr 


A?r> 


Ala 


Arg 


Cys 


Gly 


He 


Gin 


Ala 


Asp 


Val 


Thr 


Leu 


Thr Tyr 


Phe 








500 










505 










510 




IS 




Asp 


Hi^ 


Lys 


Asp 


LeM 


He 


Leu 


Ser 


LJer 


ro 


Asp 


Tyr Ala 


Asn 






515 










520 










525 








n • 


Thr 


Thr 


Tyr 


He 


Asn 


Ala 


Asp 


Lys 


Ala 




Met 


Ser Gly 






530 










535 










54 0 








Giu 






Leu 


Ser 


Tyr A^p 


Phe 


Gly 


Ser 


Leu 


Phe 


Ala 


Asn Lys 


Phe 


54S 










550 


















560 


Ser 


Leu 


Arg 


Ala 


Phe 


Ala 


Am 


Ala 


Thr 


He 


Met 


Leu 


Asn 


Ser Glu 


Met 










565 










570 








575 




Ly5 


Lys 


Ser 


Gin 


Thr 


Asp Ala 


Pro 


Trp 


Ser 


Giu 


Uet 


Tyr 


Tyr Val 


Arg 








580 










585 










590 




Lys 


Gin 


Asn 


He 


Thr 


Phe 


Gly 


He 


Glu 


Tyr 


Arg 


Gly 


Lys Glu Gly 


Leu 






595 










600 










605 




Glvj 


Val 


Met 


Lttu 


Asn 


Gly Arg 


Phe 


Met 


Gly 


Arq 


Arg 


He 


Glu Gin 


Asn 




610 










615 










620 








Trp 


Tyr 


Ala 


Tyr 


Tyr 


Pro 


Giu 


Val 


Arg 


Pro 


Glu 


Leu 


Gin 


Gin Leu 


Leu 


625 










630 










635 








640 


Ala 


Ala 


Glu 


Glu 


Pro 


GLu 


Leu 


Ala 


Ala 


Gin 




Leu 


Leu 


Arg His 


Pro 










645 










650 








655 




Gin 


Ala 


Ifet 


Val 


Phe 


Asn 


Ala 


Ser 


Ala 


Tyr 


Tyr 


His 


Met 


Asn Lys 


Tyr 








6&0 










€65 








670 




Leu 


Thr 


Pho 


Gly 


Val 


Asn 


Leu 


Asn 


Asn 


He 


Leu 


Asp 


Glu 


Leu Tyr 


Thr 






€75 








680 










685 






Glu 


L/f 


Asp 


Gly 


Tyr 


Hi3 


Met 


Pro 


Gly 


Arq 


Asn 


H?. 




Gly Lys 


Val 




690 










695 










7C0 








Iter 




A.?n 


The 
























705 






























(2) 


iriFORIlATION 


FOR 


SZQ 
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(1) SEQUEllCE CKARACTERISTTCS; 

(A) LEIIGTH: 4 62 ainiJio acids 
(D) TYPE: amino acid 
[0] T0POIX}GY: liMar 

(ii> MOLECULE TYPE; protsin 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAIJISfl: Porphyromonas gingival is 

li?:) FEATURE: 

(A) NAME/KEY: mif c_f ea turo 
{B) LOCATIOtl 1 . . .462 



(xi) SEQUEllCE DESCRIPTTOM: 5EQ ID MO: 449 



Met 


Asn 


Aro 


Phe 


Ser 


Asn 


His 


Trp 


Pro 


Cys 


He 


Leu 


Val 


Gly 


Phe 


VaI 


1 








5 










10 










15 




Leu 


Tcp 


Phe 


Val 


Str 


Ala 


5or 


Arg 


Thr 


Val 


Ala 


Gin 


Asn 


Ala 


Ser 


Glu 






20 










25 










30 






Thr 


Thr 


Val 


Ser 


Tyr 


Asp 


Thr 


Asp 


Thr 


Ala 


Val 


Leu 


Ser 


Olu 


Als 


Asp 






35 










40 










45 






Val 


Leu 


Arg 


lie 


Ala 


Leu 




Glu 


TVcn 


Ala 


Thr 


Val 


Lys 


Val 


Ala 


Asp- 




50 










55 










60 










Met 


A5p 


Val 


Arg 


Lys 


Gin 


Glu 


Tyr 


Ala 


Arg 


Arg 


Ala 


Ala 


Arg 


Ala 


Asp 


65 










70 










7b 










80 


Lt^u 


Pile 


Pro 


Lys 


Val 


Arp 


L€U 


Aj?n 


Gly 


Val 


Ti-r 


Ser 


His 


Thr 


Leu 


Lys 










8b 










90 










95 




Lys 


Gin 


Val 


Leu 


Tyr 


He 


Asp 


Met 


Pro 


Gly 


Phe 


S&r 


Ser 


Ser 


Glu 


Gly 








1 00 










105 










110 






Il« 


Glu 


tiet 


Gly 


Arg 


Thr 


His 


Asn 


Thr 


Gin 


Gly 


Gly 


Vai 


Asn 


Val 


Ser 






lib 










120 










125 








Met 


Pro 


Leu 


Val 


Ser 


Ala 


Gla 


Leu 


Trp 


Lys 


Ssr 


He 


Ala 


Met 


Thr 


Gly 




130 










135 










140 










Gl u 


Gin 


Leu 


A-p 


Leu 


Ala 


Leu 


Glu 


Lys 


Ala 


Arg 


Ser 


Ser 


Arg 


He 


A-^p 


145 










150 










155 










160 


Leu 


Val 


Ala 


Glu 


Val 


Lys 


Lys 


Ala 


Tyr 


Leu 


Ser 


Val 


Leu 


Leu 


Ala 


Glu 










165 










170 










175 




Asp 


Sor 


Tyr 


Gly 


Val 


Phe 


Lys 


Arg 


Ser 


Tyr 


Aep 


Asn 


Ala 


L2U 


Ala 


A?n 








180 










185 










190 






•ryr 


hys 


Asn 


He 




Aip 


Lys 


Phe 


Asp 


Arg 


Gly 


Leu 


Val 


Ala 


Glu 


T:,*r 






195 










200 










205 








Asp 


Lys 


He 


Arg 


Ma 


Asn 


Val 


Gin 


Val 


Arg 


Asn 


He 


Glu 


Fro 


Asn 


Leu 




•11 0 










215 










220 










Leu 


Gin 


Ala 


Gin 


Asn 


Sec 


Val 


Ala 


Leu 


Ala 




Trp 


Gin 


Leu 


Lys 


Val 


225 










230 










235 








24 0 


Leu 


I'let 


Ser 


Met 


Glu 


Val 


Glu 


Thr 


Pro 


He 


Arg 


Leu 


Ser 


Gly 


Ser 


Leu 










245 










250 










255 




Ser 


Asp 


Tyr 


Lys 


Glu 


Gin 


Val 


Tyr 


Thr 


Gly 


Tyr 


Phe 


Ala 


Ala Asp 


Thr 








260 










265 










270 






Leu 


lie 


Ser 


AFn 


Asn 


Ser 


Ser 


Leu 


Arg 


Gin 


Leu 


Asp 


Il»* 


Gin 


Arg 


Arg 






275 










2fl0 










2e5 








Leu 


Ala 


Val 


Ser 


Ala 


Asp 


Lys 


Leu 


Asn 


Lys 


Tyr 


Ser 


Phe 


Leu 


Pro 


Thr 




290 










295 










300 










Leu 


Asn 


I<eu 


Cly 


Gly 


Gin 


Tyr 


Thr 


Tyr 


Ser 


Leu 


Acn 


Ser 


Asn 


Asp 


He 


305 










310 










315 










320 


Lys 


Phe 


Trp 


Gly 


Glu 


Gly 


Gin 


Arg 


Trp 


Thr 


Pro 


Phe 


Ser 


Thr 


He 


Ser 










32b 










330 










335 




Leu 


Ssr 


l.ou 


Tyr 


He 


Fro 


He 


Phe 


Asn 


Glv 


Gly 


Lye 


Axg 


Leu 


Tyr 


A^n 








340 










345 










350 






Val 


Lyr 


Gin 


Ser 


Ala 


Lt;u 


Ssr 


He 


Arg 


Gin 


He 


A5p 


Leu 


Gin 


Arg Axy 






355 










360 










365 








His 


lis 


Glu 


Gin 


Ser 


He 


Arg 


Met 


Gly 


He 


Lys 


A?n 


Gin 


Asn 


Asp Arg 




370 










375 










380 










Leu 


Arg 


Thr 


Cys 


1-lsc 


Gin 


Axg 


Phe 


Val 


Ala 


S^sr 


Glu 


Glu 


Ala 


V3] 


Arg 


305 










390 










395 










■J 00 


S-r 


Aia 


Glu 


Lys 


Gly 


Tyr 


Gin 


He 


Ala 


Glu 


Lys 


Arfv-Tyr 


Gin 


Thr 


Gly 










4 05 










410 










415 




Glu 


Giy 


Thr 


LtiU 


Val 


Glu 


Leu 


A5«n 


Asp 


Ala 


Asp 


Val 


Ala 


Leu 


Leu 


Gin 








-120 










425 








430 






Ala 


Arg 


L€U 


Asn 


Tyr 


Asn 


Gill 


Ala 


lie 


Phe 


Asp 


Phe 


Itet 


Thr 


Ala 


Lys 






4 35 










440 








445 








Ala 


Glu 


I.eu 


Asp 


Lys 


Ket 


Asn 


Gly 


(iet 


Gly 


He 


Pro 


Glu 


Gin 
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450 455 460 

{2) mFORtlATTOll fOR SCQ ID NO: 4 50 

ti) SEQUENCE CHARACTER I ST T a<; : 

fA) LfcIHGTH: 4 92 amino acid.<r 
(B] TYPE: amino acid 
(DJ TOFOLOG/: linear 

(ii) tlOLECULE TYfE: protein 

liii) HYPOTHETICAL: YES 

(vl) ORiailJAL SOURCE: 

(A) ORGAJflSTf: Porphyromonas gingivalif 

( ix ) rEATURE : 

(A) tlPi-IE/KEY: mi5c_f eaturt 

(B) LOCATION 1...492 



Ui) SEQUEUCE DESCRIPTION: SF.Q ID 110:450 



Met 


Trp 


Gly 


Asp 


Ser 


IIi£ 


Gly 


Val 


Ala 


Pro 


Asn 


Gin 


Val Arg Arg 


Thr 


1 








5 










1 0 






15 




Lsu 


Val 


Lys 


Val 


Ala 


Leu 


Ser 


Glu 


Ser 


Leu 


Pro 


Fro 


Gly Ala Lys 


Gin 








20 










25 








30 




He 


Arg 


Ho 


Gly 


Phe 


Ser 


Leu 


Pro 


Lys 


Glu 


Thr 


Glu 


Giu Lys Val 


Thr 






35 










40 










45 




Ala 


Leu 


Tyr 


Leu 


Leu 


Val 


Ser 


Acp 


Ser 


Leu 


Alfl 


Val 


Arg Arp Leu 


Pro 




50 










55 








SO 




Asp Tyr 


L'*'S 


ly 


\rg 


Val 


Ser 


Tyr 


Asp 


Ser 


Phe 


Pro 


He Ser Lys 


Glu 


65 










70 










75 






80 


Asp Arg 


Thr 


Thr 


Ala 


Leu 


Ser 


Ala 


Asp 


Ser 


Val 


Ala 


Gly Arg Arg 


Phe 










85 










90 






95 




Phe 


Tyr 


L-u 


Ala 


Ala 


Asp 


He 


Gly 


Pro 


Val 


Ala 


Ser 


Phe Ser Arg 


Ser 








100 










105 








110 




Ae^p Thr 


Lsu 


Thr 


Ala 


Arg 


Val 


Glu 


Glu 


Val 


Ala 


Val 


Asp Gly Arg 


Pro 






115 










J 20 










125 




Leu 


Pro 


Leu 


Lys 


Glu 


Leu 


Ser 


Pro 


Ala 


Ser 


Arg 


Arg 


Leu Tyr Arg 


Gly 




130 










135 










140 






Tyr 


Glu 


Ala 


Leu 


Phe 


Val 


Pro 


Gly 


Ajsp 


Gly 


Gly 


Ssr Arg Asn Tyr Arg 


145 










150 










155 






160 


He 


Pro 


Ala 


He 


Leu 


Lys 


The 


Ala 


Asn 


Gly 


Thr 


Leu 


He Ala Met 


Ala 










16S 










no 






175 




J\sp Axg 


Arg 


Lys 


Tyr 


Aen 


Gin 


Thr 


Asp 


Leu 


Pro 


Glu 


Asp He Asp 


He 








lOO 










185 








190 




Val 


list 


Arg 


Arg 


Ser 


Thr 


Asp 


Gly 


Giy 


Lys 


Ser 


Trp 


Ser Asp Pro 


Arg 






195 










200 










205 




He 


He 


Val 


Gin 


Gly 


Glu 


Gly 


Arg 


Asrn 


ULs 


Gly 


Phe 


Gly Asp Val 


Ala 




210 










215 










220 






Leu 


Val 


Gin 


The 


Gin 


Ala 


Gly 


Lys 


L&u 


Leu 


Met 


He 


Phe Val Cly Gly 


225 










230 










235 






240 


Val 


Gly 


Leu 


Trp 


Gin 


Ser 


Thr 


Pro 


Asp 


Arg 


Pro 


Gin 


Arg Thr Tyr 


He 










245 










250 






255 




Ser 


Glu 


Ser 


Arq 


Asp 


Glu 


Gly 


Leu 


Thr 


Trp 


Scr 


Pro 


Pro Arg Asp 


He 








260 










265 








270 




Thr 


His 


Phe 


He 


Ph« 


Gly 


Lya 


Asp 


Cys 


Ala 


Asp 


Pro 


Gly Arg Ser 


Ary 






275 










280 










285 




Trp 


Leu 


Ala 


Ser 


Phe 


Cys 


Ala 


Ser 


Gly 


Gin 


Gly 


L*u 


Val Leu Pro 


Ser 




290 










295 








300 






Giy 


Arg 


He 


Thr 


Fh^ 


Vul 


Ala 


Ala 


He 


Arg 


Glu 


Ser 


Gly Gin Glu 


Tyr 


305 










310 










315 




320 


Val 


Leu 


Asn 


Asn 


Tyr 


Val 


Leu 


Tyr 


Ser 


Asp 


Asp 


Glu Gly Asp Thr 


Trp 










325 










330 






335 




Gin 


Lou 


Ser 


Asp 


Cys 


Ala 


Tyr 


Arg 


Arg 


Gly 


Asp 


Glu 


TVlo Lys T.su 


Ser 








340 










345 








350 




Leu 


net 


Pro 


Asp 


Gly 


Arg 


Val 


Leu 


t-tet 


Ser 


He 


Arg 


Asn Gin Gly 


Ary 






355 










360 










365 




■31 n 


Glu 


Scr 


Arg 


Gin 


Arg 


Phe 


i?he 


Ala 


Leu 


Ser 


Ser 


Asp Asp Gly 


Leu 




31 0 










375 










380 




Thr 


Trp 


Glu 


Arg 


Ala 


Lys 


Gin 


Phe 


Glu 


Gly 


He 


His 


Asp Pro Gly 


C^'s 


365 










390 










395 








Aarn 


Gly 


Ala 


Met 


Leu 


Gin 


Val 


hys 


Arg 


Asn 


Gly 


Arg 


Asp Gin Val 


Leu 










4 05 










410 






415 




His 


Sec 


Lou 


Pro 


Leu 


Gly 


Pro 


Asp 


Gly 


Arg 


Arg 


Asp 


Gly Ala Val 


Tyr 
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425 

Leu Phe Asp His Val Ser Gly Arg Trp Ser Ala Pro Val Val Val A.^n 

440 445 
Ser Gly Ser Ser Ala Tyr Ser Asp Met Thr Leu Leu Al^ A^p Gly Thr 
455 

lie Gly Tyr Phe Val Glu Glu Gly fi^p Glu 11- Ser Leu Val Phe He 

Arg Phe Val Leu A-p Asp Leu Phe Asp Val Arg Gin 
485 4 

(2) INt'ORJIATION FOR SZQ ID NO:45i 

(1) SEQUENCE CHARACTFRISTICS; 

(A) LENGTH: 245 amino acids 

(B) TYPE: amino acid 
(n) TOPOLOGY: linear 



PCT/AU98/01023 



(ii) MOLECULE TYPE: protein 
20 (iil) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAIJISI I: Porphyromonar gi ngi 1 1 s 
25 (Ix) FEATURE: 

(A> MAI IE/KEY: miffc_featur- 
(SI LOCATIW 1 . . ,24 5 



(Xl) SEQUENCE DESCRIPTION: SEQ ID NO: 451 

Met Lys Lys Glu Lys Leu Trp He Ala He Val Ala Gly Leu Ala Phe 

Val Leu Gly Leu T^-r Ala Leu Gly Arg Ser Val Ala Gin Leu Irg Arg 

35 Ser Gin Pro Ser Val Thr Val Thr lly Met Ala Glu Arg Asn Phe Lys 

ser Asp Leu He Val Trp Thr aL Ser Tyr Gin Leu gL Met Met Asp 
55 60 
40 sT ""^^ ^'"'^ ""^^ ^^"^ ""^^ "-^^ "-^'^^ Leu Val Ala 

A^p Tyr Leu Lys Asn Lys Gin Leu Pro Asp Ser Ser Tyr He Phe S-r 

®5 90 qt 

Ser Val Ala He Ser Lys Glu Tyr Asn Tyr Tyr Tyr Asp Pro Arg Gin 

45 Glu Gin A^n Val Arg Thr Phe Ala Gl^y Tyr Leu Leu Ser cln Thr Val 

Thr Val Thr Ser Gin Asp He Glu Hir Val Glu Lys He Ser Arg Asp 

Tie Thr Glu Leu He Asn Gin Gly Val Glu He Thr Ser A.p Arg Pro 



Ala T-yr Tyr Tyr Thr Lys Leu Asn Asp Leu Lys Val Glu Met Leu Arg 

A.^n Ala Ser Glu Asp Ala Phe Asn Arg Ala Ser Val He Ala gIu Gly 
Ser 

195 " " 200 205 



Oi^ Ser Gly Ser Ser Val Gly Ly., Met Leu Ser Ser Ser Met Gly Val Phe 



Gin He Val Gly Leu Asn Ser Asn Glu Asp Tyr Ser Trp Gly Gly Ser 

215 220 
Phe fe,. Thr set ser J.ys Het Lys Thr Ala S,r He Thr V.l Ly. Ala 

230 

Ser Phe Ala Leu Lys 2**° 
245 

(2) IIJFORMAT^O^^ for SEQ id N0:452 

ti> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 amino acids 
[31 TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii> MOLECULE TYPE: protein 
(ill) HYPOTHF.TTCAL: YES 
75 (vi) ORIGINAL SOURCE: 
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iA) ORGAWISH: Porphyronwiias glnqivaXis 

dr.) FEATURE: 

lAl MAI1E/KEY; nd^c fcaxiure 
(B) LOCATION 1 . . -?7 6 



SEQUEflCE DESCRIPTIOH: 3EQ ID HO:4&2 



(let 


Lys 


hya 


Thr 


lie 


Ala 


He 


He 


Ala 


Ser 


Ala 


Leu 


Leu 


Ala 


Leu 


Gly 


1 








5 










10 










15 




Ala 


Val 


Gly 


Cys 


Lys 


Lys 


Asn 


Ala 


Asp 


Thr 


Thr 


Ala 


Val 


Ser 


Gl ti 


Lyf? 








20 










2b 










30 






A^p 


Sec 


He 


Ala 


Leu 


Sec 


Met 


Gly 


He 


Leu 


Tyr 


Gly Gin Asp 


Phe 


Ala 






35 










40 










45 








Asn 


Gin 


Ph* 


Glu 


tiet 


3er 


Arg 


Leu 


Gill 


Gly 


Gin 


Pro 


He 


Asp 


ser 


Val 




SO 










55 










60 










Ala 


Phfc 


Leu 


Ai-p 


Gly 


Phe 


Ly3 


Tyr 


Glv 


He 


Asp 


Thr 


Thr 


Arq 


Phe 


Ser 












70 










76 










80 


Tyr 


Am 


Leu 


Gly 


Ala 


He 


Tyr 


Ala 


Ser 


Asn 


He 


Ala 


Arg 


Gin 


Leu 


Ala 










85 










90 










95 




His 


Affp 


Ssr 


He 


Asp 


lie 


Arp 


Lys 


Phe 


Tyr 


A3 a 


Ala 


Met 


Arg 


Ala 


Ala 








100 










105 










J 10 






Leu 


Leu 


Ly.- 


Asp 


Thr 


Val 


Ser 


He 


Ala 


Het 


Lys 


Pro 


Ala 


Asp 


Als 


Gin 






115 










120 








125 






yMa 


Phc 


Met 


Gin 


Arg 


He 


Gin 


Ala 


Lys 


Lys 


Gin 


Arg 


Glu 


Asn 


Asn 


Met 




130 










135 










140 










Ly5 


Gin 


Phe 


31y 


Gin 


Asfi 


He 


Glu 


r.y.< 


Gly Am 


Glu 


Tyr 


He 


Asp 


Thr 


145 










150 










155 










160 


Ph« 


Lyc 


Lyc 


Glu 


ASD 


Gly 


Val 


Thr 


Val 


Thr 


Thr 


Thr 


Gly 


Leu 


Ala 


Tyr 










165 










170 










17i> 




Lys 


Thr 


Leu 


Gin 


Glu 


Gly 


Thr 


Gly 


Ala 


Thr 


Pro 


Ser 


Leu 


Ala 


Asp 


Thr 








100 










185 










190 






Val 


Arg 


Val 




Tyr 


Val 


Gly 


Thr 


Leu 


Val 


Asp Gly Lys 


Glu 


Phe Asp 






195 










2 00 










205 








Lys 


Asn 


Glu 


Glu 


Gly 


He 


Glu 


Phe 


Ala 


Val 


Thr 


Gly 


Val 


He 


Lys 


Gly 




21 0 










215 










220 










Trp 


Thr 


Glu 


I let 


Leu 


Gin 


Leu 


Met 


Lys 


Val 


Gly 


Gin 


Lys 


Val 


Arg 


Val 












230 










235 










240 


Val 


11^ 


Pro 


Gin 


Glu 


Leu 


Ala 


Tyr 


Gly 


Glu 


Thr 


Gly 


Asn 


Tyr 


Thr 


He 










245 










250 










255 




Glu 


Pro 


Phe 


Ser 


Thr 


Leu 


Thr 


Phe 


Glu 


Het 


Glu 


Leu 


He 


Gly 


He 


Lys 








260 










265 










270 






Pro 


Gly 


Lys 


Lys 






























27 5 





























(2) IW FORMATION FOR SEQ ID 140:453 

(i) SEQUEIICE OiARACTERISTICS: 

(A) LEIIGTU: 77 5 amino acids 

(B) TS'PR; amino acid 
(D) TOPOLOGY: linear 

(11) HOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

tvij ORIGINAL SOURCE: 

(A) ORGAIJXSH: Porphyromonas gingival is 

tlx) TEATURE; 

(A) »AME/KEi: miscteature 
<B) LOCATIOt^ I . , .775 



(Mil SEQLiEMCE DESCRIPTION: SEQ TO NO: 4 53 



net 


Lys 


Val 


Leu 


Arg 


Gin 


Val Phe 


Leu 


Pro 


He 


Leu 


Phe 


Val 


Leu 


Leu 


1 








5 








10 










15 




Thr 


Gly 


Ala 


Cys 


Ser 


Thr 


Thx Lys 


Asn 


Leu 


Pro 


Glu 


Gly 


Glu 


Gin 


Leu 








20 








25 










30 






Tyr 


He 


Gly 


Jlet 


Gly 


Lys 


Thr Gin 


He 


Leu 


Arg 


Gin 


Asp 


Lys 


Ser 


HI.- 






35 








40 










45 








Ala 


Gly 


Gin 


Gin 


Ala 


Lou 


Thr Glu 


Val 


Glu 


Ser 


Thr 


Leu 


Lys 


Val 


Thr 




50 










55 








60 










Pro 


A.*n 


Gly 


Ala 


He 


Phe 


Gly Ser 


Ala 


Ser 


Ala 


Ser 


Leu 


Pro 


Lys 


He 



65 70 75 80 



SUBSmUTE SHEET (Rule 26) (RO/AU) 



j:^ o Oi iLji ra ic: icii hus n™? 



wo 99/29870 PCT/AU98/01 023 

393/490 

Pro Phe Gly Leu Trp Lf^u Tyr Asn Ser Fhe Vai Giy Asp Ser The V«i 

85 90 05 

He i>er L'/5 Trp lie Fhe Asp Lys Phe Ala Ala Lyr Pro Val Phe H- 

100 105 HQ 

Ser Gin Val t.y.n Ser Asp Ser Arg Ala Lys Val Ala Thr Asn He L-u 

11£ 120 125 

Arg Glu His Gly T/r Phe Asp Ala LyE Val Lys S-r Sc-r Val Thr Thr 

130 135 no 

Leu Lys Lys Asp Ser Leu Lya Ala Lys He Ser Tyr Thr Val Asp H-t 
^'^^ 150 155 i6n 

/Ua Ser Pro Tyr His Tyr Asp Ser He He Pro Lsu Pro Tie Ser Thr 

165 170 175 

Phe Pro Asp Ser Ho Leu Ala Tyr Arg Gin Thr Pro Ser Leu He Aru 

leo 185 190 

Lys Gly Asp Gin Phe Asn Leu Ala Lya Leu His Glu Glu Arg Gin Thr 

195 200 205 

lis Ssr Ala Leu Leu Arg Asp Asn Gly Tyr Tyr Tyr Phe Arg Pro Gin 

210 215 220 

Asp He He Tyr Glu Aia Aep Thr Leu Leu Val Aro Gly Ala Val C'-« 
225 230 235 ' 2^0 

Leu Arg Al?. Lys Leu Ser Glu Asp Thr Pro Pro Gin Ala Met Arg Pre 

245 250 255 

Trp Arg He Gly Lys Arg Thr Ala Val Leu Leu Gly Met Asn Glv Glu 

2&0 265 270 

Ssr Pro Thr Asp Ser Leu Glu Val Glu Asp Met Lys Val L-u T'-r T-r 

275 280 28b ' ' 

Arg Lys I let Pro Val Ary Pro Lys Tie Lsu Ala Lys Arq Ph- Aro Ph- 
290 295 300 ' v - 

Phe Scr Gly A.-n Leu Tyr Arg Gin Lys Asp Asp Glu Thr Thr Arg Lv- 
33 0 315 ^ir, 

Ser Leu Ala Arg Leu Gly Ala Phe Ser Val He Asp Leu Asn Phe L-u 

^2b 330 335 

Gin Arg Asp Ser He Ser Gly Leu Leu Asp Val Arg Leu Leu Thr Thr 

340 345 350 

Leu Asp Lys Pro Trp Atp Ala Ser Leu Glu Thr Leu Phe Thr S-r L"^ 

355 360 365 - 

Ser Asn Asp Fhe He Gly Pro Gly Leu Asn Phe Ala Leu Ala Arq Aro 

370 375 390 

Asn Val Phe Gly Gly Gly Glu Asn Leu Ser Trp Asn He Glv Glv Ser 
385 390 395 400 

Tyr lalu Trp Glu Thr Gly Asn Arg Pro Glu Asn Ser Ser Asn Arg Leu 

405 410 415 

He Asp He A&n Ser Tyr Acn Met Asn Thr Ala Val Asn Leu Ser Ph» 

^20 425 430 

Pro Scr Tie Val Phe Pro Gly Leu Leu Asp Lys Tyr Tyr Tyr Tyr Pro 

435 440 445 

Thr Thr Thr Thr Phe Gin Ala Scr Ala Thr Ala Leu Asn Arg Ala Hi' 

450 455 
Tyr Phe S«r Itet Tyr Ser Fhe Gly Phe S^r Thr Thr Tyr Glu ?h- Gin 
470 475 ,.3^ 

Pro Ser Lys Glu His Arg HI a Ala He Phe Pro Leu Lys Leu Asn Tyr 

485 490 495 

Asn Leu Leu Gly His Gin Thr Glu Thr Phe Gin Ala He Thr Ala A^n 

n n . 510 

Afin Pro Pro Leu Leu Leu Ser Leu Gin Ser Gin PJie Leu Ala Gin M-t 

515 520 52b 

Gly Tyr He Tyr Thr Phe Asn Lys Scr Val Ser Glu Lys Ser Pro His 

530 535 
His Leu Trp Mot Gin Phe Gly Leu Ser Glu Ala Gly Asn L«u Leu Acn 

555 sen 
Leu He Tyr Leu Ala Ala Gly Lys Lys Tyr Ser Asp Thr Lys Asn Phe 

565 570 57:^ 

Val Gly Val Pro Ph= Ser Gin Phe lie Lys Ala Thr Gly Glu Leu Arq 

5flO 585 5^0 

Tyr Ser Tyr Thr lie Asp Arg Asn Gin S=r Leu Ala Thr Arg Phe Glv 

595 600 605 

Thr Gly Val He Tyr Ser Tyr Gly Asn Met Arg Val Ala Pro Tyr S"r 

610 615 6-^0 

Glu Gin Phe Tyr Val Gly Gly Ala Asn Ser He A^g Ala Phe Thr Val 
630 fi35 g^,-^ 

Arg Ser He Gly Pro Gly Arg Phe Asn Pro Asp Set hup Asn Gin T"r 

^'^^ 650 655 

Ser Tyr Leu A^p Gin Val Gly Glu Phe Lys Leu Glu Ala A^-n Val Glu 

665 eVO 
Tyi- Arg Gly :.ys Leu Phe Gly Asp Leu His Ala Ala Val Phe Leu Asp 
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67 5 






680 










60S 










Ala 


Gly 


Asn Val Trp 


Leu 


Leu 


Arg 


Glu 


Asp 


Ser 


Ser 


Arg 


Pro 


C\y 


Gly 






690 






69S 










7 00 












Ala 


L&u 


Sfer Glu Val 


Gly 


Sec 


Val 


Ser 


A5n 


Fhfe 


Leu 


Asn 


Ser 


He 


Ala 


5 


70& 






710 










715 










720 




Leu 


Cly 


Thr Gly Val 


Gly 


Leu 


Arg 


Tyr 


Asp 


Leu 


Ala 


Phe 


Leu 


Val 


Val 








725 










7 30 










735 






Arg 


Val 


Asp Val Gly 


Phe 


Gly 


Leu 


His 


Leu 


Pro 


Tyr 


Asn 


Thr 


Gly 


Lys 








740 








745 










7 50 






10 


Lys 


Gly 


Tyr Tyr Asn 


lie 


Pro 


Arg 


Phe 


Lys 


Asp 


Ala 


He 


Gly 


?hc 


His 








755 






760 










765 










Leu 


Als 


Val Gly Tyr 


Pro 


Phe 
























770 






775 




















15 




I U FORMATION FOR 


SEQ 


ID tlO: 15 I 




















(il 


SEQUEMCE CHARACTERISTICS: 






















<A) LENGTH: 77 4 amino 


acids 














20 






(Bl TYPE: 


amino acid 






















(D! TOPOt^y: 


linear 






















(III 


MOLECULE T' 


l'PE: 


protein 






















(iltl 


(lYPOTHRTICAL; YE^ 




















25 


































( vl } 


ORIGIUAL SOURCE: 


























(A) ORGAhllStl: 


Pcrphyroinot^as gingivalii 










30 




(ix) 


FEATURE : 




























(A) NATfE/KEY: 


miEC feature 




















IB) LOCATION 1, . .774 






















(xi 1 


3E0UENCE DESCRIPTION: SEQ ID IJO:JS4 










35 


Met 


Ser 


Ser His Ser 


Val 


Arg 


Tyr 


Leu 


Il« 


Gly 


He 


Ala 


Gly 


Cyff 


Leu 




1 




5 










10 








15 






Leu 


Leu 


Met Leu Ala 


Ser 


Ser 


Cys 


Ser 


Val 


Thr 


Arg 


Tyr 


Val 


Pro 


Asp 








20 








25 
















40 


Gly 


Ser 


Arq Leu Leu 


Asp 


Arg 


Val 


Thr 


He 


Ala 


Ser 


Glu 


Thr 


Gly 


Ser 






35 






40 










45 










lis 


Ala 


Leu Pro Glu 


Asp 


He 


Arg 


Asp 


Tyr 


Thr 


Leu 


Gin 


Gin 


Pro 


Asn 






50 






55 








60 












Tyr 


Arg 


Leu Phe Gly 


net 


Thr 


Arg 


Trp 


Leu 


Leu 


Arg 


Val 


Tyr 


Ser 


Ser 


45 


65 






70 










75 










80 


Ser 


Asn 


Pro Asn Ser 


Asn 


Ser 


Trp 


Trp 


Asn 


Arg 


Ser 


Leu 


Arg 


Lys 


Met 








85 










90 
















Gly 


Glu 


Pro Pro Val 


Leu 


He 


Asp 


Ser 


Val 


hen 


Thr 




Arg 


Thr 


Ala 








100 






105 








lie 






50 


As 11 


Ary 


Leu Ala Lys 


Ala 


Met 


Ala 


Gly 


Asp 


Gly 


Phe 


Leu 


Asp 


Ala 


Thr 






115 






120 










125 










Ala 


Ara 


Ala Val Val 


Asp 


Thr 


Gly 


Lou 


Tyr 


Lys 


Lys 


Ala 


Arg 


He 


Thr 






130 






13b 






140 












Tyr 


Leu 


He Gin Pro 


Gly 


Ser 


Axg 


Tyr 


Tyr 


He 


Arg 


Asrn 


Met 


Ala 


Leu 


55 


145 






150 










155 










160 


A5P 


Val 


Lys Asn Pro 


Leu 


Leu 


Pro 


Pro 


Val 


Ala 


Leu 


Gly 


Asn 


Ser 


Leu 








X6b 










170 










175 






Pro 


Ser 


Ala Tyr Lys 


Val 


Gly 


He 


Ser 


Glu 


Gly 


Ser 


Pro 


Leu 


Ser 


Pro 








IBO 








105 










1?0 






60 


lie 


Val 


Leu Asp Glu 


Glu 


Arg 


Lys 


Ala 


He 


Ala 


Arg 


His 


!Iet 


Arg 


Asn 






195 






200 










205 










Asn 


Gly 


Phft Trp Lys 


Phe 


Ser 


Ala 


Glu 


Asp 


Val 


Tyr 


Tyr 


Glu 


Ala 


Asp 






210 






215 










220 












TUr 


Thr 


Val Ser Gly 


Gly 


Scr 


Gly 


Thr 


Lys 


Ser 


Ala 


Asp 


Leu 


Lys 


Leu 


65 








230 










235 










240 


Val 


Val 


Asn Gly He 


Gly 


Arg 


Tyr 


Pro 


Tyr 


Arg 


lie 


Gly 


fjrg 


Val 


Phe 








245 










2 50 










255 






The 


tits 


Ala Asp Tyr 


Asp 


Pro 


Leu. 


Glu 


Ser 


Aep 


Phe 


Arg 


Val 


Gin 


Glu 








260 








265 










2"0 






70 


L^u 


Pro 


Arg Tie Asp 


Ser 


He 


Ser 


Acq 


Gly 


Asp 


Tyr- Thr 


Val 


Tyr 


Tyr 






275 






280 








205 










Gly 


Ser 


Arg Gly Ary 


Tyr 


He 


Arg 


A] =% 


Sec 


Ala 


Leu 


Thr 


Arg 


Ser 


Val 






290 






295 










300 












Ser 


Val 


Thr Pro Gly 


Ala 


Phe 


Phe 


Cys 


Glu 


Asp 


Asp 


Val 


GlJ 


Arg 


Ser 


75 


305 






310 










315 










320 


'I'yr 


He 


Lys Leu Arn 


Ala 


Leu 


Pro 


lie 


Val 


Arg 


Asn 


Val 




He 


Arg 
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325 










330 






phe 


Val 


Giu 


His 


Asn 


Gly 


Lys 


Acp 


Glu 


He 


Ala 


Leu 








340 










345 










Leu 


Val 


Asp Cys 


Tyr 


He 


Leu 


Thr 


Val 


Pro 


Ala 






3S5 










360 










Phe 


G ■ u 


Ala 


Glu 


Val 


Leu 


Gly 


Thr 


Af n 


Ser 


Ala 


Gly 




37 0 










375 










300 


Al a 


Le'j 


Ssr 


I. en 


Gly 


Phe 


Thr 


ASD 




a = n 


L**iJ 


Phe 


385 










390 










395 




1-1% t 




Asn 


lie 


Ly5 


Leu 


Lyf 


Gly 


Ala 


Tyr 


Glu 


Ma 










405 










410 








His 


Ssr 


Phe 


Met 


Glu 


Tvr 




Va 1 


Glu 


Ser 


Ser 








420 










425 








Arg 


L-u 




Phe 


Pro 


Phe 


He 


Ser 


Asp 


Gl u 


Thr 


Ar 






4 J 5 










r^^ 










A] a 


r 


Thr 




Trp 


Lys 


H* 


Gly 


ft 

lyr 




I yr 


Gin 




460 










4 55 










4 SO 


Phe 


Asp 


Arg 


Val 


He 


Leu 


Ser 


Ala 


in 


L"U 


Asn 




4 05 








7 










475 




T;r 


Li 11 


Ills 


Asn Arg 




Arg 


His 


Thr 


He 


Arg 


Leu 








485 










4 90 






Tyr 


Leu 


Hi 5 


Leu 


Pro 


Tyr 


He 


Asp 


Pro 


Asp 


Phe 










500 










50£ 








Pro 


Thr 


Thr 


Ala 


Leu 


Tyr 




Tyr 


Thr 


Glu 


Gin 


Phe 






51 5 










520 










Ai-n 


Tyr 


He 


L^u 


Asn 


Tyc 


Thr 


Thr 


Al ^ 


Ser 


Ser 


1 le^ 














535 










54 0 




Asn 


Pro 


Phe 


Thr 


Ala 


Arg 


e 


er 


He 


Gin 


Thr 


54 5 










550 










555 




Leu 


Gin 


Ala 


life 


Ser 


Tyr 


Leu 


Tnr 


A.Tp 


Ser 


Pro 


L"s 










565 










570 






L-U 


Tyr 


Lys 


Met 


Phe 


Gly 


Leu 


s 


iyr 


Ala 


Gin 


Phe 








580 










505 








Leu 




L-u 


Ala 


Lys 




Val 


Leu 


Leu 


Glu 


Lys 


n^p 




595 










600 










Ij&u 


nl£ 




Gly 


Phe 




Leu 


Ala 


Phe 


Pro 


Tyr 


Glv 




610 










615 










620 


lis 




Fhe 


Glu 


Leu 


Arg 


Tyr 


Phe 


AIa 


Gly Gly 


Ser 


625 










530 










635 




Gly 


Trp 


Ser 


Val 


Arq 


Thr 


Leu 


Gly 


Pro 


Gly 


Ser 


Met 










64 5 










650 






Asp 


Lyff 


Thi 


Phe 


Phe 


Asp 


Gin 


Met 


Gly 


A^p 


He 


Arq 








660 










665 








Val 


Glu 


•Vyc 


Arg 


Thr 


Lys 


Leu 


Phe 


Trp 


Lys 


Phe 


Arg 






61S 










680 










Val 




Ala 


Gly 


Asn 


Val 


Trp 


Thr 


He 


Lys 


Giu 


Tyr 




690 










6 95 










700 


Asp 


Gly 


Leu 


Phe 


Arg 


Phe 


Asp 


Arg 


Phe 


Tyr 


Lys 


Glu 


■705 










710 










715 




Tyr 


Gly 


LevJ 


Gly 


Leu 


Arg 


I.su 


Asp 


PUe 


Asp 


Tyr 


Ph« 










725 










730 






Affp 


Ala 


Gly 


Leu 


Lys 


Ala 


Tyr 


Asp 


Pro 


Gin 


Gin 


Thr 








740 










745 








Tup 


Ala 


He 


Thr 


Arrj 


Pro 


Asn 


Leu 


Ser 


Ser 


Asn 


Fhe 




755 










760 










Ala 


Val 


Gly 


Tyr 


Pro 


Phe 
















770 























(2) lUrORflATiai for SEQ id »0:455 

ti) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 867 amino acids 

(B) TYPE: amino acid 
(Ul TOPOLOGY: linear 

(ii) UOLECtn^F. TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGIMAl. SOURCE: 

(A) ORGAMISH: Pcrphyromona* gingival! 

(i::) FEATURE; 

CA) NAi^E/KEY: misc feature 

(B) LOCATION 1. . . 067 



33 ''s^" r.::ii «o jl. c:: 43 iOj iUf ii^n %3 
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490 







335 




Al a 


Asp 


Ser 


Ser 




350 






Lys 


Ser 


Lys 


Ser 


365 








Asp 


Phe 


Gly Ala 


Ary 


Gly 


Ala 


Glu 








4 00 


Ho 


Arg 


Lys 


Glv 






415 




Leu 


Arg 


Phe 


Pro 




430 






Arg 


Arg 


Leu 


Arq 


4 4 5 








Thr 




Pro 


Glu 


Ser 


Trp 


Gin 


Thr 








480 


eu 


sp 


Val 


Asp 






4 95 






S«»r 


Leu 


Pro 




SIC 






He 


T.CU 


Gly 


S-r 


525 












Thr 


Val 


Ala 


Gly 


Asn 


Leu 








560 


c^p 




His 


Gly 






575 




Val 


Lys 


Leu 


Asp 




590 






Asn 


Thr 


Leu 


Ala 


605 










Ala 


Arg 


His 


Asn 


Ser 


Val 


Arg 








64 0 


Lys 


HeL 


Thr 


Pro 






655 




Leu 


Asp 


Leu 


Asn 




670 






Ala 


Ala 


Ala 


Phe 


685 








Glu 


Asn 


Gl n 


Glu 


He 


Ala 


Leu 


Ala 








720 


Leu 


Val 


Arg 


Leu 






735 




Gly 


Arg 


Tyr 


Lys 




750 






Ala 


Trp 


His 


He 


765 
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{>:il SCQUEHCE DESCRI PTICII ; 



net 


Arg 




Arg 


Tlfi 


Leu 


Cln 


Leu 


1 








5 








LiU 


'Jly 


S'er 


Ser 


Leu 


Ala 


He 


Ala 








20 










lie 


Arp 


Ser 


Glu 


Thr 


Ser 


Glu 


Pro 






35 










4 0 


Gly- 


'iln 


Gly 


Ala 


Ser 


Leu 


Arg 


Gly 




50 










55 




phs 


Arg 


Phe 


Gl u 


Val 


Pro 


Ala 


Lys 


65 










70 






Gly 


Tyr 


Ala 


Thr 


Val 


Tlir 


Arg 


Sor 










85 








Lgu 


GVy 


Thr 


He 


Leu 


Leu 


Asp 


Pro 








100 










Gin 




Il« 


Ala 


Ser 


Val 


Val 


Pro 






115 










120 


Val 




A5n 


He 


Arg 


Val 


Ala 


Asp 




130 










135 




Glu 




Pro 


Glu 


Leu 


Vai 


Lys 


Ser 


145 










ISO 






01 y 


.Sf=rr 




Gly 


Phe 


Gly 


Asp 


Giy 










1 f>.S 








Tht 


Tyr 


Asn 


Phe 


Gly 


Val 


Leu 


He 








lao 










Gl u 




Gly 


Lyf 


Val 


Tyr 


Trp 


Ser 






195 










200 


Ala 


S$r 


Thr 


He 


Gin 


He 


Gin 


Arg 




210 










215 




He 


Ser 


Ser 


Val 


Gly 


Gly 


Thr 


Met 


22S 










230 






Ala 


Asn 


Thr 


Gly 


Gly 


Ser 


Ala 


Tyr 










245 








His 


Lyc 


Glu 


Ser 


Phe 


Ser 


He 


Ser 








260 










He 


i'hx 


lie 


Ala 


Gly 


Ser 


His 


Mot 






275 










280 


Leu 


Lys 


Gly 


A.rg 


Ala 


Phe 


Sec 


Tyr 














295 




Arn 


Glu 


Aru 


His 


Thr 


Leu 


Ser 


Leu 


305 










310 






Hi.? 


Am 


Gin 


Arg 


Ser 


Ser 


Lys 


Tyr 










325 








Giy 


ii« 


Arg 


His 


Asn 


Gl n 


Ser 


Phe 








340 










Pco 


Thr 


Ala 


Tyc 


Ala 


Tyr 


Asn 


Thr 






355 










360 


Asn 


Hit- 


Phe 


Trp 


l.ys 


Met: 


Asp 


Glu 




370 










375 




Tyr 


Ala 


Ser 


Leu 


Ala 


Thr 


Gly 


Gly 


385 










390 






Ser 


Lys 


Trp 


Val 


Leu 


He 


Asn 


Tyr 










405 








Thr 


Lys 


Val 


Thr 


Pro 


Asp 


Gly 


Leu 








420 










Ala 


Asn 


Ala 


Al a 


Ala 


Ser 


Asn 


Gly 






435 










440 


Ser 


Asn 




His 


Lys 


Trp 


Phe 


31y 




450 










455 




Leu 


Asn 




Sec 


Leu 


Thr 


Leu 


Thr 


4 65 










470 






Arg 


Gly 


A?p 


fli5 


Tyr 


Asp 


Lys 


He 










495 








Tyr 


He 


Glu 


Asp 


Pro 


Lys 


Thr 


Lys 








500 










r*ln 


Leu 


Lys 


Val 


Gly 


Asp 


He 


Val 






515 










520 




-CP 


His 


Gly 


Leu 


Phe 


Ala 


Gin 




sac- 








535 




A^p 


Ala 


Phe 


Val 


Ser 


Gly 


Ser 


He 












550 






Asn 


Tyr 


lily 


Gly 


Ser 


Lys 


ser 


Thr 



56b 



ID MO: 455 



Phe 


Leu 


Thr 


Ala 


Leu 


Leu 


Leu 


Ala 




10 










15 




Gin 


Thi' 


Val 


Val 


Thr 


Gly 


Ly." 


Val 


25 










30 






Leu 


He 


Gly 


Val 


Set 


Vai 


Sfer 


T!ir 










45 








Val 


Thr 


Thr 


Asp 


Met 


Asp 


Gly 


Glv 








60 










Ser 


Val 


Leu 


Thr 


Phe Arg 


Cys 


Val 






75 










BO 


He 


Gly 


Arg 


Gly 


Ser 


Gin 


Glu 


Asp 




90 










95 




Gin 


Ala 


He 


Gly 


Leu 


Asp 


Glu 


He 


105 










110 






Lys 


Asp 


Arg 


Met 


Thr 
125 


Pro 


Vai 


PrQ 


He 


Gin 


Ala 


Ala 


Ser 


Leu 


Acn 


Val 








140 










Thr 


Pro 


Ser 


Thr 


Tyr 


Thr 


Thr 


Lys 






155 










160 


Arg 


Thr 


Asn 


Val 


Arg 


Gly 


Fho 


Asp 




170 










175 




Asn 


Gly 


Val 


Pro 


Val 


Asn 


Gly 


Met 


185 










190 






Asn 


Trp 


Ser 


Gly 


Leu 


Met 


Asn 


Gin 










205 








Gly 


Leu 


Gly 


Ala 


Ser 


Lys 


Leu 


Gly 








220 










Asn 


He 


He 


Thr 


Lys 


Thr 


Thr 


Asp 






235 










240 


Val 


Gly list 


Gly 


Asn Asp 


Gly 


Leu 




250 










255 




Thr 


Gly 


Met 


Asn 


Asp 


Gly Trp 


Ala 


265 










270 






Thr 


Gly 


Leu 


Gly 


Tyr Vai 


Lys 


Gly 










285 








Phe 


Phe 


Asn. 


Val 


Ser 


Lys 


Lys 


Phe 








300 










Thr 


Gly 


Phe 


Gly 


Ala 


Pro 


Gin 


Trp 






315 










320 


Ser 


Val 


Alo 


Asp 


Tyr Asp Lys 


Tyr 




330 










335 




Gly 


Tyr 


Leu 


Arg 


Gly 


Glu 


Leu 


Thr 


345 










350 






Tyr 


His 


Lys 


Pro 


Gin 


Phe 


Ser 


Leu 








365 








Asn 


Thr 


Ser 


Leu 


Tyr 


Thr 


Ala 


Un); 








3G0 










Gly 


Arg Arg 


Ala 


Tyr 


Gly Lys 


Asn 






395 










4 00 


Asn 


Thr 


Gly Gin 


Pre 


Tyr 


Glu 


Gin 




410 










415 




He 


Asp 


Tyr 


A:?p 


Ala 


Val 


Leu 


Ala 


125 










430 






Ser 


Glu 


Ala 


He 


Phe 


Ala 


Leu 


Gly 










445 








Leu 


Leu 


Ser 


Ser 


Phe 


Lys 


Lys 


Lys 








460 










Ala 


(ily 


Tyr 


Asp 


Gly 


Arg 


Tyr 


Tyr 






475 










4 80 


Thr 


Asp 


Leu 


Leu 


Gly Gly 


Ser 


Tyr 




4 90 










4 95 




Leu 


Ala 


Tyr 


His 


Ala 


Glu 


Gly 


Gin 


505 










510 






Asn 


Arg 


Asp 


Tyr 


Thr 


Gly Glu 


He 










.325 








Met 


Glu 


His 


Ser 


Ser 


Glu 


Trp 


Ho 








540 










Asn 


Tyr 


Glu 


Leu 


Tyr Arg Asn 


Hi^ 






555 










560 


Cly 


Ti*r 


Leu 


Pro 


Gly Val 


Ser 


Pro 




570 










575 
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Trp Ly. Ser Fhe Leu Pro Trp s.r Gly Lys Ala Gly L.u S.. Tyr Lys 

Phe Ala Gin Gly His Asn V.l p.e A^f A.n Gly Gly Phe Phe The Arc, 

600 go= ^ 

Ala Pro Leu Phe G,y Asn II, Tyr Al, Ala Giy Al. He lie P„ ^„ 



A^P LV3 Ala Asn Het Glu lys Val Leu Thr Gly «u Val Gly Tyr Gly 



Phe Thr Am Hi., Ly« '^'r, Phe Glu Ph, Asn II^ A«„ Gly Tyr Tyr Thr 



Lv. Trp Mot A.P Arg Val Thr Ser Ly. Arg He Gly A,n Glu Tyr Val 



€<55 



Tyr ,..u A.n Gly Val A.p Ala Val His Cys Gly V.l Glu Ai° Glu Val 
15 ser Tyr Arg Pro He Arg Gin II. Asp Leu Arg Gly „et Phe Ser Leu 



Gly ASP Trp Thr Trp Gin A^n A»n Val Ser Tyr ^hr Ser Tyr Asp GU, 



Ala Gly A.„ Glu Thr Gly Gin A..p He Thr -ryr He Lys Gly Leu 



725 



V A.«P Ala Ala Gin Met Thr Ala Ala Val Ser Ala A..p II 



7At 



Glu 



Leu Phe Ly. Sly Phe His Val n« GlC. Ly. Tyr Asn Phe Le'u Giy Ly. 
25 A.n Tyr Ala G.y Phe A,n Pro Ala Thr Arg A.-n Al^ Gin Gin Tyr olu 



Ala ASP Gly Ly. Glu He Val Glu Ser Trp Lys Z!u Pro A-.p V.l Gl, 



7 90 

Leu Phe A.P ,,.„ ser Ala Ser Tyr Asn Phe Lyr Lou Gly Ser Le„ lt° 



805 

Thr Thr Phe Tyr Phe A.„ Met A,p A,n Val Ala A.p Ly.. Arg Val 

825 930 
Asp Asp Asn lie He «i ^- t v,^ r,._ . 

835 



ser Asp Ala A.,p A.p A.n He He sly Ly. Lys His A.p Glu Ala Ser 
35 Al, Leu Val Trp Tyr Gly Phe Gly Arg Thr Trp Ser rh^ Gly He Arg 



Val Asn Phe 
8Cb 



(2 1 IIiroRIIATlOd FOR SEQ ID HO: 4 56 

ti) SEWEIICE CHARACTERISTICS: 

(A) LENGTH: 4 31 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

lili) HYPOTHETICAL: YES 

(Vi] ORTGIJIAL SOURCE: 

(A) ORGAfJISM: Porphyromonas gingival!^ 
Ux) FKATLIRE: 

(A} rwiE/KEY: misc feature 

(B) LOCATION 1. . . 431 

<xl) SEQUENCE DESCRIPTION: SEQ ID HO: 4S6 
net Lys Phe S.r lie Arg Leu Phe Leu Cy. He Phe Leu Leu Ser 

Ala Phe He Leu Pro Ala Leu Gly Gin ^y. Ser Lys Gin Val oL Arg 
Leu Glu Ly. Gin Arg Ly. Glu AJa Leu Lys Ala He Glu Lys Thr A.p 
Arg Glu Leu Arg A.n Thr Ly. Ly. A.-p Ly. Gin Asp Ly. Gin Lys Hi. 
Leu A.n Leu Leu A.n Lys Gin Val Ala Gin Arg Ly. Gin Met Val Gin 
70 L.U Leu A.P A.n Glu Val Ly. Glu Leu Gin Ser Asp He A.p Ser Het 
Thr Gly Val Cy. Hi. Gin Leu Ser Val Gl'u Glu Ly. Ala Arg Ser Asp 
75 ^At ^''^ fiS Arg ser Leu 
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Aep Arg ile Leu PNe He Ser Ser Ala Lys Ser Phe Arp Glu Gly Met 

Arg Arg Met Arg Phe Leu Glu Gin Tyr Ala Ser Ala Tyr Lys L-u Ala 
ISO i^c^ j^g^j 

Ser Val Arg Leu Arg Asp Thr Arg Ser Lyc Leu Glu Thr Glu Arci Ala 
165 

Thr Val Glu Asp Ala Lys hys Glu Lys Gly Hxs Leu Leu Val He Arg 

180 185 190 

Glu Glu Glu Lys Lys Lys Leu Glu Gly Gin Gin Ala Clu Gin Arg Arg 

200 205 
Gin Val Gin Ala Leu Gly Ala Lys Gin Lys Asp Leu Glu Ala Gin Leu 

210 215 220 

Arg Lys Gin Lyi,- Lye Gin Ala Glu Al^ L-u A.sn Arg Ly^ He Glu Lvs 

Gin He Ala Lys Glu He Glu Ala Ala Glu Arg Arg Ala Arg Giu G^u 

Z-iS 250 255 

Arg Glu Arg Leu Ala Arg Glu Ala Lys Ala Lys Gly Lys Pro Val Pre 

260 265 270 

Ala Glu Pro Glu Arg Lys Ala Glu Thr Lys Gly Gly Tyr Ala Met Asp 

2'5 280 285 

Ala Ser Glu Arg Ala Leu Ser Gly Ser Phe Al« Gin Asn Lys Gly Arg 



Leu Pro Gly Pre Val Arg Gly Arg Tyr Arg He Val Ser A^p Phe Gly 

310 315 
Val His Gin His Ser Glu Leu Lys Lys Val Gin Val Asn Asn Gly Gl" 
325 ^ 
Tls Asp II- Ala Val Ala Thr Gly Ser Asp Ala Thr Ser Val Ph- Asn 

3"*° 3-15 350 ~ 

Gly Val Val Ser S«r Val Phe Val Ho Pro Gly Tyr Asn Ser Ala Val 

355 360 
Met Val Arg His Gly Asn Tyr He Thr Val Tyr Ala Asn Leu Ser Lys 

" 375 
Val Tyr Val Asn Ser Gly Thr Arg Val Lys Thr Gly Gin Ala Leu Gly 



Arg Ala Tyr Thr Asp Vrl Ser Asn Asn Gin Thr He He His Phe Glu 
•^OS 

He Trp Lys Glu Arg Ser Lys GJ n Asn Pro Arg Leu Trp Leu Arq 

425 430 

(2) IMFORMATIOM FOR SEQ ID NO: 4 57 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 333 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

tii) nOLLCULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGIUAL SOURCE: 

(A) ORGAfUSM: Porphyromonas gingivalis 

(ix) FEATURE: 

(A) HAIIE/KEY: misc faature 

(B) LOCATION 1. . .333 

Ixi) SEQUEUCF. DESCRIPTIOtJ: SEQ ID HO:457 
Met Ly<. Lys Tyr Leu Leu Tyr Ala Ser Leu Leu Thr Ser Val Leu Leu 
Phe Ser Cys Ser Lye Asn Asn Pro Asn Glu Pro Val Glu Asp Arg Ser 
He Glu He Ser He Arg Val Asp Asp Phe Thr Lys Thr Gly Glu Ala 
Val Arg Tyr Glu Arg Asn Gin Gly Ser Ala Ala Glu Arg Leu He Thr 



Arn L^u Tyr Leu Leu Leu P^e Asp Gin Ser Gly Ma Asn Pro Ala Ly. 



65 70 



Tyr Tyr He Thr Gly Asn Thr Ph- Thr Gly Gly Thr Trp Leu Pro A^o 
85 on 



Asp Met Lys Val Lys Leu Asp Met Thr Gin Ser Glu Ala Gly Glu Arg 

Lys Val Tyr Val Val Ala Asn Val Asp Asn Air. Val Lys Thr Ala Leu 
115 120 
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35 



45 
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Asp Ala Val Ala Asn Glu Ser Asp Leu Gin Thz Vol Lys Arg Thr Thr 

Ala Met Pre Trp Ser Thr Asp He Ala S^r Pro ^hl Leu Met Ser Glv 
_ 150 

5 Asn Lys Thr His Asp Phe Leu Ala A^-n Arg Leu Lou Asp Arn Val Pro 
, -"-^^ 175 

Leu val Arg Ala Tie Ala Lys Val Glu Leu Asn He Ser Leu S-r Glu 

j^g^ 

Lys Phe Gin He Val Pro Il« He Val A^n Gly Ser Leu Ser GJ u Ph^ 

ZOO 205 
Ly^ Phe Arg Tyr Val A^n Phe Asp Lys Glu Thr Tyr Val Val Lys Pro 

215 220 
Thr Thr Lys Pro Aep Asn Leu He Ser Ser Al s Asn Gly Val Trp Pro 

15 Gin He Thr Asp Trp Thr Val Trp Gly Ala Ser Leu Asn Thr Ser P^o 
245 2SC 255 

Ala Pro Asp Ala Gly Thr Gly Tyr Thr Leu A.<rp Ala Asn Gly Ly« Val 
260 265 270 

20 '^^'^ !*7S '^''^ '^^^ '^^'^ '^^"^ ^''^ S'^^ Lys Gly 

^ 280 2B5 

Ala Thr Val Glu Val Ala Leu Pro Arg Val A.p Asp Gly Thr Leu Pro 

Pro Pro Glu Phe Gly Pro Gl'u Leu Tyr Arg Leu Pro Leu Pro Asp Lys 

25 He Leu Arg Asn His Trp Tyr Lye: Tyr Glu Val Glu 13^ 
325 



(2) IHFORlIATIOl^ FOR SEQ ID NO: 4 58 

(i) SEQUEI/CE CHARACTERISTICS: 

<AJ LEHGTH: 315 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 



(Vl) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

(A) liAHE/KEY: misc^feature 

(B) LOCATIOH 1. . .315 

(Xi) SEQUEIICE DESCRIPTION: SEQ ID UO:AS8 



Het He Arg Thr He Leu Ser Arg Tyr Val Ser Ser Asn Phe Trp Ser 
50 Arg Gly Ala Thr Pl.e Phe Phe Thr He Phe Pro Ala Phe He ^eu Ala 
Ala Thr Ala I.«u Pro Ala Cys Gly Gly Gly Thr Ala Ser Gly Ser A^p 

55 ''''' lo' 55** ^^^^ ^'^^ ^---^ Ti'^ Ser 

lie Ala A^p Lys Ser Val Gin Val Val Ala Leu Val Pro Ala Gly Ser 
Asn Pro Glu Glu Tyr Asp Pro Ser Pro Thr Val Met Lys Arg Leu Ser 

bU Glu Ala Asp Ala Tyr Ph* Tj-r He Gly Gly L,u Sly Phe Glu Gin Arg 
lOO 105 110 

A-.n Leu Ala Ali IIb Arg Asp Aan A^n Pro L-,-' L*u Pro Leu Phe Glu 

65 isj' ""^'^ Ala Asp ill His Gly Ser 

Thr Asp His Ser Hl^ Thr Asp Leu His Ala S'ls Asp Pro His Tyr 
Trp Ser Ser Val Val G^y Ala Lys Ala Leu III Arg Ala Ala Tyr Asp 
70 Ala Leu V-1 Glu Lou Tyr Pro Asn Glu ^ys Asp Lys Irp Asp Jys Gl- 



75 



His Asp Arg Leu A^n Gly Arg He Asp Ser Vil Lys Arg tin Val A.-p 

200 205 
Thr Met Phe Ala Asn Gly Lys Ala A.p T.ys Ala Phe Val He Tyr His 
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Pro 


Sfer 


Leu 


Ser 


Phe 


Phe 


Ala 


Gin 


Glu 


Phe 


Gly 


Leu 


Arg 


Gin 


He 


Val 


225 










230 










235 










240 


He 


Glu 


t3Iu 


Asp 


Giy 


Lys 


Glu 


Pro 


Tht 


Ala 


Ala 


His 


Leu 


Arg 


Arg 


Val 










245 










250 










255 




He Asp 


Gin 


Ala 


Arg 


Ala 


Asp 


Gly 


Val 


Arg 


He 


Val 


Phe 


He 


Gin 


Pro 








260 










265 










270 








Phe 


Glu 


Thr 


Arg 


Gin 


Ala 


Glu 


Asp 


He 


Ala 


Arg 


Glu 


He 


Gly 


Ala 
















280 










285 








Acg 


Pro 


ViL 


Ary 


He 


Aen 


Pro 


I.eu 


Arq 


ser 


S^r 


Trp Glu 


Glu 


Glu 


He 




290 










295 










300 










Lsu 


Hia- 


He 


Ala 


Arg 


Ala 


Leu 


Ala 


His 


Glu 


Arg 













305 310 315 



(2) IMFORflATIOH fcOR SEQ ID UO: 4 SS 

U) SEQUEtlCF. CHARACTERISTICS: 

I A) LEllGTH: 949 amino acid? 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: procein 

(iii) nVPOTHETICAL; YES 

I vi ) ORIGIMAL SOURCE : 

(AJ ORGAlJISM: Porphyromonar glngivaliff 

<ix) FEATURE: 

(A) HAHE/KEY: mi:?=_f sature 
iB) LOCATION 1. . .848 



ixi) SEQUEIJCE DESCRIPTION: i'EQ ID NO: 469 



Met 


He 




Lys 


Lys 


He 


Ph« 


Phe 


He 


Leu 


Lexj 


Ala 


Leu 


He 


Ala 


Phfe 


1 








5 










IC 










15 




Ser 


Gly Leu 


Asn 


Ala 


Ala 


Thr 


Asp 


Thr 


Glu 


Phe 


Lys 


Tyr 


Pro 


Thr 


Asp 








20 










25 










30 






Ala 


Arn 


lie 


He 


Gly 


His 


Val 


Lyp 




Ser 


Lys 


Thr 


Gly 


Glu 


His 


Leu 






35 










40 










45 








Val 


Gly 


He 


Thr 


He 


Ala 


He 


Lys 


Giy 


Thr 


Thr 


Phe 


Gly 


Thr 


Ser 


Thr 




50 










55 










60 










Asp 


Ala 


Thr 


Gly 


His 


Tyr 


Tyr 


Leu 


Arg 


Asn 


Leu 


Arg 


Pro 


Gly Glu 


He 


65 










70 










75 










80 


Thr 


Lsj 


He 


net 


Arg 


Gly 


riet 


Gly 


T-yr 


Lys 


Ser 


Gin 


Glu 


Arg 


Val 


Val 










85 










90 










95 




Arg 


Val 


Glu 


Lys 


Asp 


Lys 


Thr 


He 


Glu 


Val 


Asn 


Phe 


Glu 


Ala 


Glu 


Giu 








100 










105 










110 






Asp 


Ala 


He 


Asn 


Leu 


Asp 


Glu 


Val 


Val 


He 


Scr 


Ala 


Asn 


Arg 


Clu 


Lfiu 






115 










120 










125 








Thr 


Leu Ary 


Arg 


Leu 


Ala 


Pro 


Thr 


Leu 


Val 


Asn 


Val 


Leu 


Asn 


Glu 


Lys 




130 










135 




















Val 


Phe 


Ser 


Gin 


Val 


Asn 


Ala 


Ser 


Asn 


Leu 


Ala 


Gin 


Gly 


L»u 


Ser 


Phe 


145 










150 










155 










160 


Gin 


Pro 


Gly 


Val 


Arg 


Val 


Glu 


Asn 


Asn 


Cys 


Gin 


Asn 


Cys 


Gly 


Phe 


Asn 










165 










170 










175 




Gin 


Val 


Arg 


He 


Asn 


Gly 


Leu 


Asp 


Gly 


Arg 


Tyr 


Ala 


Gin 


He 


Leu 


He 








ISO 










185 










190 






A5p 


Ser 


Arg 


Pro 


He 


Met 


Ser 


Ala 


Leu 


Ala 


Gly Val 


Tyr 


Gly 


Leu 


Giu 






195 










200 










205 








Gin 


He 


Pre 


Ala 


Asn 


Het 


He 


Glu 


Arg 


Val 


Glu 


Val 


Val 


Arg Gly 


Gly 




210 










215 










220 










Gly Ser Ala 


Leu 


Tyr 


Gly 


Ser 


Ser 


Ala 


He 


Ala 


Gly 


Val 


Val 


Asn 


He 


225 










230 










235 










240 


He 


Thr 


LV5 


Glu 


Pro 


Ser 


His 


Arn 


Scr 


Phe 


Thr 


Phe 


Aan 


Glu 


Ser 


Leu 










245 










250 










255 








Thr 


Gly 


Phe 


Ser 


Lys 


Leu 


Asp 


Asn 


Asn 


Thr 


Asn 


Phe 


Asn 


Ala 








260 










265 










270 






Ser 


He 


Val 


Ser 


Asp 


A&*p 


Af-n 


Arg 


Ala 


Gly 


Ala 


Met 


Val 


Phe 


Gly 


Gin 






275 










280 










-285 








Ala 


Arg 


Tyr 


Arg 


Aan 


Hla 


Trp 


Asp 


Ala 


Ain 


Asn 


Asp 


Gly 


Tyr 


Ser 


Glu 




290 










295 










300 










Leu 


Gly 


Lys 


He 


Asp 


Ala 


Arg 


Ser 


Leu 


Gly 


Ala 


His 


Ser 


Tyr 


Leu 


Arg 


305 










310 










315 










320 


Leu 


Ser 


A--r 


Tyr 


Ser 


Lys 


Leu 


Thr 


Gly 


Glu 


Phe 


Hl^ 


Thr 


He 


Sc!r 


Glu 










3;? 5 










330 










335 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



The 


eg 


Acg 


Gly 




Asp 


Arg 


lis Asp Lsu 


Pro 


Pro 


His 


Val 


Val 


Gly 








340 








J46 








350 






Val 




Glu 


Gin 


Thr 


Asp 


His 


Ser Val Phe 


Ser 


Gly Asn 


Leu 


Lys 


Tyr 






355 










360 






365 








Asp 


Leu 


Ph** 


S^c 


Ser 


Asn 


Tyr 


Lys His His 


Phe 


Gin 


Ala 


Tyr 


Thr 


Ser 












375 






380 












Gin 




Val 


Asn 


AC J 


Lys 


Ser Tyr Tyr 


Gly 


Gly 


He 


Gly Glu 


He 


385 










3 90 






395 










400 


Asp 


Val 






His 


Pro 


Gly Gly Thr Glu 


Gly 


Tyr 


Pro 


He 


Pro 


Gin 








4 05 






410 










415 




Asp 


Gin 


Tyr 


Gly 


Asn 


Asn 


Tyr 


Gly Val Thr 


Lys 


Gly 


Lys 


Thr 


Tyr 


Met 








420 








425 








430 






Gly 




J, ^ 


bin 


Tyr 




Tyr 


Asp Leu Asp 


Lys 


Phn 


Leu 


Leu 


Met 


Pro 






435 










440 






445 








Svr 


'31 n 


Leu 


Leu 






Ala 


Glu Tyr Thr 


Arg 


Asp 


Glu 


Leu 


Asn 


Asp 




4 50 










455 






460 










V 


Met 


Pro 


lie 


Lea 


Ser 


Trp 


Gin Thr Gly 


Gin 


Asp Ala 


Asn 


Gly Asn 


4 65 










470 






475 










480 


Thr 


11 6 


Pro 


Leu 


Tyr 


Pro 


G1 ti 


Leu Asp Gin 


Asn 


He 


Asn 


Asn 


Tyr 


Ser 










48b 






490 










4 95 




L^u 


Phff 


Gly 


Gin 


Asn 


Glu 


Trp 


Lys Asn Asp Arg 


Trp 


Ser 


He 


Leu 


Val 








500 








505 * 








510 






Gly 


Ala 


Arg 


Leu 


Aep 


r.ys 


His 


Ser Glu Val 


Lys 


Asp Met 


He 


Leu 


Ser 




515 










520 






525 








Pro 


Arq 


Thr 


Thr 


Leu 


Arg 


Phe 


Asn Val Asn 


Pro 


A.?p 


He 


Asn 


Leu 


Arg 




530 










535 






540 










Ala 


Thr 


Tyr 


Ala 


Lyc 


Gly 


Phe 


Arg Ala Pro 


Gin 


Val 


Phe 


Asp 


Glu 


Asp 


54 5 










550 






555 










560 


Leu 


His 


Val 


Gly 


Val 


Val 


Gly 


Gly Glu Ala 


Gin 


Lys 


Val 


Phe 


Asn Asp 










555 






570 










575 




Pro 


Asn 


Leu 


l.yr> 


Pro 


Glu 


He 


Ser His Ala 


Phe 


Ssr 


Leu 


Scr 


Ala 


Asp 








580 








585 








590 






Met 


Tyr 


His 


Arg 


Phe 


Gly 


Asn 


Val Gin Thr 


Asn 


Phe 


Leu 


Val 


Glu 


Gly 






5G5 










600 






605 








Phe 


Tyr 


Thr Arg 


Levi 


Leu 


Asp 


Val Phe Thr 


Asn 


Glu 


Glu 


Gin 


Pro 


Asp 




610 










615 






620 










Gin 


His 


Asp Gly 


lie Lys Arg Tyr Thr Arg 


He 


Asn Gly 


Ser 


Gly Ala 


625 










630 






635 










640 


Lys 


Val 


Phe 


Gly 


Leu 


Asn 


Leu 


Glu Gly Lys 


Val 


Ala 


Tyr 


Lys 


Ser 


Phe 










645 






650 










655 






L"u 


Gin 


Ala 


Gly Leu 


Thr 


Leu Ala Ser 


Asn 


Lys 


Tyr Asp 


Glu 


Ala 








660 








665 








67 0 






(ji n 


Glu 


Trp Gly 


Leu 


Asn 


Thr 


Val Lys Asp Thr 


Asn 


Gly Ala 


Phe 


Val 






€75 










680 






685 








Thr 


Glu 


Ala 


Asn 


Ala 


Asfi 


Gly Gin Gin Glu Tyr 


Lys 


Asn 


Glu 


Ser 


Met 




690 










695 






700 










Thr 


Asp 


Thr 


Gin 


He 


Thr 


Arg 


Thr Pro Ssr 


Val 


Tyr 


Gly 


Tyr 


Phe 


Thr 


705 








710 






715 










7 20 


Lsu 


Ala 


Tyr 


Asm 


Pro 


Ala 


His 


Sf^r Trp Asn 


He 


Ala 


Leu 


Thr 


Gly 


Ala 
















730 










7 35 




Tyr 


Thi- 


Gly 


Gin 


Her 


Tyr 


Val 


Pro His Ala 


He 


Glu 


Tyr Gly Val 


Lys 








740 








745 








7 50 






Ser 


Ala 


Glu 


Leu 


Asp 


He 


tSet 


Gin Asn Asn 


Pro 


Glu 


He 


Thr 


Asp 


Glu 






755 










760 






765 








Thr 


Gly 


I.ys 


Ala 


Pro 


Arg 


Il« 


Asp Glu Leu 


Lys 


Lys 


Thr 


Pro 


Ala 


Phe 




no 










775 






780 










Phe 


Asp 


Leu Gly 


Leu 


Lys 


Val 


Gly Tyr Asp 


Phe 


His 


Val 


Phe 


Gin 


Ala 


785 










7 90 






795 










800 


Thr 


Glu 


Val 


Gin 


Leu 


Tyr 


Val 


Gly ?tet Asn 


Arsn 


He 


Phe 


Asn 


Ser 


Phe 










805 






810 










815 




Gin 


Lys 


Afi-p 


Phe 


Asp 


A eg 


Gly Ala Ala Arg 


Asp 


Ser 


Gly Tyr 


He 


Tyr 








820 








825 








830 






Gly 


Pro 


Thr 


Gin 


Pro 


Acg 


Thr 


Gly Tyc Hfit 


Gly 


Leu 


Val 


Val 


Lys 


Phe 






835 










840 






845 









(2) INFOWIATIOH POR SEQ ID tIO:460 



(1) SEQUEUCE CHARACTERISTICS; 

(A) LEIIGTH: 202 amino acids 
70 {B} TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) IIOt.ECULE TYPE; protein 

75 (ill I HYPOTHETICAL: YES 



SUBSTmJTE SHEET (Rule 26) (RO/AU) 
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402/490 



(vi) ORIGINAL SOURCE: 

(A) ORGAllISM: Porphyromonas gingival i5 

tlx) FE^^TURE: 

(;\) umE/KEY: misc feature 
(Bl LOCATION 1 . . .202 



(r.i) SEQUENCE DESCRIPTIOt^I : SEQ ID MO: 4 60 



Met 


Thr 


Vai 


Lys 


Arg 


Ala 


Val 


Arg 


lie 


Ala 


Leu 


Leu 


Thr Leu 


He 


Gly 


1 








5 










10 








15 




lift 


L«ii 


Phe 


Ser 


Ser 


Pro 


Ser 


Leu 


Val 


Arg 


Ala 


Gin 


Ser Leu 


Phfi 


Ser 








20 










25 








30 






Thr 


Giu 


His 


Val 


Leu 


Gin 


Leu 


Tyr 


Asn 


Lys 


He 


Leu 


Tyr Gly 


Glu 


Ser 






3S 










40 










4 5 






Ala 


Aln 


A.?p 


Thr 


Val 


Ala 


Glu 


Lys 


Thr 


Ala 


Gly 


Glu 


Ser Ala 


Phe 


Pro 




50 










55 








60 








Phe 


lie 


Asp 


Lys 


Lfiu 


lie 


Acn 


Leu 


Gly 


Arg 


Thr 


Phe 


Leu Gly 


Lys 


Pro 


65 










-70 


















80 


Tyr 


Arg 


Tyr 


Arg 


Sly 


Pro 


Ser 


Pro 


Trp 


Pro 


!let 


Arp Cye Ser 


Gly Tyr 










85 










90 








95 




Vol 


Ser 


Tyr 


Leu 


Tyr 


Ser 


Lys 


Phe 


Asp 


He 


Lys 


Leu 


Pro Arg 


Gly Ala 








100 










106 








110 






Ala 


Air. 


Gin 


Ser 


Gin 


Tyr 


Thr 


Asn 


Pro 


He 


Glu 


Arg 


Glu Afp 


Val 


Arg 






115 










120 










125 






Pro 


Gly 


Asp 


Leu 


Leu 


Phe 


Phe 


Lys 


Gly 


Arg 


Asn 


Ala 


Arg Ser 


Asn 


Arg 




130 










135 










140 








lis 


Gly 


His 


Val 


Ala 


Leu 


Vai 


Val 


Ser 


Val 


Asp 


Glu 


Asp Asp 


He 


Thr 


145 










150 










155 








160 




Met 


His 


Ser 


Arg 


Asn 


Ser 


Arg 


Gly 


He 


Val 


He 


Glu Lys 


Leu 


Asn 










165 


















175 




Arg 


Ser 


Ala 


Tyr 


Phe 


Ser 


Arg 


Arg 


Le\j 


Val 


Ser 


Tyr 


Gly Arg Val 


Pro 








190 










185 








190 






Gly 


Ala 


Lys 


Arg 


Val 


lie 


Pro 


Arg 


Lys 


Set 
















195 










200 
















12) 


INFORMATIOI4 


FOR 


SEQ 


ID NO: 461 















U) SEQUEUCE CHARACTERISTTCS: 

(A) LENGTH; 455 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAinsil: Porphyromonas gingivalis 

(ix^ FEATURE: 

{AJ HAME/KEY: misc_Ceature 
IB) LOCATIOM 1. . . 455 



txi) SEQUEMCE DESCRIPTION: SEQ ID 710:4 61 



Met 


Lys 


Arg 


Thr 


He 


Leu 


teu 


Thr 


Ala 


Leu 


Thr 


Val 


Lsu Ser Ser 


Leu 


1 








5 










10 






15 




Ser 


Leu 


Leu 


Arg 


Ala 


Gin 


Asn 


Glu 


Ser 


Glu 


Ala 


Ser 


Thr PiSTx Pro 


Het 








20 










25 








30 




Ser 


Gly 


Leu 


Ser 


Leu 


Glu 


Asp 


Cys 


Tie 


Arg 


He 


Ala 


Lys Glu Arg 


Asn 






35 










40 










4S 




Leu 


Asn 


Leu 


Arg 


Arg 


Gin 


Glu 


He 


Glu 


Gin 


Glu 


Asn 


Arg He He 


Ser 




50 










55 










60 






Leu 


Asp 


Ala 


Ala 


Arg 


His 


Ser 


Phe 


Leu 


Pro 


Ser 


Val 


Asn Ala Gly He 


65 










70 










75 






00 


Gly 


His 


Asn 


Tyr 


Ser 


Phe 


Gly 


Arg 


Ser 


Lys 


Arp 


Lyf 


Thr Gly Val 


Thr 










95 










SO 






95 




Val 


Asp 


Arg 


Ser 


Ser 


Hst 


Aen 


Thr 


Asn 


Leu 


Ser 


He 


Gly Ala Ser 


Vol 








100 










105 








110 




Glu 


Val 


Phe 


Ser 


Gly 


Thr 


Arg 


Arg 


Leu 


His 


Asp 


Leu 


Lys Gin Gin 


Ly5 






115 










120 










125 




Tyr Asn 


Val 


Giu 


Asp 


Gly 


He 


Ala 


Arg 


Leu 


Gl n 


hy^ 


Ala Arg Glu 


A^p 




IJO 










135 










140 
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Leu 


Gin 


lie 


Ala 


Ala 


Leu 


Tyr 


He 




Leu 




145 










150 










155 






Glu 


Met 


Thr 


Arg 


Thr 


Ala 


Glu 


Thr 


Gin 


Leu 


Ala 


L*u 


lie 










165 










170 












Arq 


Thr 


Ala 


Glu 


Met 


Val 


Axg 


Val 


UAy 


ijyb 










180 










105 










Lys 


eu 


Leu 


Asp 


He 


A£n 


Ala 


Gin 


Mec 


Ala 


Lys 


Asp 


Glu 




195 










200 










205 


Val 


Gin 


Tyr Arq 


Ser 


Glu 


(»lu 


Glu 


Leu 


Ala 


Arg 


Leu 


Asp 




210 










215 










220 




Ala 


Leu 


kjI u 


Leu 


Slu 


His 


Pro 


Glu 


Ser 


He 


Ala 


Val 




225 










230 










235 






Ihr 


sp 


Val 


Leu 


Val 


Ala 


Glu Arg 


Leu 


Gly 


Ser 


Leu 


Leu 










245 










250 








Glu 


lie 


Tyr 


Arg 


Thr 


A] a 


Leu 


Gly 


Leu 


Lys 


Pro 


Ala 


Leu 








260 










265 










Glu 


Leu 


Gin 


He 


Ala 


Ser 


Ala 


Arg 


Glu 


Gly 


Leu 


Ala 


Ser 






275 










200 










285 


Al a 


T yr 


Phs 


Pro 


Thr 


Leu 


Ser 


Leu 


r 


Ala 


Gly 


Tyr 


Ser 




2 90 










2 95 










300 




Phe 


Arg 


Asp 


Leu 


Giy Lyff 


Glu 


Tyr 


Ala 


Ala 


He 


Arn 


Pro 


305 










310 










315 








Gin 


Trp 


Lyf 


Asn 


Asn 


Gly 


Ser 


Tyr 


Ser 


He 


Gly 


Leu 










325 










330 








He 


Pro 


lie 


Phe 


Ser 


Ala 


Met 


Gin 


Thr 


Gin Aip 


Arg 


Val 








340 










345 










Arg 


Leu 


Gin 


rie 


Arg 


Ser 


Ser 


Glu 


Leu 


Arg 


Leu 


Val 


Glu 




355 










360 










3 55 


Ala 


Leu 


Tyr 


Lys 


Glu 


He 


Arg 


Gin 


Ala 


Tyr 


Ser 


Asn 


Ala 




37 0 










375 










300 




Asp 


Lya 


Ala 


He 


Ala 


Ala 


Ala 


Glu 


Asn 


Ser 


Lys 


Ala 


Ala 


385 










390 










395 






Ala 


Tyr 


Glu 


Ti-r 


Ala 


Arg 


A£p Ser 


Phc 


Glu 


Ala 


Gly Arg 










405 










410 








Tyc 


Glu 


Tyr 


Ala 


Glu 


Ala 


Lys 


Thr 


Lys 


Tyr 


Ala 


Leu 


Ser 








420 










425 










Glu 


Leu 


Arg 


Ala 


Lys 


Tyr 


Asp 


Phe 


IXe 


Tyr 


h'/S 


Ala 


Lys 






435 










440 










445 


Phc 


Tyr 


Gin 


Gly Lys 


Asp 


Phe 
















450 










455 
















INFORMATIOH 


FOR 


SEg 


ID NO:462 











(i) SEQUEMCE CHARACTERISTICS: 

fA> LEHGTH: 444 amino acids 
IR) r/FE: amino acid 
[Dt TOPOLOGY: linear 



Ui) HOLECULb: TYPE: protein 

(lU) HYPOTHETICAL: YES 

Ivll ORIGIHAL SOURCE: 

{h) ORGAMISH: PorpUyromonas ginglvalls 

{Ixj FEATURE: 

(A) HAJ^E/KEY: ir.isc_£eat:ure 
(Bl LOCATION 1. . .444 



(xl) SEQUENCE DESCRIPTION: SEQ ID NO: 462 



Ket 


Arg 


Phe 


Gin 


His 


Tyr 


Leu 


Tie 


cys 


Thr 


Ala 


Ala 


Val 


1 








5 










10 








Ala 


Ala 


Affu 


Pro 


Leu 


Thr 


Gly 


Gin 


Ser 


Asa 


Ifet 


Thr 


Leu 








20 










25 










He 


Asp 


Tyr 


Ala 


Arg 


Arg 


His 


Ser 


Ser 


Ala 


Val 


Ala 


Leu 






35 










40 










4 5 


Glu 


Lou 


Glu 


Gin 


Ser 


Lys 


Ala 


Asp 


Tyr 


Leu 


Gin 


Ala 


Val 




SO 










55 














Leu 


Pro 


Arg 


Val 


Ser 


Ala 


Gly 


Thr 


Gly 


Ala 


Ser 


Trp 


Asn 


65 










70 










75 






Gly 


Leu 


Asp 


Ala 


Glu 


Thr 


A£n 


Thr 


Tyr 


Thr 


A^p 


He 


Asn 










85 










90 








Asn 


Ser 


Tyr 


Ser 


He 


His 


Ala 


Thr 


Met 


Thr 


Leu 


Phe 


Asp 



100 105 
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Phe 


Arg 


Gin 






160 


Arg 


Glu 


Gin 




175 




Ala 


Glu 


Gl V 


1 90 




Gin 


Leu 


Leu 


Leu 


Gly 


Gin 


Ala 


Pro 


Asp 






240 


Ala 


Pro 


Gl u 




255 




His 


Ser 


er 


270 






Ala 


Arg 


Al a 


Asn 


Gly 


Tyr 


Ser 


Phe 


Ser 






320 


Ser 


Leu 


Affn 




335 




Arg 


Ser 


Ser 


350 






Glu 


Ly« 


Lys 


Val 


Ala 


Ala 


Thr 


Leu 


Lys 






400 


Leu 


Ser 


Ala 




415 




Gin 


VaJ 


Gltl 


430 






Val 


Leu 


Asp 



Ala 


Ala 


Leu 




15 




Glu 


Glu 


Cys 


30 






Ser 


Ala 


Ala 


Gly 


Asn 


Phe 


Phe 


Gly 


Arg 






80 


Ser 


Phe 


Asn 




95 




Gly 


Leu 


Gin 


110 
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Vol 


T" r 


Arq 


Leu 


Arg 


Met 


Ala 


His 


Ala 


Arq 


Arg 


Glu 


Ala 


Ser 


Arq 






111* 










120 










125 








Leu 


Ser 


Val 


Arq 


Giu 


Gin 


Gin 


Glu 


Leu 


Ala 


Ala 


L«u 


Gly 


Thr 


Thr 


Glu 




130 










135 










140 










Al a 


Tyr 


Tyr 


Af p 


Lgu 


Val 


Tyr 


Ala 


Arg 


Gin 


HsL 


Gin 


Glu 


Leu 


Ala 


Met 


14 5 










150 










155 










160 


n J n 


i«y.. 


Tvr 


Glu 


Glu 


Ser 


Ser 


Arg 


Lgu 


Hl5 


Arg 


Gl n 


Thr 


Ala 


Arg 


Met 










165 










170 










175 




(31 u 


Gl u 


Lfcu 


Gly Met 


Lys 


Ser 


Arg 


Pro 


A^p 


Val 


L&u 


Glu 


IfGt 


CJn 


Ser 








IBO 










185 










190 








Nst 


Al a 


Gly Asp 


Arg 


L^fU 


Ala 


Leu 


Thr 


Gin 


Ala 


A:fp Asn 


Gin 


Cys 






1 95 










200 










205 








lie 


lie 


Ala 


Leu 


He 


Arg 


Leu 


Lys 


Glu 


Lys 


Met 


Asr. 


Phe 


Pro 


He A^p 




210 










215 










220 












Glu 




Val 


Val 


Asp Affp 


Met 


Pro 


Ala 


Affp 


Se r 


Leu 


Ser 


Ala 


A5P 


22^ 










230 










235 










240 


Het 


Ala 


Gl u 


Ser Asp 


Ser 


Ser 


Ala 


Uly 


Val 


Phe 


Al A 


Arg 


Ala 


Ala 


His 










245 










250 










255 




HI.-? 


His 


o 


Val 


Leu 


Leu 


Arg Ala 


Lys 


Leu 


Asp 


Gl 


Gin 


Ala 


Ala 


Thr 








260 










265 










270 






Asp 


Ai'j 


L&U 


Arq 


Ala 


Ala 


Arg 


Gly 


Ala 


Phe 


Leu 


Pro 


Scr 


Val 


Ser 


val 






27 5 










280 










285 








S<?r 


Glv 


Gly 


Trp 


Affn 


Thr Gly 


Pho 


Ser Arq 


Ph* 


Leu 


A5n 


Gly Ser 


Asp 




290 










295 










300 








Tyr 


The 


Fro 


Pho 


Ser 


Glu 


Gin 


Phe Arg 


Asn Arg 


Arg 


Gly Glu 


Tyr 


Val 


30t; 










310 










315 










320 


Ser 


Leu 


Arn 


Leu 


Ser 


lie 


?ro 


lie 


Phe 


Ser 


Gly 


Phe 


Ser 


Leu 


Val 


Ser 










325 










330 










335 




His 


Leu 


Arg 


Gin 


Ala 


Arq 


Ala 


Glu 


Arg 


Arg 


Ala 


Ala 


Tie 


Val 


Arg 


Arg 








340 










34 5 










350 






Gly 


Glu 


Ala 


Glu 


Arg 


Arg 


Leu 


Tyr 


Ser 


Glu 


lie 


Ala 


Gin 


Ala 


Wet 


Ala 






355 










360 










365 








Asp Ary 


A£p 


Ala 


Ala 


Leu 


Ala 


Ser 


Tyr Arg 


Gin 


Alo 


Lys 


Glu 


His 


Thr 




370 










375 










380 










Asp Ala 


llet 


Gin 


Thr 


Ala 


Tyr 


Glu 


Ala 


Val 


Leu 


Gin 


Arg Tyr 


Glu 


Glu 


365 










390 










395 










400 


Gly 


Leu 


Asn 


Thr 


Ala 


lie 


Asp 


Leu 


Thr 


Thr 


Gin 


Ala 


Asn 


Arg 


Leu 


Leu 










405 










410 










415 






Ala 


Arg 


Val 


Gin 


Arg 


Leu 


Arg 


Ala 


Ala 


lien 


Thr 


Tyr 


Arg 


Leu 


Lys 








420 










425 










4 30 






Cys 


hys 


Leu 


lie 


Ala 


Tyr 


Tyr 


Gly Cys 


Leu 


Ser 


Asp 














435 










440 



















(21 IK FORMAT lot I FOR SEQ ID HO: 4 63 

ii) SEQUEtJCE CHARACTKRISTICS: 

(A) LEKGTH: 940 amino acid^; 
(D) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) llOLfc:Cl.)LE TYPE: protein 

(iii) HYPOTHETICAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGAMISM: Porphyromonas ginglvaiie 

(IxJ FEATURE: 

(A) MAItE/KEY: misc feature 

(B) LOCATIOtI 1. . .940 







SEQUENCE DESCRIPTIOW: I 


SEQ ; 


[D MO: 463 










Het 


Asn 


hys 


Plie 


Tyr 


Lys 


Ser 


Leu 


Leu 


Gin Ser Giy 


Leu 


Ala 


Ala 


Ph- 


1 








5 










10 






15 




Val 


Scr 


I-tet 


Ala 


Thr 


Ala 


Leu 


Thr 


Ala 


Ser Ala Gin 


He 


Ser 


Phe 


Gly 








20 










25 






30 






Gly 


Glu 


Pre 


Leu 


Ser 


Phe 


Scr 


Ser 


Arg 


Ser Ala Gly Thr 


His 


Ser 


Phe 






35 










40 






45 








Asp Asp 


Ala 


Met 


The 


He 


Arg 


Leu 


The 


Pro As'p Phe 


Asn 


Pro 


Glu 


Asp 




50 










55 






60 










Leu 


He 


Ala 


Gin 


Ser 


Arg 


Trp 


Gin 


Sec 


Gin Arg Asp 


Gly Arq 


Pro 


Val 


65 










70 








75 








80 


Arg 


lie 


Gly 


Gin 


Val 


He 


Pro 


v«i 


Asp 


Vai Afp Phe 


Ala 


Ser 


Lys 


Ala 










85 










90 






95 





SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



ser 




He 


Ser 


Ser 


He 


Gly A£p Val 


Asp Val 


Tyr 


Arg 


Leu Gin Phe 








100 










105 










110 


Lys 


Leu 


Glu 


Gly 


Ala 


Lys 


Ala 


He 


Thr 


Leu 


Tyr 


Tyr 


A£p Al^ Phe Asn 




115 










120 










125 




n ft 


Pro 


Glu 


Gly Gly Arg 


Leu Tj'r 


Tie 


Tyr 


Thr 


Pro 


Asp His Glu He 




130 










135 










140 






Val 


Leu 


Gly 


Ala 


Tyr 


Thr 


Asn 


Ala 


Thr 


His 


Arg 


Arg 


Asn 


Gly Ala Phe 


145 










150 










155 






160 


Ale 


Thr 


Gl n 


Pro 


Val 


Pro 


Gly Ser 


Glu 


Lgu 


He 


Met 


Asp 


Tyr Glu Val 










165 










170 








175 


Ser 


Arg 


Gly 


Ojy 


Thr 


Leu 


Pro 


Asp 


He 


Lys 


He 


Ser 


Gly 


Ala Gly Tyr 








leo 










185 










190 


He 


Phe 


Afp 


Lys 


Val 


Gly 


Gly 


Arg 


Pro 


Val 


Thr 


Asp 


Asn 


His Tyr Gly 






195 










200 










205 




He 


Gly 


Glu 


Asp 


Asp 


Ser 


Asp 


Ser 


Asp 


Cys 


Glu 


He 


Asn 


He Asn Oys 




210 










215 










220 




Pro 


Glu 


Gly 


Ala 


A5rp Trp 


Gin 


Ala 


Glu 


Lys 




Gly Val 


Val Gin Met 


220 










230 










235 






240 


He 


net 


Val 


L/f 


Gly 


Gin 


Tyr 


He 


Ser 


Met 


Cys 


Ser 


Gly Asn Leu Leu 










245 










250 








255 


Asn 


Asn 


Thr 


Lys 


Gly 


Asp 


Phe 


Thr 


Pro 


T.CU 


He 


He 


Ser 


Ala Gly His 








260 










265 










270 


Cys 


Ala 


Ser 


He 


Thr 


Thr 


A5n 


Phe 


Gly 


Val 


Thr 


Gin 


Ser 


Glu Leu Asp 






27 










280 










285 




Lys 


Trp 


He 


Phe 


ThV 


Phe 


His 


Tyr 


Glu 


Lys 


Arg 




Cys 


Ser Asn Gly 




290 










295 










30O 






Thr 


Leu 


Ala 


He 


Phe Arg Gly A^n 


Ser 


He 


He 


Gly Ala 


Ser Met Lys- 












310 










315 






320 


Ala 


Phe 


Leu 


Pro 


He 


Lys 


Gly Lys 


.Ser 


Asp 


Gly 


Leu 


Leu 


Leu Gin Leu 










325 










330 








335 


Asn 


A^p 


Glu 


Val 


Pro 


Leu 


Arg 


Tyr 


Arg 


Val 


Tyr 


Tyr 


Asn 


Gl y Trp Asp 








340 










34 5 










350 


Ser 


Thr 


Pro 


Asp 


He 


Pro 


Ser 


Scr 


Gly 


Ala 


Gly 


He 


His 


His Pro Ala 






355 










360 






365 




Gly 


As-p 


Ala 


Met 


Ly;j 


He 


Ser 


He 


Leu 


Lys 


Lys 


Thr 


Pro 


Ala Leu Asn 




370 










375 










3B0 






Thr 


Trp 


He 


Ser 


Ser 


Ser Gly Scr 


Gly Gly Thr 


Asp 


Asp 


His Phe Tyr 


385 










390 










395 






400 


Phe 


Lys 


Tyr 


Asp 


Gin 


Gly Gly Thr 


Glu 


Gly 


Gly 


Ser 


Ser 


Gly Ser Ser 










405 










410 








415 


Leu 


Pho 


Aan 


Gin 


Asn 


Lys 


His 


Val 


Val 


Gly Thr 


Leu 


Thr Gly Gly Ala 








420 










425 










4 30 


Gly 


Asn 


Cys 


Gly 


Gly Thr 


Glu 


Phe 


Tyr 


Gly 


Arg 


Leu 


Afn 


Ser His Trp 






435 










440 










445 




Asn 


Glu 


Tyr 


Ala 


Ser Asp Gly 


Asn 


The 


Ser 


Arg 


Met 


Asp 


He Tyr Leu 




450 










455 










460 






Asp 


Pro 


Gin 


Atn 


Asn 


Gly 


Gin 


Thr 


Thr 


He 


Leu 


Asn 


Gly Thr Tyr Arg 


465 










470 










475 






460 


Arp 


Gly 


Tyr 


Lys 


Pro 


Leu 


Pro 


Ser 


Val 


Pro 


Arg 


Leu 


Leu 


Leu Sin Ser 










485 










490 








4 95 


Thr 


Gly 


Afp 


Gin 


Val 


Glu 


Leu 


Asn 


Trp 


Thr 


Ala 


Val 


Pro 


Ala Asp Gin 








50 0 










505 










510 


Tyr 


Pro 


Ser 


Ser Tyr Gin Val 


Glu 


Tyr 


His 


lie 


Phe Arg Asn Gly Lyr 






515 










520 










525 




Glu 


He 


Ala 


Thr 


Thr 


Lys 


Glu 


Leu Sf*r Tyr 


Ser 


Asp 


Ala 


He Aifp Glu 




530 










535 










540 






Ser 


He 


He 


Gly Ser Gly 


He 


He 


Arg Tyr 


Glu 


Val 


Ser 


Ala Arg Phe 


54 5 










550 










555 






£60 


life 


Tyr 


Pro 


Ser 


Pro 


Leu 


Asp 


Gly Val 


Glu 


Ser 


Tyr 


Lys 


Asp Thr Asp 










S65 










570 








575 


Lys 


Thr 


Ser 


Ala 


Affp 


Leu 


Ala 


He 


Gly Asp 


He 


Gin 


Thr 


Lys Leu Lys 








580 










585 










590 


Pro 


hsp 


Val 


Thr 


Pro 


Leu 


Pro Gly Gly 


Gly Val 


Ser 


Leu 


Ser Trp Lys 






S95 










€00 










€05 




Val 


Pro 


Phe 


Leu 


Sor 


Gin 


Leu 


Val 


Ser Arg 


Phe 


Gly 


Glu 


Ser Pro Asn 




610 










615 










620 






Pro 


Val 


Phe 


Lys 


Thr 


Phe 


Glu 


Val 


Pro 


Tyr 


Val 


Ser 


Ala 


Ala Ala Ala 


625 










630 










635 






540 


Gin 


Thr 


Pro 


Asn 


Pro 


Pro Val tily Val Val 


He 


Ala 


Asp 


Lys Phe Met 










64 5 










650 








655 


Ala 


Gly 


Thr 


Tyr 


Pro 


Glu 


Lys 


Ala 


Ala 


21e 


Ala 


Ala 


Val 


Tyr Val Hot 








660 










665 










670 


Pro 


Ser 


Ala 


Pro 


As'p 


Ser 


Thr 


Phe 


Hie 


Leu 


Phe 


Leu 


Lys 


Ser Asn Thr 






6TS 










6G0 










6R5 






flirt] 


Arg 


Leu 


Gin 


Lys Va) 


Thr 


Thr 


Pro 


Sei 


Asp 


Trp 


Gin Mn Gly 



SUBSTITUTE SHEET (Role 26) (RCVAU) 
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690 








695 










700 










Thr 


Trp 


Leu 


Arg He 


Asn 


Leu 


Asp 


Lys 


Pro 


Phs 


Pro 


Val 


Asn 


Asn Asp 


70S 








710 










715 










720 


His 


(-lit 


Leu 


Phe Ala 
725 


Gly 


He 


Arg 




Pro 
730 


Arn 


Lys 


Tyr 


Lys 


Leu 
735 


Asn 


Arg Ala 


He Arg Tyr 


Val 


Arg 


Asn 


Pro 


Asp 


Asn 


Leu 


Phe 


Ser 


He 


Thr 








7.10 








745 










750 






Gly 


hys 


Lys 


He Ser 


Tyr Arn 


Asn 


Gly Val 


Ser 


Phe 


Glu Gly Tyr Gly 






755 








760 










765 








He 


Pro 


Sfir 


Leu Leu 


Gly Tyr 


Met 


Ala 


He 


Lys 


Ty£ 


Leu 


Val 


Val 


^sn 




770 








775 










7Q0 










Thr 


Asp 


Ala 


Pro Lys 


He 


Asp 


Met 


Ser 


l.eu 


V^l 


Gin 


Glu 


Pro 


Tyr 


Ala 


705 








7 90 










7?5 










800 


hys 


Gly 


Thr 


Asn Val 
805 


Ala 


Pro 


Phe 


Pro 


Glu 
810 


Leu 


Vai 


Gly 


He 


Tyr 
815 


Val 


Tvr 


Lys 


Asn 


Uiy Thr 
820 


Phe 


He 


Gly 


Thr 

825 


Gin 


Asp 


Pro 


Ser 


Val 
830 


Thr 


Thr 


Tyr 


Ser 


Val 


Ser Aep 


Gly Thr 


Glu 


Ser 


Asp 


Glu 


Tyr 


Glu 


He 


Lys 


Leu 






835 








840 










845 








Val 


Tyr 
850 


Lys 


Gly Ser 


Gly 


He 
05b 


Ser 


Asn 


Gly 


Val 


Alft 
860 


Gin 


He 


Glu 


Asn 


Asn 


Asn 


Ala 


Val Val 


Ala 


Tyr 


Pro 


Ser 


Val 


Val 


Thr 


Aep Arg 


Phe 


Ser 


865 








870 










875 










880 


He 


hys 


Asn 


Ala His 
885 


Net 


Val 


His 


Ala 


Al^ 
890 


Ala 


Leu 


Tyr 


Ser 


Leu 
995 


Asp 


Gly Lys 


Gin 


Val Arg 


Ser 


Trp 


Asn 




Leu 


Arg 


Affn 


Gly 


Val 


Thr 


Phe 








900 


















910 






Ser 


Val 


Gin 
915 


Gly Leu 


Thr 


Ala 


Gly 
920 


Thr 


Tyr 


Met 


Leu 


Val 
925 


Met 


Gin 


Thr 


Ala 


A.?n 
930 


Gly 


Pro Val 


Ser 


Gin 
935 


Lys 


He 


Val 


Lys 


Gin 
9'tO 











(2 J IHtORIlATIOM FOR SEQ ID NO: 4 64 

ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 670 amino acld.- 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

tii) IIOLECULE TYPE: protein 

(iiij HYPOTHETICAL: YES 

(vi) OKIGIHAL SOURCE: 

(A) ORGAiasll: Porphyromonas gingival Is 

(ix) FEATURE: 

iA) NAIIE/KEY: misc feature 

(B) LOCATION 1, . .670 



(xl) SEQUENCE DESCRl Pl'IOH : SEQ ID HO: 4 64 



Met 


Lys 


Tyr 


Leu 


He 


Arg 


Lou 


Phe 


Leu 


Ser 


Leu 


Met 


Leu 


Leu 


Ser 


Leu 


1 








5 










10 










15 




Trp Thr Gly 


Cys 


Thr 


His 


Glu 


Glu 


Leu 


Sor 


He 


Cys 


Asp 


Gly Glu 


Asn 








20 










25 










30 






Thr 


Leu 


Val 


Leu 


Arg 


Val 


Glu 


Thr 


Gly 


Lys 


Ala 


Pro 


Asn 


Ala 


Arg Ala 






35 










40 










45 








Thr 


Glu 
50 


Pro 


Gly 


Gin 


Gly 


He 

55 


Tyr 


Asn 


Glu 


Asn 


60 


Val 


Gly 


Ser 


He 


Ser 


Val 


Leu 


Phe 


Tyr 


Leu 


Glu 


Gly 


Gin 


Leu 


Arg 


Trp 


Gin 


Val 


Lys 


Ser 


65 










70 










75 










80 


Thr 


Arp 


Tyr 


Gin 


He 
85 


His 


Glu 


Gly 


Ala 


Tyr 
90 


He 


T1- 


Pro 


Val 


Lys 
95 


Glu 


Gin 


/let 


Arg 


Pro 
100 


Leu 


Phe 


Asn 


Gly 


Asn 
105 


Asn 


/Ksn 


Phe 


Ser 


He 
HO 


Tyr 


Val 


Val 


Ala 


Asn 

lis 


Leu 


Asp 


Phe 


Asn 


Ala 
120 


Pre 


Ala 


Thr 


Glu 


Ala 
125 


Ala 


Leu 


Ser 


Gin 


Phe 


Val 


Val 


Clu 


Lys 


Ser 


He 


Glu 


Val 


Ser 


Ser- 


Thr 


Thr 


Ala 


Pro 




130 








135 










140 










Ala 


Asp 


Phe 


Val 


Met 


Leu 


Ala 


His 


Gly 


Asm 


I-ys 


Gin 


He 


Asn 


net 


Ala 


145 










150 










155 










160 


Thr 


Thr 


Glu 


Gly 


Lys 


Leu 


Leu 


Gly 


Asp 


Tyr 


Lya 


Leu 


Lys Arg Val 


Als 










165 










170 










175 




Ala 


Lys 


He 


Arg 


Met 


He 


Lys 


Pro 


Thr 


He 


Asn 


Val 


Gin 


Gly 


Tyr 


Glu 



SUBSTITUTE SHEET (Riile 26) (RQ/AU) 
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180 










185 










190 








Val 


Gly 


Asn 


He 


Gin 


Ala 


Lys 


Phe 


Arg 


Asn 


Sec 


Val 


Thr 


Lys 


Gly 






1 95 










200 










20S 








Phe 


Leu 


Thr 


Thr 


Glu 


Ala 


Gin 


Glu 


He 


Pro 


Ala 


A l 


Ala 


Ser 


Tyr 


Lys 




210 










215 










220 










Thr 


Sec 


Gl li 


1 J t 


Leu 


Asp 


lie 


Ala 


Glu 


Ser 


Ala 


Pro 


Ala 


Asn 


Ser 


lie 


225 










230 










235 










24 0 




Phe 


1 Vr 


Ser 


T- -r 
L i L 


Tyr 


A^r 




i rp 


Thr 


Leu 


Ser 


Thr 


Pro 


Glu 


Lys 










24 5 










250 










255 




rg 


Pro 




Ph- 


Phe 


He 


Me't 


Val 




Phe 


Lys 




Thr 


Gly Gin 


Pro 








260 










265 
















Asp 


sn 


Thr 


Al a 


Lys 


ro 


ryr 


Tyr 


Tyr 


J. 

Hcg 


Val 


Pro 


Leu 


Glu 


Ser 


Gin 
















260 










285 








Asp 


Asn 


Gl n 


Val 


X*ys 


Ser 


A*»n 


Va 1 


eu 


ryr 




Leu 


Asn 


Val 


Lys 


He 




290 




















300 










Glu 


lie 


Leu 


iixy 


Ser 


Leu. 


Gin 


Glu 


Pro 


Glu 


Ala 


Val 


Ser 


Val 


Asn 


Gly 


30S 










310 










315 










320 


Thr 




la 


He 


Giu 


Glu 


rrp 


He 


Leu 


Hi s 


Gin 


Asp 


Ala 


Phe 


Asn 


Leu 










325 










330 










335 




Pro 


Ala 


Thr 


Ai: II 


Tyr 


Leu 


Tl a 

J .L S 


Val 


Glu 


Gin 


xs 


Glu 


He 


Phe 


Met 


Asn 








340 








345 










350 








Val 


Asn 


Thr 


Tyr 


Ser 


Val 


Lys 


Tyr 


Gin 


Thr 


See 


Gln 


Lys 


Pro 


e 






355 










360 










365 










ILe 


Ser 


He 


C51 j\ 


Ser 


Val 


Thr 


Phe 


Ser 


Tyr 


Val 


Ser 


Sec 


Asp 


Gly 




3'/0 










37 5 










380 










Thr 


Gin 


His 


A*n 


Asp 


Leu 


Val 


Ala 


Ser 


Ser 


Scr 


Asp 


Gin 


Tyr 


Pro 


Inn 


385 










390 










3?5 












lie 


Till' 


5er 


Asp 


Acn 


Thr 




He 


He 




Thr 


Ssr 


Lys 


He 


Pro 


V- 










4 05 










410 










415 




Asn 


Asn 


Va 1 


Pro 


Lys 


Lys 


lie 


Val 


Phe 


Glu 


V tt 


Thr 


Asn 


Gly Val 


Ala 


















425 










430 






— 

Gly 








T c 


Val 


Thr 


Val 


Leu 




Tyr 


Pro 


Ala 


Gin 


Phe 


He 














440 










445 








Val 


Asn 


T r 


eu 






Al a 


er 


Ala 


Trp 


Arg 


Pro Asp Gly 


Ser 


Leu 




450 










455 










4 60 










Al a 


Pro 


Gly 




sn 






Ala 


He 


ryr 


His 


Val 


Val 


Val 


Leu 


Val 


4 65 








470 










475 










4 80 


Pro 


Pro 


Glu 


Asn 




Phe 


Giu 


Asp 






Gin 


Thr 




He 


Gly 


Tyr 




















4 90 










455 




Pco 


Pro 


Thr 


Glu 




I le 


er 




His 


Lys 


Lys 


Glu 


Asn 


Asn 


Thr 


X yr 








500 










505 










510 






Pro 


lie 


Val 


Trp 


Ser 


Asp 


Thr 


Asn 


Thr 


Thr 


Lys 


Gin 


Asp 


Leu 


Glu 


Thr 






515 










520 










525 








Sec 


Arg 


Met 


He 


Ser 


Pro 


Ser 


Phe 


Glu 


Leu 


Ala 


Ser 


Gin 


Leu 


Gly 


Ala 




530 










535 










540 










Thr 


LfttJ 


Pro 


Met 


Pro 


Tyr 


Leu 


Glu 


Tyr 


Trp 


Pro 


Gly Thr 


Scr 


Tyr 


Leu 


545 










550 










555 










560 


Leu 


Asp 


Tyr 


Ser 


Gly 


A?n 


Tyr 


Asn 


Asn 


Lys 


Arg 


Tyr 


Ala 


Leu 


Phe 


Asn 










565 










570 










575 




Cys 


Ala 


Phe 


Tyr 


Trp 


Glu 


Lys 


Aig 


Lys 


Val 


Asn 


Am 


Glu 


Glu 


He 


Lys 








580 










53S 










590 






Phe 


Asp 


Asp 


Trp 


Arg 


Leu 


Pro 


Thr 


Glu 


Ala 


Glu 


He Lys 


Leu 


He 


Asp 






595 










600 










60S 








Lys 


Leu 


Gin 


His 


Asn 


Glu 


Gin 


Ser 


Ala 


Val 


Gin 


Ala 


He 


Met 


Thr 


Gly 




610 










615 










620 










Asn 


Tyr 


Tyc 


Trp 


Asp 


Ser 


Tyr 


Ser 


Ala 


Asn 


Gly Ser 


Tyr 


Lys 


Het 


Gin 


625 










630 










635 










640 


Gly 


Gly 


Gly 


Gly 


Gin 


Gly 


Asn 


Ser 


Ser 


Lys 


Ala 


Tyr 


Val 


Arg 


Cys 


Val 










645 










650 










655 




Arg 




Val 


Lyr 


Lys 


Pro 


He 


Arg 


Aep 


Lys 


Lys 


Ser 


Gly 


Lys 












660 










665 










670 







10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



(2) INFORHATION FOR SEQ ID NO: 4 65 

65 (1) SEQ»JEHCE CHARACTERISTICS: 

lA) LEUGTH: 1282 amino acids 
ID) TYPE: amino acid 
(D) TOPOLOGY: linear 

70 (ii) MOLECULE TrPE; protein 

till I HYPOTHETICAL: YES 

(vij ORIGIHAL SOURCE: 
75 (A) ORGAUISH: Porphyromonas gingivalis 



SXJBSTrrUTE SHEET (Rule 26) (RO/AU) 
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10 
15 
20 
25 
30 
35 

* 40 
45 
50 
55 
60 
65 
70 
75 



{iy.) FEATURE : 

(A) MAI-IE/ KEY: mis cofeature 

(B) LOCATIOIJ 1 . . . 1282 



SEQUEHCfc; DESCRIPriOU: SEQ ID NO: 465 



Met 


Arg 


Lye 


Tic 


Leu 


Ser 


Phe 


Leu 


Met 


Het 


f~ys 


Ser 


Leu 


His 


Leu 












5 










10 










15 




Leu 


Gl n 


Ser 


Gin 


Thr 


Trp 


His 


GLv 
v» J. y 


Asp 


Pro 




Sc r 


Val 


Ala 


Ala 


Leu 








20 










25 










30 






Pro 


Ser 


lie 


Gly 


T.lG 


Gin 


Clu 


Ser 


Ser 


cys 


Thr 




He 


Thr 


Phe 


"lu 






3b 










4 0 










4 5 








Val 


Val 


Phe 


Pro 


Gly 


Fhe 


Tyr 


See 


Val 


Glu 


Lys 


Arg 


Glu 


Gly Asn 






50 










55 




















Val 


Phc 


G]n 


Arg 


lie 


Ser 


Met 


Pro 


Gly Cys 


Gly 




Phe 


Gly Asn 


Leu 


65 










70 










75 










SO 


Giy 


Glu 


Ala 


Glu 


Leu 


Pro 


Val 


Leu 


Lys 


l.ya 


Met 


He 


Ala 


Val 


Pro 


Glu 










95 










90 














The 


Ser 


Thr 


Ala 


Asn 


Val 


Ala 


Val 


Lys 


He 


Lys 


Glu 


Thr 


Glu 


Thr 


Phe 








100 










105 










110 






Aip 


A^^-ii 


Tyr 


Acn 


lie 


Tyr 


Pro 


Asn 


Pro 


Thr 


Tyr 


Val 


Vai 


Glu 


Glu 


Leu 






lib 










120 










125 








Pro 




Gly 


Gly 


Thr 


Tyr 


Leu 


Val 


Glu 


Ala 


Phe 


Alo 


He 


Asn 


Asn 


Asp 




130 










135 










14 0 










Tyr 


Tyr 


Ser 


Gin 


ASI! 


Val 


Ser 


Leu 


Pro 


Ser 


Thr 


His 


T^-r 


Val 


Tyr 


Ser 


14S 










IbO 










1 55 












Gin 


As-p 


Gly 


Tyr 


Phe 


Arg 


Ser 




Arg 


Phe 


He 


Glu 


Val 


Thr 


Leu 


Tyr 










165 










170 










175 




fro 


the 


Arg 


Tyr 


Aen 


Pro 


Val 


Arg 


Gin 


Gl u 


He 


Leu 




Ala 


Lys 


Lys 








100 










185 










190 






lie 


Glu 


Val 


Thr 


lie 


Thr 


Phe 


Asp 


Asn 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 






1$5 










200 


















Asn 


Thr 


Gly He 


Phe Asn Lys 


Val 


Ala 


flftr 


*jsr 


Ala 




He 


Asn 


Tyr 




210 










215 




















Glu 


Ala 


Asp 


Gly 


Lys 


Ser 


Ala 


He 


Glu 


Asn 


Asp 




Val 


Phe 


Ser 


Arg 


225 










230 










23i> 










240 


Gly Thr 


Thr 


Thr 


Tyi 


I1& 


S=r 


Giy 


Asn 


Val 


Ala 


Ser 


Asn 


Leu 


Pro 


Gin 










24b 










250 










255 




Acrv 


Cys 


Ajsp 


Tyr 


Leu 


Val 


He 


Tyr 


Atfp Asp 


Met 


Phe 


Asn 


Val 


Asn 


Gin 








260 










265 










270 






C51n 


Pro 


His 


Asp 


Glu 


He 


Lys 


Arg 


Leu 


Cys 


Glu 


His 




Ala 


Phe Tyc 






275 










280 


















Acn 


Gly 


Phe 


Asp 


Val 


Ala 


Ala 


Val 


Ser 


Ha 


• 

Lys 


Asp 


Val 


Leu 


Asn 


Ser 




290 










295 




















eh- 


Pro 


Ser 


Asn 


Ala 


Thr 


Scr 


1 yr 


He 


Asn 


Glu 


Thr 




Leu 


Lys 


Asn 


305 










310 










315 










320 


Ph- 


lis 


Arq 


Ser 


Val 


Tyr 


Asn 


Gin 


Ser 


Asn 


Ala 


It J £1 


Ar<3 


Thr 


Leu 


Asp 










325 










330 










335 




Giy 


Lys 


Leu 


Gly 


Tyr 


Val 


Leu 


Leu 


He 


Gly 


Lys 


Pro 


Leu 


Sec 


Lys 


Tyc 








a-io 










34 5 










350 






Leu 


Ala 


Asp 


Thr 


Asp 


Asn 


Thr 


Lys 


Val 


Pro 


Thr 


Ser 


Phe 


He 


His 


Asn 






355 










360 










355 








Val 


Ser 


Leu 


lie 


Pro 


S»r 


His 


Pro 


Thr 


Phe 


Gly 


Ser 


H£ 


Cys 


Ala 


Ser 




370 










375 










380 










Asp 


Tyr 


Phe 


Phe 


Ser 


Cys 


Val 


Ser 


Pro 


Lsu 


Asp 


Thr 


Val 


Gly Asp 


Leu 


385 










390 










395 










400 


Phe 


lie 


Gly 


Arg 


The 


Ser 


Val 


Thr 


Asn 


Ala 


His 


Glu 


Leu 


Kis 


Asn 


Lflu 










405 










410 










415 




lie 


Glu 


Lys 


Thr 


He 


Asn 


Lys 


Glu 


rio 


Ser 


Tyr 


Asn 


Pro 


He 


Ala 


His 








420 










425 










430 






Lys 


Asn 


Ilo 


Leu 


Tyr 


Ala 


Glu 


Gly 


Lys 


Gly 


Cys 


Ai?p 


Ala 


Pro 


He 


Leu 






^35 










440 










445 








Arg 


Lsu 


Phe 


Leu 


Lys 


Glu 


He 


Ala 


Ser 


Gly 


Tyr 


Thr 


Val 


Asn 


Ser 


He 




450 










4 55 










460 










Leu 


Lys 


Ser 


Aen 


Gin 


Val 


Ser 


Ala 


He 


Asp 


Ser 


He 


Phfe 


Ab-p 


C-ys 


Leu 


465 










470 










475 






480 


Asn 


Ajn 


Gly 


Ser 


His 


Hi£ 


Phe 


Tyr 


Phe 


Asn 


Thr 


His 


Gly 


Met 


Pro 


Thr 










485 










490 










495 




Val 


Trp 


Gly 


lie 


Gly 


Gin 


Gly 


Leu 


Asp 


Val 


Asn 


T)ir 


Leu 


Thr 


Ala 


Arg 








500 










505 










510 






Leu 


Asn 


A^n 


Thr 


Ser 


Ser 


Gin 


Gly 


Leu 


cys 


Thr 


Ser 


Leu 


Ser 


Cys 


Ser 






515 










520 










525 






5er 


Ala 


Val 


Ala 


Asp 


Ser 


Thr 


rie 


Arg 


Ser 


Lsu 


Gly 


Glu 


Val 


Leu 


Thr 




530 










535 










540 
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Thr Tyr Ala Pro Asn Lys Gly Phe Ser Ala Phe Lsu Gly Gly Ser Arg 
545 550 555 560 

Ala Thr Gin Tyr Ala Val Tyr Leu GXu Gly Pro Cys Pro Pro Ser Glu 

565 570 575 

Fhe Tyr Gin Tyr Leu Pro Tyr Ser Leo Tyr His Asn Leu Ser Thr Val 

580 585 590 

Val Gly Glu !Wt Leu Leu Ser Ser lie lie Acn Thr Ann Ser Val Asp 

595 600 605 

Thr Tyr Sec Lys Pho Asn Phe Asn Leu Leu Gly Asp Pro Ala Leu Asn 

610 615 620 

lie Met Ala His Gly Met Glu Val Ser Asn Cys lie Thr Leu Pro Asn 
625 630 635 640 

Pisn Thr 11p He Ser Ser Pro lie Thr lie Lys Asn Gly Gly Cys Leu 

6riS 650 655 

Lys He Pro Glu Lys Gly Val Leu His Phe Thr Asn Asn Gly Ser He 

660 665 670 

Gin V^l Met Ser Gly Gly Thr Leu Glu He Gly Asn Gin Ala Lys He 

575 580 685 

Ser Gly Glu TVtr Gly Ala Asn Pro Thr Phe He Thr Val Tyr Gly Asp 

690 6?5 7 00 

Gly Leu Ala' He Asn Lys Gin Val Glu He Asp Asn Ho Asp Arg I.e>i 
705 710 /lb 720 

Asn Leu Phe Set Thr Hie Ser Val Met Pro Lys Phe His Phe Asp Ser 

7:^5 730 735 

Val I.yr^ Phn A.7n Ser Ala Pro Leu Tyr Thr Thr Asn Cys He Val Glu 

7 40 7 4 5 750 

He Ser Asn Cys Glu Phe Thr Asn Arg Ser Asp He He Ser Lys Asn 

755 760 765 

Cys Asp I.eti Ser Val Glu Asn Ser Met Phe Ser Ser Ser Gly He Thr 

770 775 780 

Vai Phe Lys Pro Met Ala Thr Ser Ser He Thr Gly Leu Ser Thr Lys 
785 790 795 BOO 

Ma Lyr He Thr Asp Asn Thr Phe Phe Ala Thr Gly Asn Phs Ala Tyr 

e05 010 815 

ilia He 'Ihr Asn Thr Pro Gly Leu Thr Ala Thr Ser Asn Ala Ala He 

82C 825 830 

Lys Leu Asp Asn He Pro Glu Tyr Tyr He Ser Gly Asn Lys He Val 

835 840 845 

Asn Cys Asp Glu Ala Leu Val Leu Asn Asn Ser Gly Asn Arg Thr Asn 

850 8S5 860 

Arg Leu IJls Asn He Thr Arg Asn Val He Lys Asn Cys Arg He Gly 
865 970 875 880 

Ser Thr Leu Tyr Asn Scr Tyr Gly He Tyr Asn Arg Asn Lys He Ser 

885 890 895 

Asn Asn His lie Gly Val Arg Leu Lsu Asn Asn Ser Cys Phe Tyr Phe 

90n 905 910 

Asp A£i> Ala Pro Val Ha Asn Glu Glu Asp Lys Gin Thr Phe He S^^r 

915 970 925 

Asn Arq Thr Trp Gin Leu Tyr Ser Ser Asn Gly Thr Phe Pre Leu Asn 

930 935 940 

Phe His Tyr Asn Scr Leu Gin Gly Gly Asp Thr Asp Thr Trp Hh Tyr 
945 950 955 960 

Asn Asp Thr Tyf Thr Asn Arg Tyr Ho Asp Val Ser Asn Asn Jlis Trp 

965 970 975 

Gly Asn Asn Asp Leu Phe Asp Pro fKsn Gin Val Phe Asn The Pro Asp 

980 " 985 990 

Leu Ph» He Trp He Pro Phe Trp Asp Gly Leu Pro Asn Gly Arg Ser 

995 1000 1005 

Gly Ain Ser Ser Al^ Glu Ala Val Glu Phe Gin Thr Ala Leu Asp Cys 

1010 1015 1020 

He Gly Asn Set Asp Tyr Leu S=r Ala Lys Val Ala Leu Lys Met Het 
1025 1030 1035 ] 

Val Glu Thr Tyc Pro Glu Ser Asp Phe Ala He Ala Ala Leu Lys Glu 

1045 1050 1055 

Leu Phe Arg He Giu Lye Het Soc Gly Asn Asp Tyr Glu Gly Leu Lys 

1060 1065 1070 

Asp Tyr Phe Arg Ser Asn Pro Thr He He Ser Ser Gin Asn Leu Phe 

1075 1080 1085 

Pro Thr Ala Asp Phe Leu Ser Ala Arg Cys Asp He.. Val Cys Glu Asn 

1090 1095 noo 

Tyr Gin Ser Alo He Asp Trp Tyr Glu Asn Arg Lsu Asn Ser Glu Tie 
lies 1110 1115 1 

Ser Tyr Gin Asp Ser Val Phe Ala Val He Asp Leu Gly Asp He Tyr 

1125 1130 1135 

Trp Asn Met Gin Leu Asp Ser Lc-u Arg Gly Thr Gly He Asp Leu Asn 



SUBSTITUTE SHEET (Rule 26) (RQ/AU) 
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1140 1145 IISO 

Tie Leii Ser Cys Glu Gin Arg Lys Ser Leu GLu Ssr Hi.-? Gin Asn Vai 

1155 1160 1165 

Lys Asn Tyr Leu Leu Ser The Leu Pco Glu Ser Thr Gly Thr Leu Leu 
5 1170 11-75 1180 

Pro Pro Leu Glu Cys Asn Lys Sec Ser Leu Asp hys Ser Lys lie lie 
1185 1190 1195 1200 

Ser lis Sec Pro Asn Pro Ala Lys Ala Val Val Thr lie lie Tyr Tyr 
1205 1210 1215 

10 Thr Asp Asn Pro Sec Cys Ser Val lie Lys lie Tyr Gly lie Afn Gly 
1220 1225 1230 

Ala Ser Ala Asp lie Thr Gly Leu Pro Lyt- His Leu Ser Glu Gly Tyr 

1235 1240 1245 

Tyr Ser lie Gin Phe Asa Thr Scr Asn Phe A*p Pro Gly Phe Tyr Leu 
15 1250 1255 1260 

Vai Thr Leu Asn Val Asp Gin Lys lie lie Aep Thr Glu Lys Leu Arg 
1265 1270 1275 1280 

lie Ly5 



20 



40 



45 



50 



55 



60 



65 



70 



75 



(2) IIIFORilATIOU rOR SZQ ID 110:466 



(1) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 127^ amino acids 
25 (6) TYPE; amino acid 

(D) TOPOLOGY: linear 

(il) MOLECULE TYPE: protein 

30 (iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISli: Porphyr onion as glngivalis 

35 (ix) FEATURE: 

(A) NAtlE/KEY: misc feature 

(B) LOCATTOM 1 . . .1274 



(xi) SEQCJEHCE DESCRIPTIOM: SEQ ID 110:466 



Met 


Met 




Ser 


Leu 


His 


Leu 


Gly 


Leu 


Gin 


Ser 


Gin 


Thr 


Trp 


His 


Gly 


1 








5 










10 










15 




Arp 


Pro 


Asp 


Ser 


Val 


Ala 


Ala 


Leu 


Pro 


Ser 


He 


Gly 


He 


Gin 


Glu 


Ser 






20 










25 










30 






Ser 


eye 


Thr 


Arg 


lie 


Thr 


Phe 


Glu 


Vai 


Val 


Pile 


Pro 


Gly 


Phe Tyr 


Ser 






35 










40 










45 








Val 


Glu 


hys 


Arg 


Glu 


Gly 


Asn 


Gin 


Val 


Phe 


Gin 


Arq 


He 


Ser 


Het 


Pro 




50 










55 










60 










Gly 


Cys 


Gly 


Ser 


Phe 


Gly 


Asn 


Leu 


Gly 


Giu 


Ala 


Glu 


Leu 


Pro 


Vra 


Leu 


65 










70 








75 










eo 


Lys 


Lys 


Met 


lie 


Ala 


Val 


Pro 


Glu 


Phe 


Ser 


Thr 


Ala 


Asn 


Val 


Ala 


Val 










85 










50 










95 




Lys 


lie 


Lys 


Glu 


Thr 


Glu 


Thr 


Phe 


Asp 


Asn 


T-/r 


Asn 


He 


Tyr 


Pro 


Asn 








100 










105 










110 






Pro 


Thr 


Tyr 


Val 


Vai 


Glu 


Glu 


Leu 


Pro 


Glu 


GLy 


Gly 


Thr 


Tyr 


Leu 


Val 






115 










120 










125 








Glu 


Ala 


Phe 


Ala 


He 


Asn 


Asn 


Asp 


Tyr 


Tyr 


Ser 


Gin 


Asn 


Val 


Ser 


Leu 




130 










135 










110 










Pro 


Ser 


Thr 


His 


Tyc 


Val 


Tyr 


Ser 


Gin 


Asp 


Gly 


Tyr 


Phe 


Arg 


Ser 


Gin 


145 










150 








1£5 










160 


Ary 


Phe 


tie 


Glu 


Val 


Thr 


Leu 


Tyr 


Pro 


Phe 


Arg 


Tyr 


Asn 


Pro 


Val 


Arg 










165 










170 










175 




Gin 


Glu 


lie 


Leu 


Phe 


Ala 


Lys 


Lys 


He 


Glu 


Val 


Thr 


He 


Thr 


Phe 


Asp 








180 










185 










190 






A^n 


Pro 


Gin 


Pro 


Pro 


Leu 


Gin 


Lys 


Asn 


Thr 


Gly 


He 


Phe 


Asn 


Lys 


Val 






195 










200 










205 








Ala 


Ser 


Ser 


Ala 


Phe 


He 


Am 


Tyr 


Glu 


Ala 


Asp 


Gly 


Lys 


Ser 


Ala 


He 




220 










215 










220 










Glu 


Aen 


fKsp 


Met 


Val 


Phe 


Ser 


Arg 


Gly 


Thr 


Thr 


Thr, 


Tyr 


He 


Ser 


Gly 


225 










230 










235 










240 


Arn 


Val 


Ala 


Ser 


Asn 


Lgii 


Pro 


Gin 


Asn 


C^-s 


Arp 


Tyr 


Leu 


Val 


He 


Tyr 










245 










250 










255 




Asp Asp 


Met 


Phe 


Asn 


Val 


Asn 


Gin 


Gin 


Pro 


Hi^ 


Asp 


Glu 


He 


Lys 


Arg 








260 










265 










270 






Leu 


cys 


Glu 


His 


Arg 


Ala 


Phe 


Tyr 


Asn 


Gly 


I'he 


Asp 


Val 


Ala 


Ala 


Vai 
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275 280 



Ser 


He 


Lys 


Asp 


Val 


Leu 


Asn 


Ser 


Phe 


Pro 


Ser 


Asn 




290 










295 










300 


He 


Asn 


Glu 


Thr 


Lys 


Leu 


Lys 


Asn 


Phe 


He 


Arg 


Ser 


305 










310 










315 




Ser 


Asn 


Ala 


Lys 


Arg 


Thr 


Leu 


Asp 


Gly 


Lys 


Leu 


Gly 










325 










330 






He 


Gly 


Lys 


Pro 


Leu 


Ser 


Lys 


Tyr 


Leu 


Ala 


Asp 


Thr 








340 










345 








VhX 


Pro 


Thr 


S^;r 


Phe 


He 


His 


Asn 


Val 


Ser 


Leu 


He 






35b 










360 










Thr 


Phe 


Gly 


Ser 


He 


C^'s 


Ala 


Ser 


Asp 


Tyr 


Phe 


Phe 




370 










375 










380 


Pro 


Leu 


Asp 


Thr 


Val 


Gly 


Affp 


Leu 


Phe 


He 


Gly 


Arg 


365 










390 










3?55 






Ala 


His 


Glu 


Leu 


His 


Asn 


Leu 


He 


Glu 


Lys 


Thr 










405 










410 






He 


Ser 


Tyc 


Asn 


Pro 


He 


Ala 


His 


Lys 


Asn 


lis 


Leu 






420 










425 








Lyn 


Gly 


Cys 


Asp 


Ala 


Pro 


He 


Leu 


Arg 


Leu 


Phe 


Leu 




435 










440 










Ser 


Gly 


Tyr 


The 


Val 


Asn 


Ser 


Ilfi 


Leu 


Lys 


Ser 


Asn 




450 










455 










460 


Tie 


Asp 


Ser 


11** 


Phe 


Asp 


Cys 


Leu 


Asn 


Asn 


Gly 


Ser 


•165 










470 










475 




Phe 


A;? 11 


Thr 


His 


Gly 


Met 


Pre 


Thr 


Val 


Trp 


Gly 


He 










485 










4 90 






Asp 


Val 


Aj?n 


Thr 


Leu 


Thr 


Ala 


Arg 


Leu 


Asn 


Asn 


Thr 






500 










505 








Leu 


C>'s- 


Thr 


Ser 


Leu 


Ser 


Cyn 


Ser 


Ser 


Ala 


Val 


Ala 




515 










520 










Arg 


Ser 


Leu 


Gly 


Glu 


Vai 


Leu 


Thr 


Thr 


Tyr 


Ala 


Pro 


530 










535 










540 


Ser 


Ala 


Phe 


Leu 


Gly 


Gly 


Ser 


Arq 


Ala 


Thr 


Gin 


Tyr 


545 










550 










555 




Glu 


Gly 


Pro 


Cys 


Pro 


Pro 


Ser 


Glu 


Phe 


Tyr 


Glu 


Tyr 










565 










57 0 






Ley 


Tyr 


His 


Asn 


Leu 


Ser 


Thr 


Vai 


Val 


Gly 


Glu 


Ket 








580 










585 








He 


He 


Asn 


Thr 


Asn 


Ser 


Val 


Asp 


Thr 


Tyr 


Ser 


Lys 






595 










600 










Leu 


Leu 


Gly 


Asp 


Pro 


Ala 


Leu 


Asn 


He 


Met 


Ala 


His 




610 










615 










620 


Ser 


Asn 


Cys 


He 


Thr 


Leu 


Pro 


Asn 


Asn 


Thr 


He 


He 


625 










630 










635 




Thr 


He 


Ly£ 


Acn 


Gly 


Gly 


Cys 


Leu 


Lys 


He 


Pro 


Glu 










645 










650 






Ills 


Phe 


Thr 


Asn 


Asn 


Gly 


Ser 


He 


Gin 


Val 




Ser 








660 










665 








Glu 


He 


Gly 


Asn 


Gin 


Ala 


Lys 


He 


Ser 


Gly 


Glu 


Thr 






675 










680 










Thr 


Phe 


He 


Thr 


Val 


Tyr 


Gly 


Asp 


Gly 


Leu 


Ala 


He 




690 










695 










700 


Glu 


He 


Asp 


Asn 


He 


Asp 


Arq 


Leu 


Asn 


Leu 


Phe 


Ser 


705 










710 










715 




Het 


Pro 


Lys 


Phe 


His 


Phe 


Asp 


Scr 


Val 


Lys 


Phe 


Asn 










725 










730 






Tyr 


Thr 


Thr 




Cys 


He 


Val 


Glu 


He 


Ser 


Asn 


Cys 
















745 








Arg 


Ser 


Asp 


He 


He 


Ser 


Lys 


Asn 


Cys 


Asp 


Leu 


Ser 






7S5 










760 










Met 


Phe 


Ser 


Ser 


Ser 


Gly 


He 


Thr 


Val 


Phe 


Lys 


Pro 




770 










775 








780 


Ser 


He 


Thr 


Gly 


Leu 


Ser 


Thr 


Lys 


Ala 


Lys 


He 


Thr 


785 










790 










795 




Phe 


Ala 


Thr 


Gly 


Asn 


Phe 


Ala 


Tyr 


His 


He 


Thr 


Asn 










805 










810 






Thr 


Ala 


The 


Ser 


Asn 


Ala 


Ala 


He 


Lys 


Leu 


Asp 


Asn 








82C 










825 








Tyr 


He 


Ser 


Gly 


Asn 


Lys 


He 


Val 


Asn 


Cys 


Asp 


Glu 






835 










840 










Asn 


Asn 


Ser 


Gly 


Asn 


Arg 


Thr 


Asn 


Arg 


Leu 


His 


A^n 




850 










855 










860 


Vai 


He 


Lys 


Am 


Cys 


Arg 


He 


Gly 


Ser 


Thr 


Lsu 


Tyr 


06-5 










870 










875 
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490 



235 








Ala 


Thr 


Ser 


Tyr 


Vai 


Tyr 


Asn 


Gin 








320 


Tyr 


Val 


Leu 


Leu 






335 




Asp 


Asn 


Thr 


Lys 




350 






Pro 


Ser 


His 


Pro 


365 








Ser 


Cys 


Val 


Ser 


Phe 


Ser 


Val 


Thr 








400 


Ho 


Asn 


Lys 


Glu 






115 




Tyr 


Ala 


Glu 


Gly 




430 






Lys 


Glu 


He 


Ala 


445 








Gin 


Vai 


Ser 


Ala 


His 


His 


Phe 


Tyr 








'180 


Gly 


Gin 


Gly 


Leu 






495 




Ser 


Scr 


Gin 


Gly 




510 






Asp 


Ser 


Thr 


He 


525 








Asn 


Lys 


Gly 


Phe 


Ala 


Val 


Tyr 


Leu 








560 


Leu 


Pro 


Tyr 


Ser 






57 5 




Leu 


Leu 


Ser 


Ser 




590 






Phe 


Asn 


Phe 


Asn 


605 








Gly 


Met 


Glu 


Val 


Ser 


Ser 


Pro 


lie 








540 


Lys 


Gly 


Val 


Leu 






655 




Gly 


Gly 


Thr 


Leu 




670 






Gly 


Ala 


Asn 


Pro 


685 








Asn 


Lys 


Gin 


Val 


Thr 


HU 


Ser 


Val 








720 


Ser 


Ala 


Pro 


Leu 






735 




Glu 


Phe 


Thr 


Asn 




7 50 






Val 


Glu 


Asn 


Ser 


765 








Het 


Ala 


Thr 


Scr 


Asp 


Asn 


Thr 


Phe 








800 


Thr 


Pro 


Gly 


Leu 






83 5 




He 


Pro 


Glu 


Tyr 




830 






Ala 


Leu 


Val 


Leu 


845 








Tie 


Thr 


Arg 


Asn 


Asn 


Scr 


Tyr 


Gly 
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n* Tyr Arn Arg Asn Lys llv Sec Aen Asn Hie lie Gly Val Arg Leu 

aaS 890 895 

L^::u Ai>n Asn Ser Cys Phe Tyr Ph* Asp Asn Ala t*ro Val lie Asn Glu 

900 905 910 

Glu Asp Lys Gin The Phe lie S«r Asn Arg Thr Trp Gin Leu Tyr S«r 

?15 920 925 

Ser Asn Gly Thr Phe Pro Leu Asn Ph« His Tyr A^n Ser Leu Gin Gly 

930 935 940 

Gly Asp Thr Pisp Thr Trp lie Tyr Asn Asp Thr Tyr Thr Asn Arg Tyr 
94S 950 955 960 

Tie Asp VsL Ser Asn Asn. His Tcp Gly Asn Asn Asp Leu Phe Asp Pro 

965 970 975 

Asn Gin Val Pht Asn Thr Pro Asp L*ii Phe lie Trp He Pro Phe Trp 

900 985 990 

Asp Gly Leu Pro Asn Gly Arg Sec Gly Asn Ser Ser Ala Glu Ala Val 

996 1000 1005 

Glu fhe Gin Thr Ala Leu Arp Cys lis Gly Asn Ser Asp Tyr Lsu Ser 

1010 1015 1020 

Ala Lys Val Ala Leu Lys Het Met Val Glu Thr Tyr Pro Glu Ser Asp 
1025 1030 1035 1040 

Phe Ala lie Ala Ala Leu Lys Glu Leu Phe Arg Ho Glu Lys Met. Ser 

1045 IC&O 1055 

Gly A£n Asp Tyr Glu Gly J.eu Lys Asp Tyr Phe Arg Ser Asn Pro Thr 

1060 1065 1070 

lie He Ser Ser Gin Asn Leu Phe Pro Thr Alu Asp Phe Lsu Ssr Ala 

1075 lOBO 1085 

Arg Cys Asp He Val Cys Glu Asn Tyr Gin Ser Ala He Asp Trp Tyr 

1090 1095 1100 

Glu Asn Arg Leu Asn Ser Glu He Ser Tyr Gin Afp Ser Val Phe Ala 
1105 1110 1115 1120 

Val He Asp Leu Gly Asp He Tyr Trp Asn Met Gin Leu Asp Ser Leu 

1125 1130 1135 

Arg Gly Thr Gly He Asp Leu Asn He Leu Ser Cys Glu Gin Arg Lys 

1140 1145 1150 

Scr Leu Glu Ser His Gin Asn Val Lys A£n Tyr hitxi Leu Ser Thr Leu 

1155 1160 1165 

Pro Glu Sec Thr Gly Thr Leu TjGU Pro Pro Leu Glu Cys Asn Lys Ser 

1170 1175 1180 

Ser Leu Asp Lys Ser Lys He He Ser He Ser Pro Asn Pro Ala Lys 
1185 1190 1195 1200 

Ala Val Val Thr He Ho Tyr Tyr Thr Asp Asn Pro Ser Cys Ser Val 

1205 1210 1215 

He Lys He Tyr Gly He Asn Gly Ala Ser Ala Asp He Thr Gly Leu 

12:^0 1225 1230 

Pro Lys His Leu Ser Glu Gly Tyr Tyr Ser He Gin Phe Asn Thr Ser 

1235 1240 1245 

Asn Phs Asp Pro Gly Phe Tyr Leu Val The Ltu Asn Val Asp Gin Lys 

1250 1255 1260 

He He Asp Thr Glu Lys Leu Arg He Lys 
1265 1270 

12) IHFORJlATIOri FOR SEQ ID NO: 467 

[i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 925 amino acids 

(B) TYPF.: amino acid 
<D) TOPOLOGY: linear 

(H) irOLECULt TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi| ORIGINAL SOURCE: 

(A) ORGAIIISM: Porphyzomonas gingivalis 

(ix) rEATURE: 

tAJ NAJIE/KEY: misc feature 
tB) LOCATIOM 1. . -925 

(xl) SEQUEKCE DESCRlPTIOtl: SEQ ID NO: 4 67 

Met Ala He Met Met Lys Ser He Val Phe Arg Ala Phe Leu Thr He 

15 10 15 

Leu Leu Ser Trp Ala Ala He Thr Asn Pro Thr Ala Gin Glu He Snr 
20 25 30 
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40 



45 



50 
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60 



65 



70 



75 
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Gly 


Met 


Asn 


Ala 


Scr 


Cys 


Leu 


Ala Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


n e 




35 










40 








■15 








Leu 


Tyc 


Glu 


Ser 


Phe 


Glu 


A^n 


Gly Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Glu 




50 










55 








60 










lie 


Asp 


Ala 


Asp Ala 


Asp Gly 


Ala Thr 


Trp Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


65 










70 








75 










80 


Phe 


Ser 


Val 


Pro 


Tyr 


Gly His 


Asn Gly 


Leu 


cys 


Thr 


Tyr 


Ser 


His 


He 










85 








90 










95 




Arg 


Ser 


Gly 


He 


Ser 


Thr 


Ala 


Gly Asn 


Tyr 


Leu 


Tie 


Thr 


Pro 


Asn 


He 






100 








105 










110 






Glu 


Gly 


Ala 


Lyf 


Arg 


Val 


Lys 


Tyr Trp 


Val 


cys 


Asn 


Gin 


Tyr 


Ser 


Thi 




115 










120 








125 








Asn 


Pro 


'jIu 


His 


Tyr 


Ala 


Val 


Met Vdl 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 














135 








I'lO 










Glu 


Asp 




Val 


Leu 


Leu 


Phe Asp Asp 


S-r 


H- 


Thr 


Gly 


Lys 


Pro 


Thr 


145 










150 








155 










160 


Pro 


Leu 


Val 


Trp 


Arg 


Arg 


Arg 


He Val 


Asp 


Lou 


Pro 


Glu 


Gly 


Thr 


Lys' 










165 








170 










175 




Tyr 


He 


Ala 


Trp 


Arg 


His 


Tyr 


Lys Val 


Thr 


A^p 


Ser 


His 


Thr 


Glu 


Phe 








180 








185 










190 






Leu 


Ly.-? 


Leu 


Asp 


Asp 


Val 


Thr 


Val Tyr 


Arg 


Ser 


He 


Glu 


Gly Pro 


Glu 






195 










200 








205 








Pro 


Ala 


Thr 


Afp 


Phe 


Thr 


Val 


He Asn 


He 


Gly 


Gin 


Asn 


Val 


Gly Arg 




210 










215 


















L-u 


Thr 


Trp 


Asn 


Tyr 


Vro Glu Affp Tyr 


Glr. 


Pro 


Glu 


Gly 


Lys 


Gly 


Asn 


225 










230 








235 










240 


Glu 


Glu 


Leu 


Gin 


Leu 


Ser 


Gly 


Tyr Asn 


He 


Tyr 


Ala 


Asn 


Gly Thr 


Leu 










245 








250 










255 




Leu 


Ala 


Gin 


He 


Lys 


Asp 


Val 


Ser He 


Leu 


Glu 


Tyr 


Val 


A:?p 


Ser 


Thr 








260 








265 










270 






Tyr 


Sar 


Leu 


Arg Asp Asn 


Pro 


Leu Gin 


Val 


Glu 


Tyr 


Cys 


Val 


Tlir 


Ala 






275 










280 








285 








Val 


Tyr 


Asp 


Glu 


Ser 


He 


Glu 


Ser- Ser 


Thr 


Val 


Cys 


Gly 


Thr 


Leu 


His 




290 










295 








300 










Tj-r 


Ala 


Thr 


Arp 


Ala 


He 


Leu 


Tyr Glu 


Asn 


Phe 


Glu 


Asn 


Gly 


Pro 


Vdl 


305 










310 








315 










320 


Pro 


Asn 


Gly 


Trp 


Leu 


Val 


He Asp Ala 


Asp 


Gly 


Asp 


Gly 


Fhe 


Ser 


Trp 










325 








330 










335 




Gly 


His 


Tyr 


Leu 


Asn 


Ala 


Tyr 


Asp Ala 


Phe 


Pro 


Gly 


His 


Asn 


Gly Gly 






340 








345 










350 






His 


Cys 


Ser 


Leu 


Ser 


Ala 


Ser Tyr Val 


Pro 


Gly 


He 


Gly 


Pro 


Val 


Thr 






355 










360 








365 








Pro 


Asp 


Asn 


Tyr 


Leu 


He 


Thr 


Pro Lys 


Val 


Glu 


Gly 


Ala 


Lys 


Acg 


Val 




37 0 










375 








380 










Lys 


Tyr 


Trp 


Val 


Ser 


Thr 


Gin 


Asp Ala 


Asn 


Trp 


Ala 


Ala 


Glu 


His 


Tyr 


385 










390 








395 










400 


Ala 


Val 


net 


Ala 


yer 


Thr 


Thr 


Gly Thr 


Ala 


Val 


Gly 


Asp 


Phe 


Val 


He 










4 05 








410 








415 




Lau 


Fhe 


Glu 


Glu 


Thr 


Met 


Thr 


Alo Ly.e 


Pro 


Thr 


Gly 


Ala 


Trp 


Tyr 


Glu 








420 








425 










430 






Arg 


Tiir 


He 


Asn 


Leu 


Pro 


Glu 


Gly Thr 


Lys 


Tyr 


He 


Ala 


Trp Arg 


Hl£: 






435 










440 








445 








Tyr 


Asn 


Cys 


Thr 


Asp 


He 


Tyr 


Phe Leu 


Lys 


Leu 


Asp 


Asp 


He 


Thr 


Val 




450 










455 








460 










Ph« 


Gly 


TKr 


Pre 


Ala 


Ser 


Glu 


Pro Glu 


Pro 


Val 


The 


Asp 


Phe 


Val 


Val 


4 65 










470 








475 










480 


Sec 


Leu 


11^ 


Glu 


Asn 


Asn 


Lys 


Gly Arg 


Leu 


Lys 


Trp 


Asn 


Tyr 


Pro 


Asn 










485 








490 










4 95 




Gly 


Tyr 


Gli: 


Pro 


Asp 


Lys 


Thr 


Asp Asp 


Lys 


Asp 


Pro 


Leu 


Gin 


T.-u 


Ala 








500 








505 










510 






Gly 


Tyr 


Afn 


He 


Tyr 


Ala 


Asn 


Gly Ser 


Leu 


Leu 


Val 


His 


He 


Gin 


Asp 






515 










520 








525 








Pro 


Thr 


Val 


Leu 


Glu 


Tyr 


He 


Asp Glu 


Thr 


Tyr 


Ser 


Ser 


Arg 


Asp 


Asp 




530 










535 








540 










Glti 


Val 


Glu 


Val 


Glu 


Tyr 


cys 


Val Thr 


Ala 


Val 


Tyr 


Asn 


Asp 


Asn 


Ho 


545 










550 








555 










560 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Asp 


Lys Leu 


Tie 


Tyr 


Asp 


Ser 


Gin 


Ser 


Asp 










565 








57 0 










57 5 




lie 


He 


Leu 


Tyr 


Glu 


Gly 


Phe 


Glu Ala 


Gly 


Ser 


He 


Pro 


Glu Gly Trp 








580 








S85 










590 






Leu 


Leu 


He 


Asp Ala Asp Gly 


Asp Asn 


Val 




Trp 


Asp 


Tyr 


Tyr 


Pro 






595 










600 








605 








Trp 


Thr 


Met 


Tyr 


Gly 


His 


Asp 


Ser Glu 


Lys 


Cys 


He 


Ala 


Ser 


Pro 


Ser 




610 










615 








620 










Tyr 


Leu 


Pro 


Met 


He 


Gly 


Val 


Leu Thr 


Pro 


Asp 


Asn 


Tyr 


Leu 


Val 


Thr 
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625 










630 










635 








64 0 


Pro 


Arg 


Leu 


Glu 


Gly Ala 


Lys 


Leu 


Val 


Lys 


Tyr 


Trp 


Val 


Ser Ala 


Gin 


















650 








655 




Asp 


Ala 


Val 


Tyr 


Ser 


Ala 


Glu 


His 


Tyr 


Ala 


Val 


HeL 


Val 


Ser Thr 


Thr 






660 










665 










670 




Gly 


Thr 


Ala 


Val 


Glu Afp 


Phe 


Val 


Leu 


Leu 


Phe 


Glu 


Glu 


Thr Met 


Thr 




675 










680 










685 






Ala 


Lys 


Ala 


Asn 


Gly Ala 


Trp 


Tyr 


Glu 


Arg 


Thr 


He 


Thr 


Leu Pro 


Al a 




690 










695 










700 








Gly 


Thr 


Lys 


Tyr 


He 


Ala 


Trp 


Arg 


His 


Tyr 


Arp 


Cys 


Thr 


Asp Met 


Phe 


7 0S 










710 










715 








720 


Phe 


Leu 


Leu 


Leu 


Asp 


Asp 


He 


Thr 


Val 


Ti'r 


Arg 


Ser 


Thr 


Glu Thr 


Val 










725 










73G 








735 




Pro 


Glu 


Pro 


Val 


Thr Asp 


Phe 


Val 


Val 


Ser 


Leu 


He 


Glu 


Asn Asn 


Lys 








740 










745 










750 




Gly 


Arg 


Leu 


Lys 


Trp Asn 


Tyr 


Pro 


Asn 


Gly 


Tyr 


Glu 


Pro 


Asp Lys 


Thr 






755 










760 










765 






Asp 


Asp 


Lys 


Lys 


Pro 


Leu 


Gin 


Leu 


Thr 


Gl y 


Tyr 


A^sn 


He 


Tyr Ala 


Asn 


770 










77b 










780 








Glv Ser 


Leu 


Leu 


Val 


His 


He 


Gin 


A5p 


Pro 


Thr 


Val 


Leu 


Glu Tyr 


He 


765 










790 










795 








800 


Asp 


Glu 


•rhr 


Tyr 


Ser 


Ser 


Ary 


Asp 


Gly 


01 n 


V?%1 


Glu 


Het 


Glu Tyr 


Cys 








805 










910 








615 




Val 


Thr 


Ala 


Val 


Tyr 


Asn 


Asp 


Asn 


He 


Glu 


Sec 


Gin 


Ser 


Val Cys 


Asp 








82C 










825 










830 




hys 


Leu 


A^n 


Tyr 


Thr 


He 


Thr 


Ser 


Leu 


A5P 


Am 


He 


Gin 


Ser Asp 


Thr 




835 










8*10 










845 






Ser 


Leu 


Ly^ 


He 


Tyr 


Pro 


Pisn 


Pro 


Ala 


Ser 


Tyr 


Val 


Val 


Arg He 


Glu 




850 








855 










860 








Gly 


Leu 


Ser 


Arg 


Ser 


Lys 


Ser 


Thr 


lie 


Glu 


Leu 


Tyr Asn 


Ala Leu 


Gly 


665 








870 










875 








980 


He 


cys 


He 


Leu 


Arg 


Glu 


Glu 


Thr 


His 


Sec 


Glu 


Lys 


Thr 


Glu He 


Asp 








885 










890 








995 




Val 


Ser 


Arg 


Leu Asri Asp 


Gly 


Val 


Tyr 


Leu 


He 


Lys 


Val 


Val Gly 


Gly 








9 GO 










905 










910 




Am 


hys 


Thr 


Thr 


Thr 


Glu 


Lys 


Val 


Glu 


He 


Lys 


Arg 


Pro 








915 










920 










925 







(2) INFORI'lATIOM FOR SEQ ID NO: 4 68 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEriGTH: 922 amino acids 
(B> TYPE: acaino acid 
\0) TOPOLOGY: linear 

(iil IIOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A| ORGANISM: Porphyromonas gingivalis 



(ix) FEATURE: 

{A) NAUE/KEY: misc feature 
IB) LOCATIOU 1. . .922 
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SEQUEHtr. DESCRIPTION: SEQ ID 110:468 










Met 


Met 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


He 


Leu 


Leu 


Ser 


1 








5 








10 










15 




Trp Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly Met 


Asn 








20 










25 










30 






Ala 


Ser 


Cys 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu 






35 










40 










45 








Ser 


Phe 


Glu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Trp 


Leu 


Giu 


He 


Asp 


Ala 




50 










55 










60 










Asp 


Ala 


Asp 


Gly 


Ala 


Thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly Ser 


Phe 


Ser 


Val 


65 








70 










75 










80 


Pro 


Tyr 


Gly 


His 


Asn 


Gly 


Leu 


eye 


Thr 


Tyr 


Ser 


Jlis 


He 


Arg 


Ser 


'31 y 










85 










90 










95 




He 


Ser 


Thr 


Ala 


Gly 


Asn 


Tyr 


Leu 


lift 


Thr 


Pro 


Asn 


He 


Glu 


Gly 


Ala 








100 








105 










110 






Lys 


Arg 


Val 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu 






115 










120 










125 








His 


Tyr 


Ala 


Val 


Met 


Val 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu 


Asp 


Phe 
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10 



15 



20 



25 



30 



35 



40 



45^ 



50 



55 



60 



65 



70 



75 



Val Leu L.U Fhe Asp A.p S^r He Thr Gly L,^. p^o The Fro Leu Val 

Trp Arg Arg Arg lie Val Asp Leu Pro Glu Gly Thr Lys Tyr He A^a 

TCP Arg Hi^ Tyr Lys Val Thr Asp Ser ^iL^ Thr Glu Phe L^u 12 L«u 

^ 185 1 qn 

A.p A.P Val Thr vaa Tyr Arg Ser Xle Glu GXy Pro Glu Pro Ala Thr 

200 

A.p Phe Thr V.l He A.n Ue Gly Gin Asn Val Gly Arg L,u Thr Trp 

A.n Ti-r Pro Glu A,p Tyr gL Pro Glu Gly Ly. Gly A.n Glu Glu Leu 

Gin Leu Ser Gly Tyr A.n He Tyr Ala Asn ^ly Thr Leu Leu Ala G^n 

lie Ly. A.P Val S=r He Leu Glu Tyr "l A=p Ser Thr Tyr Ser Leu 

265 njo 
Arg A^p A»n Pro Leu Gin v,l Glu Tyr Ci* Val Thr Ala Val Tyr A^p 

Glu Ser lie Glu Ser Ser Thr Val" Cy. Gly Thr Leu hL- Tyr Ala Thr 

A..p Ala lie Leu Tyr Glu Asn Phe Glu A^n Gly Pro Val Pro A^n Gly 

Trp Leu Val He A.p Ala Asp Gly A«p Gly P^e Ser Tro Gly Hi. Tyr 

Leu A=n Ala Tyr Asp Ala Phe Pr-^ Gly His Asn Sly Gly Hl^ c^s Ser 

Leu Ser Ala Ser Tyr Val Pro Gly He Gly Pro Val Thr Pro A»n Asn 

" 360 3gt 

Tyr Leu He Thr Pro Lys Val Gl,. Gly Ala Lys Arg Val Lys Tyr Trp 

- j75 380 

val ser Thr Gin Asp Ala Asn Trp Ala Ala Glu Hi. Tyr Ala Val l.et 

Ala Ser Thr Thr Gly Thr Ala Vnl Gly Asp Phe Val He Leu Phe G^u 

Glu Thr .,et Thr Ala Lys Pro Thr Gly Ala Trp Tyr Glu Ar, ^^r He 

Asn Leu Pro Glu Gly Thr Lys Tyr He Ala Trp Arg His Jyr Asn Cys 

Thr A|p He Tyr Phe Leu Lys Leu Asp Asp He Thr Vaf Phe Gly Thr 

Pro Ala Ser Glu Pro Glu Pro Val Thr A-sp Phe vtl Val ser Leu He 

Glu Asn Asn Ly. Cly Arg Leu Lys Trp Asn ^yr Pro Asn Gly Tyr g!u 

Pro ASP Lys Thr Asp Asp Lys Asp Pro Leu cm Leu Ala Gly Tyr Asn 

He Tyr Ala Asn Gly Ser Leu Leu ^al His He Gin Asp P^o Thr Val 

Leu Glu Tyr He Asp Glu Thr Tyr Ser Ser Arg Asp Asp Gin Val Glu 

val Glu Tyr Cys Val Thr Ala Val Tyr As„ Asp ^n He Glu Ser Gin 

Ser val Cys Asp Lys Leu 11= Tyr Asp Ser Gin Ser Asp He He Le2 

Tyr Glu Gly Phe Glu Ala Gly Ser He Iro Glu Gly Trp Leu feu He 

SOS <;cT 
Asp Ala Asp Gly Asp Asn Val Asn Trp Asp Tyr Tyr Pro Trp Thr H-t 

600 

Tyr Gly His Asp Ser Glu Lys Cys He Ala Ser Pro Sor Tyr Leu Pro 

Het He Gly V.l Leu Thr Pro Asp Asn Tyr Leu vf i Thr Pro Arg Leu 

Glu Gly Ala Lys Leu Val Lys Tyr Trp Val til Ala Gin Asp Ala Va? 

Tyr ser Al. Glu His Tyr Ala Val Met Val Ser Thr Thr Gly Thr Ala 

Val Glu Asp Phe Val Leu Leu Phe Glu Glu Thr Her Thr All Lys Ala 

680 — fiR ^ 

Asn Gly Al« Trp Tyr Glu Arg Thr He Thr Leu Pro Ala Gly Thr Lys 

Tyr He Ala Trp Arg His Tyr Asp c^-s Thr Asp Ket Phe Phe Leu Leu 
Leu Asp ASP He Thr Val Tyr Arg Ser Thr Thr Val Pro Glu "o 

■'30 735 " 
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Va 1 


Thr 


Asp 


Phe 
740 


Val 


Val 


Ser 


Lgu 


He 
7 '1 5 


Glu 


Asn 




Lys 


Gly 
750 


Arg 


Leu 


Lys 


Trp 


Asn Tyr 


Pro 


Asn 


Gly 


•i'yr 


Giu 


Pro 


Asp 


Lys 


Thr 


Asp 


Asp 


Lys 






755 










7S0 










765 








LV^ 


Pro 


Leu 


Gin 


Leu 


Thr 


Gly Tyr 


Asn 


He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 




770 










775 










780 










Leu 


Val 


His 


rie 


Gin 


Asp 


Pro 


The 


Val 


Leu 


Glu 


Tyc He Asp Glu Thr 


lab 










7 90 










795 










800 


Tyr 


S*r 


S-r 


Aig 


Asp 


Gly Gin 


Vol 


Glu 


liet 


Giu 


Tyr 


Cys 


Val 


Thr 


Ala 










005 










810 










815 




Vol 


Tyr 


Acn 


Asp Asn 


11- 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Af p 


Lys 


Leu 


Asn 








820 










825 










830 






Tyr 


Thr 


lie 


Thr 


Ser 


Leu A^p Asn 


Tic 


Gin 


Ser 


Asp 


Thr 


Ssr 


Leu 


Lys 






835 




















845 








He 


Tyr 


Pro 


Asn 


Pro 


Ala 


Ser 


Tyr 


Val 


Val 


Arg 


He 


Glu 


Gly 


Leu 


Ssr 




850 










855 










860 








Arg 


5*r 


Lyy 


Ser 


Thr 


lie 


Glu 


Leu 


Tyr 


A.Tn 


Ala 


Leu 


Gly 


He 


Cys 


He 


865 










870 










875 










8B0 


Leu 


Ac'4 


Giu 


Glu 


Thr 


His 


Ser 


Glu 


Lys 


Thr 


Glu 


He 


Ai-p 


Val 


Ser 


Arg 










885 










890 










895 


Levi 


A^n 


Asp Gly Val 


Tyr 


Leu 


He 


Lyr 


Val 


Val 


Gly 


Gly 


Asn 


Lys 


Thr 








900 










?05 










910 






Thr 


Tlir 


Glu 
915 


Lys 


Val 


Glu 


He 


Lys 
920 


Arg 


Pro 















t2J rilFORttATIOH FOR SEQ ID NOt469 



U) SEQUEJJCE CHARACTERISTICS: 

(A) LENGTH: 921 amine acids 
30 (D) TVPfc): amino acid 

(D) TOPOLOjY: linear 

Uil MOLECULE TYPE: protein 

35 (111) HYPOTHETICAL: YES 

Jvil ORIGINAL SOCtRCE: 

(A) ORGAHISM: Porphyromonas gingivalis 

40 t Ix ) FEATURE : 

(A) MAHE/KEY: mlsrt_feature 
(D) LOCATION 1. . .921 



t>:i) SEQUENCE DESCRTPTIOt^ : SEQ ID HO; 465 



tie-L 


Lys 


Ser 


He 


Val 


Phe 


Arg 


Ala 


Phe 


Leu 


Thr 


T\e 


Levi 


Leu 


Ser Trp 


1 








5 










10 










15 


Ala 


Ala 


He 


Thr 


Asn 


Pro 


Thr 


Ala 


Gin 


Glu 


He 


Ser 


Gly Het Asn A.T;^ 








20 










25 










30 




Ser 


cys 


Leu 


Ala 


Ala 


Pro 


Ala 


Gin 


Pro 


Asp 


Thr 


He 


Leu 


Tyr 


Glu Set 






35 










40 










45 






Phe 


Giu 


Asn 


Gly 


Pro 


Val 


Pro 


Asn 


Gly 


Tip 


Leu 


Glu 


He Asp Ala Asp 




50 










55 










60 








Ala 


Asp 


Gly Ala 


thr 


Trp 


Gly 


Ser 


Pro 


Ser 


Gly 


Ser 


Phe 


Ser 


Val Pro 


65 










70 










75 








80 


Tyr Gly His Asn 


Gly Leu 


Cys 


Thr 


Tyr 


Sec 


His 


He 


Arg 


Ser 


Gly He 










85 










90 










95 


Ser 


Thr 


Aia 


Gly 


Asn 


Tyr 


Leu 


He 


Thr 


Pro 


Asn 


lie 


Glu Gly Ala l.ys 








100 










105 










110 




Arg 


Va I 


Lys 


Tyr 


Trp 


Val 


Cys 


Asn 


Gin 


Tyr 


Ser 


Thr 


Asn 


Pro 


Glu His 






H5 










120 










125 






Tyr 


Aia 


Val 


Met 


Val 


ser 


Thr 


Thr 


Gly 


Thr 


Ala 


He 


Glu Asp 


Phe Val 




1-^0 










IBS 










140 








Leu 


Leu 


Phe 


A3p 


Asp 


Ser 


He 


Thr 


Gly 


Lys 


Pro 


Thr 


Pro 


Leu 


Val Trp 


145 










ISO 










155 








160 


Arg 


Arg 


Arg 


He 


Val 


Asp 


Leu 


Pro 


Glu 


Gly 


Thr 


Lys 


Tyr 


He 


Al3 Trp 










165 










170 










175 


Arg 


Hi:? 


Tyr 


Lys 


Val 


Thr 


JKsp 


Ser 


His 


Thr 


Glu 


Phe 


Leu 


Lys 


Leu Asp 








IBO 










185 










190 


Asp 


Val 


Thr 


Val 


Tyr 


Arg 


•Ser 


He 


Giu 


Gly 


Pro 


Glu 


Pro 


Ala 


Thr Asp 






195 










200 










205 






Phe 


Thr 


Val 


He 


Asn 


He 


Gly 


Gin 


Asn 


Val 


Gly Arg 


Leu 


Thr Trp Asn 




210 










215 










220 








Tyr 


Pro 


Glu 




Tyr 


Gin 


Pro 


Giu 


Gly 


Lyc 


Gly 


Asn 


Glu 


Glu 


Leu Gin 


225 










230 








235 








240 
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Gly 


Tyr 


Asn 


He 


Tyr 


Ala 


Asn 


Gly 


Thr 


Leu 


Leu 


Ala 


Gin 


He 










245 










2 50 










255 






A5"P 


Val 


Ser 


He 


Leu 


Glu 


Tyr 


Val 


Asp 


Ser 


Thr 


Tyr 


Ser 


Leu 


Arg 








260 










265 










270 






Asp 


Asn 


Pro 




Gin 


Val 


Glu 


Tvr 


Cys 


Val 


Thr 


Ala 


Val 


Tyr 


Asp 


Gl u 




27 5 










280 










235 








Sei" 


lie 


Glu 


Ser 


Ser 


Thr 


Val 


Cys 


Gly 


Thr 


Leu 


His 


Tvr 


Ala 


Thr 


ASD 




290 










295 










300 










M d 


T 1 f: 


Leu 


T'-r 


Glu 


Asn 


Phe 


Glu 


Asrv 


Gly 


Pro 


Val 


Pro 


AkSn 


Gly 




305 










31 Q 










315 










320 




Val 


He 


Asp 


Ala 


A*.p 


Gly 


Asp 


Gly 


Phe 


Ser 


Trp 


Gly 


His 


Tyr 












325 










330 










335 




Asn 


A 1 .1 


Tvr 


Ajd 


Ala 


Phe 


Pro 


Gly 


His 


Asn 


Gly 


Gly 


His 


Cys 


Sor 


Leu 








3"! 0 










345 










350 






Ser 


Ala 


Se r 


Tyr 


Val 


Pro 


Glv 


I le 


Glv 


Pro 


Val 


Thr 


Pro 


Asp Asn 








355 










360 










365 










lift 


Thr 


Pro 


Lvs 


Val 


Glu 


Glv 


Ala 


Lys 


Aro 


Val 


L> s 


Tyr Trp 


VaJ 




37 0 










375 










380 










Set 


Thr 


J ri 


sp 


Ala 


AsT\ 


Trn 
1 rp 


Ala 


Ala 


Glu 


His 


T"-r 


Al 3 


Val 


Met 


Ala 


30S 










390 










3 95 










400 


Ser 


T X 






Thr 


Ala 


Val 




i^ip 


Phe 


Val 


He 


_ 


Phf- 


Glu 


Glu 










405 










410 










415 




Thr 




Tnr 


It! - 


Lys 




Th r 


ly 


Ala 


Trp 


fj r 




Arg 


Thr 


He 


Asn 








4 20 










425 










430 






r.cii 


Pre 


Glu 


Gly 


tK 


Lyff 


i yr 


He 


Ala 


trp 


Arg 


Hie 


Tyr 


Asn 


Cy« 


Thr 






4 35 










4 4 0 










4 45 








Asp 


11 - 


Tyr 








L»u 


Asp 


sp 


He 


Thr 


Va 1 


Ph^ 


Gly Thr 


Pro 




4 50 










^55 




















Ala 


Ser 


Glu 


Fro 


OlU 


P 


Val 


Thr 


Asp 




VqI 


Va 1 


ser 


Leu 


He 




•1 65 










470 










'175 










480 


Asn 




L"s 


iji.y 




Leu 




1 rp 


Asn 


I yr 


Pro 






Tyr Glu 


P ro 










4 85 




















495 




Asp 


Lys 


Thr 


Arp 


Asp 


Lys 


Asp 


Pro 


L"U 


Gl n 


Leu 


Al a 


Gly 


Tyr 


Asn 


He 








500 










50S 










510 






Tyr 


Ala 


c 


Gly 


Ser 


Leu 


Leu 


Val 


His 




Gin 


Asp 


Pro 


Thr 


Val 






51 5 










520 










525 










Tyr 


lie 


Asp 




Thr 




Ser 


Soc 


Arg 


Aso 




Gin 


Val 


Glu 


Val 




^'^^ 










535 










54 0 










Glu 


Tyr 




Val 


Thr 


Ala 


Val 


Tyr 


As 


A* 
sp 


Asn 


He 


Glu 


Ser 


Gin 


Ser 










550 










555 










560 


Val 


Cys 


Asp 




L»u 


He 


xyr 


Asp 


Ser 


Gin 


Ser 


Asp 


He 


He 


Leu 


Tyr 










565 










570 










575 




Gl u 


Gly 


Ph& 


Glu 


Ala 


Glv 


Ser 


He 


Pro 


Glu 


Glv 


I rp 


Leu 


Leu 


He 


ASD 








580 










583 










590 






Ala 


A'p 


Gly 


Asp 


Asn 


Val 


Asn 


Trp 


Asp 


Tyr 


Tyr 


Pre 


Trp 


Thr 


Met 


Tyr 






595 










600 










605 








Gl"" 


Hia 




Ser 


Glv) 




Cys 


He 


Ala 


Ser 


Pro 


Ser 


Tyr 


Leu 


Pro 


M«L 




610 










615 










620 










lis 


jly 


Val 


Leu 


Thr 


Pro 


Asp 


Asn 


T'T 


Leu 


Val 


Thr 


Pro 


Arg 


Leu 


Glu 


625 










630 










635 










640 


Gly 


Ala 


I 


Leu 


Val 


Lv* 


i. J r 


1 rp 


Val 


Ser 


Ala 


Gin 


As 


Ala 


Val 


Tyr 










645 










550 










655 




Ser 


Ala 


Glu 


Hi5 


Tyr 


Ala 


Val 


Met 


Vol 


Ser 


Thr 


Thr 


Gly 


Thr 


Ala 


Val 








660 










665 










670 






Glu 


Asp 


Phs 


Val 


Leu 


Leu 


Phe 


Glu 


Glu 


Thr 


net 


Thr 


Ala 


Lys 


Ala 


Asn 




675 










680 










685 








Gly 


Ala 


Trp 


Tyr 


Glu 


Arg 


Thr 


He 


Thr 


Leu 


Fro 


Ala 


Gly 


Thr 


I.ys 


Tyr 




690 










695 










700 










Tin 


Ala 


Trp 


Arg 


His 


Tyr 


Asp 


Cys 


Tlir 


Acp 


Met 


Phe 


Phe 


Leu 


Leu 


Leu 


70S 










710 










715 










720 


Asp 




He 


Thr 


Val 


Tyr 


Arg 


S=r 


Thr 


Glu 


Thr 


Val 


Pro 


Glu 


Pro 


Vai 










725 










730 










735 




Thr 


T\ep 


Phe 


Val 


Val 


Ser 


Leu 


He 


Glu 


Asn 


Asn 


Ly3 


Gly 


Arg 


Leu 


Lys 








740 










74t. 










750 






Trp 


Asn 


Tyr 


Pro 


Asn 


Gly 


Tyr 


Glu 


Pro 


Asp 


Lys 


Thr 


Atip 


Asp 


Lys 


Lys 






7b5 










760 










7 65 








Pro 


Leu 


Gin 


Leu 


Thr 


Gly 


Tyr 


Ai-n 


.He 


Tyr 


Ala 


Asn 


Gly 


Ser 


Leu 


Leu 




770 










775 










7p0 










Val 


His 


He 


Gin 


Aep 


Pro 


Thr 


Val 


L-'J 


Glu 


Tyr 


He- 


Asp 


Glu 


Thr 


Tyr 


705 










790 










795 










bOO 


Scr 


Ser 


Arg 


Asp 


Gly 


Gin 


Val 


Glu 


Met 


Glu 


Tyr 




Val 


Thr 


Ala 


Val 










805 










810 








BIS 




Tyr 


Asn 


Asp 


A;?n 


He 


Glu 


Ser 


Gin 


Ser 


Val 


Cys 


Asp 


Lys 


Leu 


Asn 


Tyr 








870 










B2b 










830 






Thr 


11* 


Thr 


Ser 


Lou 


Asp 


Aa-n 


He 


Gin 


Ser 


AiTp 


Thr 


Ser 


Leu 


Lys 


He 
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835 










840 










845 








Tyr 


Pro 


Asn 


Pro 


Ala 


S*>r 


Tyr 


Val 


Val 


Arg 


He 


Glu 


Gly 


Leu 


Ser 


Arg 


8b0 










855 










860 










S-r 


Lvs 


Ser 


Thr 


He 


Giu 


Leu 


Tyr 


Asn 


M 


t.TMl 


C^ly 


T1& 


Cys 


He 


Leu 


865 




















875 










880 


Axg 


Glu 


Glu 


Thr 


His 
885 


Ser 


Glu 


Lys 


I'hr 


Glu 
890 


lie 


A^p 


Val 


Ser 


Arg 
895 


Leu 


Asn 




Giy 


Val 


Tyr 


Leu 


He 


Lys 


Val 


Val 


Gly 


Gly 




Ly^ 


Thr 


Thr 






900 










905 










910 






Thr 


Glu 


Lvf 
915 


Vai 


Glu 


He 


Lys 


Arg 
920 


Pro 

















(2> IHFOWlATIOti FOR SZQ ID HO: 470 



15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 593 amino acidf 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

20 (11) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 
25 (AJ ORGAIIISH: Porphyromondfc* gingiva lis 

( ix 1 E*EATURE ; 

(A) HA! IE/KEY: misc feat.urft 
ID) LOCATiOt-^ i...S93 



ir.i) SEQUENCE DESCRIPTION: SEQ ID MO: 410 



Met 


Acn 


Ser 


He 


Met 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


net 


Ala 


1 








5 










10 










IS 




Leu 


Ser 


Val 


Ser 


Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 








20 










25 










30 






Lys 


Arg 


Pro 


Asp 


TKr 


Leu 


Arg 


Arg 


Glu 


Leu 


Thr 


He 


Val 


P^n Asp 


Gin 






35 










40 










45 








Thr 


Val 


Glu 


Met 


Glu 


His 


Ala 


Asp 


Pro 


Lfiu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 




SO 










55 










60 










He 


Glu 


Pro 


Arg 


Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


65 










70 










75 








80 


Fhe 


Gly 


Phe 


Val 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 










85 










90 










95 




Acn 


He 


Lftu 


Pro 


Thr 


Glu 


Gly 


His 


Met 


LyE 


His 


Arg 


Gly Tyr 


Leu 


Asn 








100 










10b 










110 






He Gly 


He 


Gly 


His 


Thr 


Leu 


Asn 


Gin 


Arg 


llet 


Asp 


Ala 


Gly 


Tyr 


Arg 






115 










120 










125 








Leu 


n ft 


Asp 


Ala 


Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


T.eu 


Phe 


Leu 


Ser 


Tyr Arg 




130 










135 










140 










Gly Met 


Lye 


Ser 


Ala 


Phe 


Asn 


Thr 


Gly 


Asp 


Phe 


Asp 


Gly At-p Arg 


Ly- 


145 










150 










155 










160 


Asp 


Arg 


Arg 


Met 


net 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 










165 










170 










17 5 




Phe 


Val 


Leu 


Ala 


Thr 


Gly 


Leu 


Tyr 


Tyr 


Ser 


A£n 


His 


Tyr 


Phe 


Asn 


Asn 








180 










185 








ISO 






Tyr 


Gly 


Arg 


Gly 


Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 






195 










200 










205 








Thr 


Pro 


Val 


Thr 


Pro 


Gin 


Met 


Asp 


Asn 


Gly 


Thr 


His 


Ai'n 


Val 


Arg 


Val 




210 










215 










220 










Tyc 


Leu 


Gly 


Ala 


Lys 


Asn 


Asp 


Val 


He 


Asp 


Ala 


Arg 


He 


Asp 


Tyr 


Arg 


225 










230 










235 










240 


Phs 


Phe 


Axg 


Ser 


He 


Pro 


Tyr 


Leu 


Gly 


Thr 


Asp 


Pro 


Met 


Lys 


Ala 


Leu 










;>4 5 










250 










255 




Thr 


Glu 


His 


Thr 


Pro 


Glu 


Leu 


Asn 


Val 


Thr 


Met 


Ser 


A^n 


Glu 


><eu 


Ser 








260 










265 










27 C 






Affp 


Arp 


lit* 


Lys 


Leu 


Gly 


Vai 


Glu 


Val 


Arg 


Thr 


Gly 


Gly 


r.*t: 


Phe 


Fht 






2-75 










280 










285 








Ala 


LyJ 


Aen 


Ser 


Glu 


Met 


He 


Gin 


Thr 


Gly 


val 


Leu 


-4Jer 


Glu 


Thr 


Affp 




290 










295 










300 










Aig 


Aen 


Leu 


T-yr 


Tyr 


Val 


Glu 


Giy 


Ala 


Pro 


Thr 


11« 


Gly 


Phe 


Vat 


Gly 


305 










310 










315 










320 


Asp 


Ser 


Asp 


Asn 


Met 


Gin 


Trp 


Asn 


He 


Gin 


Ala 


Gly 


Val 


Gly 


He 


Ser 










325 










330 










335 




Ser 


His 


Phe 


Gly 


Ala 


Lys 


Gly 


Arg 


Leu 


Phe 


Phe 


Trp 


Pro 


Lys 


I.eu 


Asp 
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34 5 



350 



Ala 


Sec 


Leu 


Sttr 


lie 


Phe 


Pro 


Ser 


Trp 


Arg 


Val Tyr 


Ala 


Lys 


Ala 


Phe 






355 










360 








365 








Gly Gly 


Va 1 


lis 


Acq 


Asn 


Gly 


Leu 


Ala 


Asp 


Vai Itet 


Gin 


Glu 


Glu 


Met 














37 5 








380 












Tyr 


Leu 


Met 


?xo 


Asn 


Thr 


He 


Val 


Leu 


Pro Ser 


Arg 


Asn 


Ala 


Leu 


385 










390 










395 








400 


Thr 


Ala 


Gin 


Leu 


Gly 


Val 


Lys 


Gly 


Asn 


He 


Ala Asp Val 


Val 


Arg 


net 










405 










410 








415 




Glu 


Val 


Tyt 


Gly 


Asp 


Phe 


Ssr 


Lyr 


Leu 


Thr 


Gly Val 


Pro 


Phe 


Tyr 


Thr 








420 










425 








430 






Pro 


The 


Leu 


Pro 


Leu 


Tyr 


Asn 


Pro 


Ser 


Aep 


Leu Tyr 


Gin 


Tyr 


Asn 


Val 






435 










440 








445 








S-ar 


Phe 


Leu 


Pro 


He 


Tyr 


Ala 


Asp 


Gly 


Ser 


Arg Trp Arg 


Ala 


Gly Gly 




450 










455 








460 










Lyo 


Leu 


Glu 


Tyr 


Ser 


Tyr 


Arg 


Asp 


Met 


Leu 


Arg Phe 


Leu 


Val 


Asp 


Ala 


465 










470 










47b 








480 


Ser 


Tytr 


Gly 


Lys 


Trp 


Asn 


Leu 


Asp 


Gly 


Gly Leu Val 


Ala 


Ser 


Met 


Gin 










485 










4 90 








495 




Pea 


Asp 


Leu 


He 


Leu 


Lys 


Ala 


Glu 


Val 


Gly Val His 


Pro 


He 


Ala 


Pro 








500 










505 








510 






Leu 


Asp 


Val 


Acq 


Leu 


Arg 


Tyr 


Thr 


Gin 


Leu Asn Gly Arg 


Tyr 


Arg Tyr 






515 










520 








525 








Ser 


the 


Gly 


Ser 


Ala 


Gly 


Ser 


Glu 


Ala 


Leu 


C.ly He 


Gly 


Asn 


Val 


His 




530 










535 








5*10 










Leu 




Sec 


Ala 


Asp 


Val 


Ser 


Tyr 


Lys 


Leu 


Lya Lys 


Asn 


Leu 


Ser 


Leu 


545 










550 










555 








56Q 


Tyr 


Leu 


LV5 


lie 


Asp 


Asn 


Met 


Leu 


Ala 


Glu 


Thr Thr 


Glu 


Leu 


He 


Gly 








565 










570 








57 5 




Tyr 


Tyr 


Pro 


net 


Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala Gly 


Phe 


Ser Trp Thr 


















585 








590 





































(2) II I FORI tAT I ON FOR SEQ ID HC:4 7l 

(1) SEQUF.IICF. CHARACTERISTICS: 

(A) LEHGTH; 58 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(111) JiYPOTHETICAL: YES 

(vil ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas glngivalis 

fix) FEATURE : 

iA) Umt/KE'i: misc feature 

(B) LOCATIOM 1. . ,589 





(>:i) 


SEQUEUCE DESCRIPTIOI-I: SEQ \ 


ID NO: 471 










MeL 


Lys 


Tyr 


Gin 


Leu 


Tyr 


Thr 


Ala 


Val 


He 


Met 


Ala 


Leu 


Ser 


Val 


Ser 


1 






5 










10 










15 




Ser 


Val 


Cys 


Gly 


Gin 


Thr 


Pro 


Arg 


Asn 


Thr 


Glu 


Thr 


Lys 


Arg 


Pro 


Asp 








20 










25 










30 






Thr 


Leu 


Arg 


Arg 


GlM 


Leu 


Thr 


He 


Vai 


Asn 


Asp 


Gin 


Thr 


Val 


Glu 


Het 






35 










40 










4b 








Glu 


HI-*? 


Ma 


Asp 


Pro 


Leu 


Pro 


Ala 


Ala 


Tyr 


Lys 


Ala 


He 


Glu 


Pro 


Arg 




5C 










55 










60 










Leu 


Lys 


Pro 


Phe 


Arg 


Pro 


Glu 


Tyr 


Asn 


Lys 


Arg 


Thr 


Phe 


Gly 


Phe 


Val 


65 










70 










75 










90 


Pro 


Glu 


Val 


Ser 


Ser 


Ser 


Gly 


Arg 


Asn 


Asn 


Leu 


Pro 


Asn 


He 


Leu 


Pro 










8S 










90 










95 




Thr 


Glu 


Gly 


His 


Met 


Vyn 


His 


Arg 


Gly 


Tyr 


Leu 


Asn 


He 


Gly 


He 


ray 








100 










105 










HO 






His 


The 


Leu 


Asn 


Gin 


Arg 


Met 


Aap 


Ala 


Gly 


Tyr 


Arg. Leu 


He 


Asp Ala 






115 










120 










125 








Glu 


Gin 


Glu 


Arg 


Leu 


Asn 


Leu 


Phe 


Leu 


Ser 


Tyr 


Arg 


Gly 


Het 


Lys 


Ser 




130 










135 










140 










Ala 


Phe 


Asn 


Thr 


Gly 


Arp 


Phe 


Asp 


Gly 


Asp 


Arg 


Lys 


Asp 


Arg 


Arg 


Met 


145 










150 










155 










160 


Met 


Ala 


Gly 


Val 


Asp 


Tyr 


Glu 


Gin 


Arg 


Arg 


Pro 


Ser 


Phe 


Val 


Leu 


Ala 
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65 











165 










170 










175 




The 


Gly 




Tyr 


Tyr 


Ser 


Asn 


His 


Tyr 


Phe 


Asn 


Asn 


Tyr 


Gly 


Arg 


Gly 








180 










185 










190 






Ala 


Thr 


Thr 


Asn 


Val 


Gly 


Ser 


He 


Pro 


Gin 


Leu 


Ser 


Thr 


Pro 


Val 


Thr 






19S 








200 










205 








Pro 


-jln 




Asp 


Affn 


Gly 


Thr 


His 


Asn 


Val 


Arg 


Val 


Tyr 


Leu 


Gly 


Ala 




210 










215 










220 










hyc 


Asn 


Afp 


Val 


He 


Asp 


Ala 


Arg 


He 


Afp 


Tyr 


Arg 


Phe 


Phe 


Arg 


Sec 


225 










230 










235 










240 


il< 


Pro 


Tyr 


Leu 


Gly 


The 


Asp 


Pro 


Met 


hyc 


Ma 


Leu 


Thr 


Glu 


His 


Thr 










245 










250 










255 




Pro 


Gl u 


Leu 


Asn 


Val 


Thr 


Met 


Ser 


Asn 


Glu 


Leu 


S'er 


Asp Asp 


He 


Lys 








260 










265 










270 






Leu 


Gly 


Val 


Glu 


Val 


Arcj 


Thr 


Gly 


Gly 


Lsu 


Phe 


Phe 


Ala 


Lys 


Asn 


Ser 






275 










280 










205 








Glu 


Met 


T)- 


Gin 


Thr 


Gly 


Val 


Leu 


Ser 


Glu 


Thr 


Asp Arg 


Asn 


Leu 


Tyr 




290 










295 










300 










Tyi: 


Vai 


Glu 


Gly 


Ala 


Pro 


Thr 


He 


Gly 


Phe 


Va 1 


Gly 


Aap 


Ser 


Asp 


Asn 


305 










310 










315 










320 


net 


Gin 


Trp 


Asn 


lie 


Gin 


Ala 


Gly 


Val 


Gly 


He 


Ser 


Ser 


His 


Phe 


Gly 










325 










330 










335 




Ala 


Lys 


Gly 


Arg 


Leu 


Phs 


Phe 


Trp 


Pro 


Ly^ 


Leu 


Asp 


Ala 


Ser 


Leu 


Ser 








340 










345 










350 






Tie 


Phft 


Pro 


Ser 


Trp 


Arg 


Val 


Tyr 


Ala 


Lys 


Ala 


Phe 


Gly 


Gly 


Val 


He 






355 










360 










365 








Acq 


A£-li 


Gly 


L«u 


Ala 


Aep 


Val 


Met 


Gin 


Glu 


Glu 


net 


Pro 


Tyr 


Leu 


Met 


37 0 










375 










360 










Pro 


A?n 


Thr 


Tie 


Val 


Leu 


Pro 


Ser 


Arg 


Asn 


Ala 


Leu 


Thr 


Ala 


Gin 


Leu 


385 










390 










395 










400 


Gly 


Val 


hys 


Gly Afii 


He 


Ala 


Asp 


Val 


Val 


Arg 


Met 


Glu 


Val 


Tyr 


Gly 








405 










410 










415 




Asp 


Phe 


Scr 


Lys 


Leu 


Thr 


Gly 


Val 


Fro 


Phe 


Tyr 


Thr 


Pro 


Thr 


Leu 


Pro 






420 










425 










430 






Lsu 


Tyr 


Asn 


Pro 


Ser 


Asp 


Lsu 


Tyr 


Gin 


Tyr 


Asn 


Val 


Ser 


Phe 


Leu 


Pro 






435 










440 










445 








He 


Tyr 


Ma 


Asp 


Gly 


Sue 


Arg 


Trp 


Arg 


Ala 


Gly 


Gly 


Lys 


Leu 


Glu 


Tyr 




450 










455 










4 60 










S-r 


Tyr 


Arg 


Asp 


Met 


Leu 


Arg 


Phe 


T.ou 


Val 


Arp 


Ala 


Ser Tyr 


Gly 


Lys 


4 65 










470 










475 










480 


Trp 


Acn 


L-su 


Asp 


Gly 


Gly 


Leu 


Val 


Ala 


Ser 


Met 


Gin 


Pro 


Asp 


Lsu 


He 








485 










490 










49S 




Leu 


Lys 


Ala 


Glu 


Val 


Gly 


Val 


His 


Pro 


He 


Ala 


Pro 


Leu 


Asp 


Val 


Arg 








500 










505 










510 






T.GU 


Arg 


Tyr 


Thr 


Gin 


Leu 


Asn 


Gly 


Arg 


Tyr 


Aro 


Tyr 


Ser 


Phe 


Gly 


Ser 






515 










520 










525 








Ala 


Gly 


Ser 


Glu 


Ala 


Lsu 


Gly 


He 


Gly 


Asn 


Val 


His 


Leu 


Leu 


Ser 


Ala 




530 










535 










540 










Asp 


Val 


ser 


Tyr 


Lys 


Leu 


Lys 


Lys 


Asn 


Leu 


Ser 


Leu 


Tyr 


Leu 


Lys 


lie 












550 










555 










550 


Asp 


Asri 


Het 


Leu 


Ala 


Glu 


Thr 


Thr 


Glu 


Leu 


He 


Gly Tyr Tyr 


Pro 


Met 








565 










570 










575 




Gin 


Pro 


Phe 


His 


Cys 


Phe 


Ala 


Gly 


Phe 


Scr 


Trp 


Thr 


Phe 














580 










585 

















10 



15 



20 



25 



30 



35 



40 



45 



50 



55 12) INFORHATIW POR SF.Q ID NO: 472 

(.1 ) SEQUEIJCE CHARACTERISTICS: 

(A) LENGTH: 34 6 amino acids 
(Bl TYPE: amino acid 
60 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(tii) nVPOTHETICAL: YES 



(vU ORIGIllAL SOURCE: 

(AJ ORGAIflSII: Porphyromona qinalvalls 



(ix) FEATURE: 
70 (A) tIA/IE/ICEV: misc_feature 

(R) LOCATICU 1...34 0 

ixi) SEQUEHCE DESCRIPTIOU: SEQ ID HO: 472 

75 Met Met Glu Lys Cys He Phe Ala HI.? Tyr Pro His Asn Leu Val Phe 



SUBSnrUTE sheet (Rule 26) (RO/AU) 
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421/ 



490 



1 



5 



10 



IS 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Met 


lie 


Arq 


Lys 


His 


Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Scr 


Lou 


Val 








20 










25 










30 






Phe 


Ser 


Ala 


Gly 


Ala 


Gin 


Gin 


Gl u 


Ly.7 


Gin 


Va 1 


Fhe 


His 


Phe 


Leu 


Asn 






35 










40 










*1S 








Leu 


Pro 


AlA 


Thr 


Ala 


Gin 


Ala 


Leu 


Ala 


Ala 


Gly 


Gly 


Lys 


Ala 


He 


Thr 




50 










55 










60 










lie 


Val 


A.-P 


hC'p 


Asn 


Pro Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 


65 








70 










75 










80 


Cilv 


■i'yr 


Glu 


Ser 


Gly 


Gly Arg 


Ala 


Phe 


Leu 


Ser 


Tyr 


Leu 


Tyr 


Tyr 


Het 










85 










90 










95 




Ser 


Gly 


Ser 


Hir 


Met 


Gly Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val 


Gly 


Glu 








100 










105 










110 






Arg 


Gly Met 


Trp 


Gly 


Val 


Gly Met Arg 


Phe 


l.eu 


A^n 


Tyr 


Gly 


Ser 


Met 






115 










120 










125 








Gin 


Gly 


Tyr 


Afp 


Gin 


Asa 


Ala 


He 


Ala 


Thr 


Gly 


Ser 


Phe 


Ser 


Ala 


Ser 




130 










135 










140 










Arp 


lie 


Ala 


Val 


Gin 


Gly 


Phe 


Tyr 


Ser 


His 


Glu 


Leu 


Ser 


Asn 


His 


Phe 












150 










155 










160 


Arg 


Gly Gly 


Val 


Ser 


Leu 


Lys 


Ala 


Leu Tyr 


Ser 


Ser 


He 


Glu 


Thr 


Tyr 










1 65 










170 










17i. 




Ser 


Ser 


Phe 


Gly 


Leu 


Gly 


Val 


Asp V3l 


Gly 


He 


Ser 


Tyr 


Tyr 


Asp 


Asp 








190 










185 










190 






Asp 


T.ys 




Tyr 


Ser 


Ala 


ser 


Ala 


Leu 


Phe 


Lys 


Asn 


Val 


Gly Ala 


Gin 




195 










200 










205 








Leu 


Lys 


Gly 


Tyr 


Asn 


Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Affp 


Trp 


Asp 


Fhe 


Gin 














215 










220 










Leu 


Gly 


Phe 


Ser 


Arg 


Ser 


Phe 


He 


Asn 


Ala 


Pre 


Phe 


Arg 


Leu 


His 


He 


225 










230 










235 










240 


Thr 


L^u 


Phe 


Aan 


Leu 


Asn 


Pro 


His 


Tyr 


Phe 


T.ys 


A eg 


Leu 


Val 


Pro 


Arq 










245 










250 










255 




A£p 


Leu 


ser 


l.ys 


Met 


Gin 


Lys 


Phe 


Leu Arg 


His 


Phe 


Ser 


He Gly Ala 








260 










265 










270 






Glu 


Phe 


Thr 


Pro 


Ser 


Glu Arg 


Phe 


Trp 


Val 


Gly 


Leu 


Gly 


Tyr 


Thr 


Pro 






275 










280 










285 








Gin 


lis 


Ala 


Gl n 


Asp 


Phe 


Glu 


Val 


Giu 


Gly 


Gly 


Asn 


Lys 


Trp Gly Gly 




290 










2 95 










300 










Leu 


Ser 


Ala 


Gly 


Val 


Gly 


Phe 


Thr 


Sor 


Gly 


Val 


Val 


Arg 


Val 


Gly 


Val 


305 










310 










315 










320 


Ser 


Al A 


Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


Met 


Cys 


Ser 


Val 










325 










330 










335 




Gly 


He 


Acq 


Leu 


Asp 


Asp 


Lys 


Ser 


He 


Phe 















(2) TNFORHATTOH FOR SEQ ID NO: 473 

(il SEQUENCE CHARACTERISTICS: 

{Al LENGTH: 345 amino acids 
\n) TYPEt amino acid 
fO) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

<ili) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A} ORGAMISH: Porphyroraonas ginglvalle 

<ixl FEATURE: 

(A) MAIIE/KEY: misc_featare 
(Dl LOCATIOH 1 . . .345 

(xi) SEQUENCE DESCRIPTIOM: SEQ ID HO: 473 

MsL Glu Lys Cy5 He Fhe Ala His Tyr Pro His Aitn Leu Val Fhe Met 

15 10 15 

He Arg Lys His Phe Gly He He Leu Gly Phe Leu Ser Leu Val Phe 

20 25 30 

Ser Ala Gly Ala Gin Gin Glu Lys Gin Val Phe Hle-Phe Leu Asn Leu 

35 40 45 

Pro Ala Thr Ala Gin Ala Leu Ala Ala Gly Gly hys Ala He Tht He 

50 • 55 60 

Val Asp Asp Aan Pro Gly Leu Ala Phe Glu Asn Pro Ala Leu Leu Gly 
65 70 75 30 

Tyr Glu Ser Gly Gly Arg Ala Phe Leu Ssr Tyr Leu Tyr Tyr Met Ser 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 
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05 90 95 



Gly 


5er 


His 


Mot". 


Gly 


Af n 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Val Gly Glu Arg 








100 










105 








110 




(ily 


Met 


Trp 


Gly 


Val 


Giy 


[WL 


Arg 


Phe 


Leu 


As-n 


Tyr 


Gly Ser Met 


Gin 






115 










120 










125 




Gly Tyr 


Asp 


Gin 


Asn 


Ala 


lie 


Ala 


Thr Gly 


Ser 


Phe 


Ser Ala Ser 


Asp 




130 










135 










140 




Ila 


Ala 


Val 


Gin 


Gly 


Ph = 


Tyr 


Ser 


(lis 


Glu 


Leu 


Ser 


Asn His Phe 


Arg 


145 










150 










155 






160 


Gly 


Gly 


Val 


Ser 


Leu 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


He 


Glu Thr Tyr 


Ser 










165 










170 






175 




Ser 


Phe 


Giy 


Leu 


Gly Val 


Asp 


Val 


Gly 


He 


Ser 


Tyr 


Tyr Asp Asp Afp 








IBO 










185 








190 




hys 


Gly 


Tyr 


Ser 


Ala 


Ser 


Aia 


Leu 


Phe 


Lys 


Asn 


Val 


Gly Ala Gin 


Leu 






195 










200 










205 




Lys 


Gly 


Tyr 


Arn 


Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Asp 


Trp Asp Phe Gin 


Leu 




210 










215 










220 






Gly 


Phe 


Ser 


Arg 


Ser 


Phe 


He 


Affn 


Ala 


Pro 


Phe 


Arg 


Leu His He 


Thr 


225 










230 










235 






240 


Leu 


Pha 


Asn 


Leu 


Asn 


Pro 


Hi." 


Tyr 


Phe 


Lys 


Arg 


Leu 


Val Pro Arg 


Asp 










2H5 










250 






255 




L*u 


Ser 


Lys 


Hex. 


Gin 


Lys 


Ph« 


Leu 


Axg 


Hlc 


Phe 


Ser 


He Gly Ala 


Glu 








260 










265 








270 




Phe 


Thr 


Pro 


Sor 


Glu 


Arg 


Phs 


Trp 


Val 


rjly 


Leu 


Giy Tyr Thr Pro 


Gin 






275 










280 










285 




lie 


Ala 


Gin 


Asp 


Phe 


Glu 


Val 


Glu 


Gly 


Gly 


A^n 


Lyr 


Trp Gly Gly 


Leu 




290 










295 










300 






Ser 


Ala 


Gly 


Val 


Gly 


Pho 


Thr 


Ser 


Gly Val 


Val 


Arg 


Val Gly Vai 


Ser 


305 










310 










315 






320 


Ala 


Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe 


Met 


cys Sar Val 


Gly 










3.?5 










33C 






335 




lie 


Arg 


L«u 


Asp Aep Lys 


Ser 


lie 















340 345 

(21 IHFORHATIOH FOR SEQ ID HO: 474 

(i) SEQUENCE CHARACTERISTICS: 

fA) LENGTH: 330 amino acidn 
(B! TYPE: amino acid 
{D\ TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

{111) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISil: Porphyromonas gingivalls 

(ix) FEATURE: 

(AJ NAHE/KE/: misc feature 

(B) LOCATION 1. . . 330 

{y.i) SEQUENCE DESCRIPTION: SEQ ID NO: 47 4 



Met He 


Arg 


Lys 


His 


Phe 


Gly 


He 


He 


Leu 


Gly 


Phe 


Leu 


Ser 


Leu 


Val 


1 






5 










10 










15 




Phe Ser 


Ala 


Gly 


Ala 


Gin 


Gin 


Glu 


Lys 


Gin 


Val 


Phe 


His 


Phe 


Leu 


A£n 






20 










25 










30 






Leu Pro 


Ala 


Thr 


Aia 


Gin 


Ala 


Lttu 


Ala 


Ala 


Gly 


Gly 


Ly.-. 


Ala 


He 


Thr 




35 










40 








45 








lie Val 


Aep 


Asp 


Asn 


Pro 


Gly 


Leu 


Ala 


Phe 


Glu 


Asn 


Pro 


Ala 


Leu 


Leu 


50 










55 










60 










Gly Tyr 


Clu 


Ser 


Gly 


Gly 


Arg 


Ala 


Phe 


Leu 


Scr 


Tyr 


Leu 


Tyr 


Tyr 


Met 


65 








70 










75 








80 


Ser Giy 


Set 


His 


Hot 


Gly 


Asn 


Ala 


Cys 


Tyr 


Ala 


Ser 


Ser 


Vol 


Gly 


Glu 








85 










90 










95 




Arg Gly 


net 


Trp 


Gly 


Val 


Gly 


Met 


Arg 


Phe 


Leu 


Asn 


Tyr Gly 


Ser 


Met 






100 










105 










110 






Gin Gly 


Tyr 


Asp 


Gin 


A:sn 


Ala 


He 


Aia 


Thr 


Gly 


Ser- 


Phe 


Ser 


Ala 


Ser 




115 










120 










125 








Asp He 


Ala 


Val 


Gin 


Gly 


Phe 


Tyr 


Ser 


His 


Glu 


Leu 


Ser 


Asn 


His 


Phe 


130 










135 










140 










Arg Cly Gly 


Val 


Ser 


Leu 


Lys 


Ala 


Leu 


Tyr 


Ser 


Ser 


He 


Glu 


Thr 


Tyr 


145 








150 










l&S 










160 


Ser Sier 


Phe 


Gly 


Leu 


Gly 


Val 


Arp 


Vai 


Gly 


He 


Ssr 


Tyr 


Tyr 


Asp Asp 
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165 



170 



175 



5 
10 
15 
20 
25 
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Asp 


Lys 


Gly 


Tyr 


Ser 


Ala 


Ser 


Ala 


Leu 


Phe 


Lys 


Asn Val 


Gly Ala 


Gin 






180 










185 








190 




Leu 


Lys 


Gly 


Tyr 


Asn 


Glu 


Glu 


Arg 


Glu 


Pro 


Leu 


Affp Trp 


Asp Phe 


Gin 






195 










200 








205 






l.cu 


Gly 


Phe 


Ser 


Arg 


Ser 


Phe 


lie 


Asn 


Ala 


Pro 


Phe Arg 


Leu His 


lie 




210 










215 










220 






Thr 


Leu 


Phe 


Asm 


Ltiu 


A£-n 


Pro 


His 


Tyr 


Phe 


Lye 


Arg Lou 


Val Pro 


Arg 












230 










235 






240 


Arp 


Leu 


Ssr 


I.yc 


Met 


Gin 


Lys 


Phe 


Leu 


Arg 


His 


?hs Ser 


lie Gly 


Ala 










245 










250 






255 




Glu 


Phe 


Thr 


Pro 


Ser 


Glu 


Arg 


Phe 


Trp 


Val 


■31 y 


Leu Gly Tyr Thr 


Pro 








260 










265 








270 




Gin 


lie 


Ala 


Gin 


A5P 


Phe 


Glu 


Vai 


Glu 


Gly 


Gly 


Asn Lys 


Tip Gly 


Gly 






275 










200 








205 






Leu 


Ser 


Ala 


Gly Val 


Gly 


Phe 


Thr 


Ser 


Gly 


VaI 


Val Arg 


Val Gly 


Val 




290 










295 










300 










Ala 


Thr 


Tyr 


His 


Pro 


Ala 


Ala 


Leu 


Ser 


Phe Met 


Cys Ser 


Val 


305 










31 0 










315 






320 


Gly 


lie 


Arg 


Leu 


Asp 


Asp 


Lys 


Ser 


T3- 


Phe 











(21 IHFOW^tATTQH FOR SEQ ID NO: 475 

f i ) iJKQUEMCE CHARACTERI STICS : 

(A) LEHGTH: 32 4 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

<ii) MOLECUTX TYPE: protein 

(iii» HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAItTSM: Porphyromonas gingival is 

Ux) ttlATURE; 

(A) HAHE/KEY: misc_{eature 
(Bt LOGATIOfJ 1...324 



(xi) iEQC'ElICE DESCRIPTIOH: SEQ ID 110:475 



net 


Cys 


Leu 


Glu 


Pro 
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iZ) INFORIIATTOH FOR SEQ ID MO: 4 76 

11} SEQUENCE CHARACTERISTICS: 

(AJ LENGTH: 547 amino acids 
(BJ TVPS: amino acid 
(D) TOPOLOGV: linear 

(H) MOLECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

Ivl) ORIGinAL SOURCE: 

;A1 ORGAWIStI: Porphyromonas gingivalis 

(1x1 FELfi.TURE;- 

tA) HAllF./KEY: mi2c_f eatur« 
(B) LOCATIOfI 1 . . .547 



ixl) SEUUeHCE DESCRIPTIOH: SEQ ID MO; 47 8 
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(C) lllfOKJ lATIOW FOR SEQ ID f 10: 477 

(1} SF.QUKUCC CHARACTiiRISTICS: 

fA) LEMGTH: 7 50 amino acid^ 
(B) TYPE; amino acid 
tD» TOPOLOGY: linear 

(ii) MOLECULE TYPR: protein 

(lii) HYPOTHETICAL: YES 

<vl) ORIGINAL SOURCE: 

fA) ORGAHISM: Porphyromonas gingivalls 

(ix) FEATURE: 

tA) tlAJIE/KEY: misc_i eature 
(B) LOCATTOII 1 . , .750 



{>:!) SEQUEMCE DESCRIPTICH: SEQ It) HO: 477 
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(2 > illt'Oiy iATiai FOR SEQ ID HO: 47 8 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 94 amino acids 
(01 TYPE: amino acid 
(DJ TOPOLOG-i:: linear 



(11) HOLECULE TYPE: protein 
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{ilil m"POTHETICAL: YES 

(vi) ORIGIIIAL SOURCE: 

(A) ORGAMISH: Porphyrofnonas gingi vails 

(ix) FEATLFRR: 

(A) NAME/KEi*; ml sc f eat uro 

(B) LOCATIOH 1. . . 4 94 



Ixi) SEQUENCE DESCRIPTION: SEQ ID 1*0:478 
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Lys 


Arg 


Arg 


Phe 


Leu 


Set 


Leu 


Leu 


Leu 


Leu 


Tyr 


He 


Leu 


Ser 


5*3 r 


1 








5 










10 










1 5 




tic 


Scr 




Sc r 


AlA 


Gin 


Arg 


Phe 


Pro 


Met 


Val 


Gin 


Gly 


He 


Glu 


Leu 








20 










25 










30 






Asp 


Thr 


Asp 


Ser 


Leu 


Phe 


Ser 


Leu 


Pro 


Lys 


Arg 


Pro 


Trp 


Arg 


Ala 


He 






35 










40 










45 








Gly 


T,ys 


Th r 


T 1 e 


Gly 


V^l 


Asn 


Leu 


Ala 


Val 


Trp 


Gly 


Phe 


Asp 


His 


Phe 


50 










55 










60 










He 


Met 


Asn 


Glu 


Asp 


Phe 


Ala 


Asp 


He 


Ser 


Trp 


Gin 


Thr 


He 


Lys 


Ser 


65 










70 










7 5 










80 


A5n 


Phf! 


Gin 


Thr 


Gly 


Phe 


Gly 


Trp 


Asp 


Asn 


Asp 


Lys 


Phe 


Val 


Thr 


Asn 










85 










90 










95 




Leu 


Phe 


Ala 


His 


Pro 


Tyr 


His 


Gly 


Ser 


Lou 


Tyr 


Phe 


Asn 


Aia 


Ala 


Arg 








100 










105 










110 






Ser 


Asn 


Gly 


Leu 


Ser 


Phe 


Arg 


His 


Ser 


Ala 


Pro 


Phe 


Ala 


Phe 


Phe 


Gly 






H5 










120 










125 








Ser 


Leu 


Met 


Trp 


Glu 


Leu 


Leu 


Met 


GLu 


Asn 


Glu 


Pro 


Pro 


Ser 


He 


A^rn 




130 










135 










140 










Asp 


Leu 


Cys 


Ala 


Thr 


Thr 


He 


Gly 


Gly 


He 


Ala 


Leu 


Gly 


Glu 


Met 


Gly 


145 










150 










155 










160 


His 


Arg 


Leu 


Ser 


Asp 


Leu 


Leu 


He 


Affp 


Asn 


Arg 


Thr 


Thr 


Gly 


Trp 


Glu 










16b 










170 










175 




Arg 


H-t 


Gly 


Arg 


Glu 


Val 


Ala 


He 


Ala 


Leu 


He 


Asn 


Pro 


Met 


Arg 


Phe 






180 










IBS 










1 90 






Leu 


Asn 


Arg 


Leu 


Thr 


Ala 


Gly 


Glu 


Val 


Thr 


Scr 


Val 


Gly 


Scr 


Arg 


Ser 






1?5 










200 










205 








Gly 


Gin 


He 


Phe 


Gin 


Ser 


Val 


Pro 


He 


Asn 


He 


Val 


Val 


Asp 


Aia 


Gly 




210 










215 










220 










Phe 


Arg 


Ph^ 


Leu 


Ala 


Asp 


Lys 


Arg 


His 


Ala 


Arg 


Thr 


Gly 


Ala 


Thr 


Ala 


225 










230 










235 










240 


Leu 


The 


L'-u 




Leu 


& fit 
Arg 


Phe 


Asp 


Tyr 




Asp 


Pro 


Phe 


Arg 


Ser 


Glu 










245 










2SC 










255 




ThjT 


Phe 


Ser 


Pro 


Tyr 


A£p 


Phe 


Phe 


Gin 


Phe 


L'i's 


Ala 


Gly 


Leu 


Ser 


Phe 








260 










265 










270 






Ser 


Glu 


Ser 


Gin 


Pro 


Leu 


Leu 


Ser 


Gin 


He 


Asn 


Leu 


He 


Gly 


He 


Leu 






.?75 










200 










2tiS 








Ser 


Gly 


Cy£ 


Gin 


Leu 


Leu 


Ala 


His 


Glu 


Acq 


Thr 


Val 


Leu 


Val 


Gly 


Gly 




290 










295 










300 










L^ii 


Fh- 


Gin 


Ills 


Phe 


Asp 


Tyr 


Tyr 


Asn 


Ser 


Glu 


Lys 


Arg 


He 


Ser 


Lys 


305 










310 










315 










320 


Asn 


Ser 


Glu 


Glu 


Val 


Leu 


Val 


Thr 


Pro 


Tyr 


Arg 


He 


Ser 


Gin 


Val 


Ala 










325 










330 










335 




Ala 


Leu 


Gly 


Gly 


Gly 


Leu 


He 


Phe 


Gin 


His 


His 


Gly 


Lys 


Phe 


Arg 


Arg 








340 










345 










350 






Arg 


Pro 


Leu 


Glu 


Leu 


Tyr 


Ala 


Glu 


Thr 


Tyr 


Leu 


Asn 


Val 


Val 


Pro 


Met 




355 










360 










365 








Gly 


Ala 


Ser 


Leu 


Ser 


A^p 


His 


Tyr 


Asii 


Val 


Asp 


Asn 


Arg 


Ai-p 


Tyr 


Asn 




37 0 










375 










380 










I.ru Gly 


Ser 


Gly 


Leu 


Ser 


Gly 


Lys 


Leu 


Ti-r 


Leu 


Gly 


Ala 


Thr 


Tyr 


Asn 


385 










390 










395 










400 


Asp 


Leu 


Trp 


Ser 


Trp 


Leu 


Leu 


Gly 


Val 


Glu 


Ser 


Tyr 


Arg 


Leu 


Tyr 


Thr 










405 










410 










415 




Trp 


He 


Gly 


Tyr 


Glu 


Glu 


Pro 


His 


Gin 


Lys 


A;sn 


Thr 


Asp 


Val 


Ser 


Ser 








420 










425 










4 30 






Phe 


Met 


Val 


Gin 


Gly 


Asp 


Glu 


Ser 


Lys 


Ala 


Arg 


Leu 


Leu 


Vai 


Thr 


Ser 






435 










440 










445 








Ser 


Gl\i 


Phe 


Ala 


Phe 


His 


Pro 


Gly 


Pro 


Trp 


Kis 


Val 


Ala 


He 


Val 


Ala 




450 










455 










46€ 










Arg 


Arg 


Phe 


He 


Arg 


Lys 


Thr 


Ala 


Tyr 


Gin 


Phe 


Tyr 


Pro 


A.-rn 


Val 


Ser 


465 










470 










475 










480 


Phe 


Asp 


Thr 


Gly 


Asp 


He 


Gin 


Leu 


Arg 


Val 


Gly 


Phe 


His 


Pile 







4 65 4 90 



(2) INFORIIATIOW FOR SEO ID NO: 47 9 



SUBSTITUTE SHEET O^c 26) (RO/AU) 



fP^li iH^ii vrfi 




tP^'h -rt*' •iP'ii fF^l 

•iUi O r:: 



wo 99/29870 PCTMU98/01023 

428/490 



fi) SEQUEHCF, CHARACTERISTICS: 

(A) LENGTH: 29'! amino acids 
(8) type:: amino acid 
tDJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTflETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) OKGAtilSI-l; Porphyromonas gingivalis- 

UxJ FEATURE: 

Ih) HAJIE/KEi; miEc_fe3turc 
lU) LOCATION 1 . . .294 



(xi) SEQLfE>ICE DESCRIPTIOH: SZQ ID 110:47 9 



Met 


Lys 


Arg 


Leu 


He 


Val 


Fhe 


Leu 


Ala 


Het Gly 


Gly Leu 


Leu 


Phe 


Thr 


1 








5 










10 










15 




Leu 


Ma 


Asn 


Ala 


Gin 


Glu 


Ala 


Asn 


Thr 


Ala 


Ser 


Asp Thr 


Pro 


Lys 


Lys 








20 










25 










30 






A.'FP 


Trp 


Thr 


He 


Lys 


Gly 


Val 


Thr 


Gly 


Leu 


Asn 


Ala 


Ser 


Gin 


Thr 


Ssr 
















40 










45 








Lea 


Thr 


Arn 


Trp 


Al a 


Ala 


Gly 


Gly 


Glu 


Asn 


Thr 


Val 


Ala 


Gly Asn 


Leu 




50 










55 










60 










Tyr 


L«u 


Arn 


He 


Asp 


Ala 


Aea 


Tyr 


Leu 


Lys 


Asp 


Lis 


Trp 


Ser 


Trp 


Asp 


65 










70 










75 










80 


Asn 


Gly 


L^iU 


Arg 


Thr 


Asp 


Phe 


Gly 


Leu 


Thr 


Tyr 


Thr 


Thr 


Ala 


Asn 


Lys 










85 










90 










95 




Trp 


Arn 


Lys 


Ser 


Val 


Asp 


Lys 


He 


Glu 


Leu 


Phe 


Thr 


Lys 


Ala 


Gly 


Tyr 








100 










105 










110 






Glu 


lie 


Gly 


Ly.s 


Ills 


Trp 


Tyr 


Gly 


Ser 


Ala 


Leu 


Phe 


Thr 


Phe 


Leu 


Ser 






115 










120 










125 








Gin 


Tyr 


Ali 


Lys 


Gly 


Tyr 


Glu 


Lys 


Pro 


Sor Asp 


His 


Leu 


Thr Gly 


Val 




13Q 










135 










140 










Lyc 


His 


He 


Ser 


Asn 


Phe 


Phe 


Ala 


l?ro 


Ala 


Tyr 


L.eu 


Thr 


Leu 


Gly 


lie 


1-15 










150 










155 










160 


Gly Ala 


Asp 


Tyr 


Lys 


Pro 


Asn 


Glu 


Lys 


Phe 


Ser 


Leu 


Tyr 


Leu 


Sor 


Pro 










165 










170 










175 




Thr 


Thr 


Gly 


Lys 


Leu 


Thr 


Val 


Val 


Ala 


Asp Asp 


Tyr 


Leu 


Ser 


Ser 


Leu 








180 










185 










190 






Gly 


Ala 


Fhe 


Gly 


Val 


Lys 


Val 


Gly 


Glu 


T.ys 


Thr 


Met 


Phe 


Glu 


Leu 


Gly 






195 










200 










205 








Ma 


Leu 


Val 


Val 


Glv 


Ser 


Ala 


Asn 


H« 


Asn 


Leu 


Het 


Glu 


Asn 


Val 


Asn 




210 










215 










220 










Leu 


He- 


Thr 


Lyt 


Ala 


Ser 


Phe 


Phe 


Str 


Ala 


Tyr 


Thr 


Hi.-? 


Asp 


Phe 


Gly 












230 










235 










240 




ll- 


Affp 


He 


Asn 


Trp 


Glu 


Ala 


Het 


Leu 


Ala 


Het 


Lys 


He 


Asn 


Lys 










245 










250 










2S5 




Phe 


Leu 


Thr 


Ala 


Thr 


He 


Ala 


Thr 


Asn 


Leu 


He 


Tyc Asp 


Asp Asp 


Val 








260 










265 










270 








He 


Asn 


Asp 


Gly 


Pro 


Lys 


He 


Gin 


Phe 


Lys 


Glu 


Val 


Val 


Gly 


Val 






27S 










280 










285 








Gly Val 


Ala 


Tyr 


Thr 


Phs 























290 



{?.) TMFORHATIon FOR SEQ 10 NO: 4 80 

{ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 amino acids 
(B» TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

(vij ORIGIUAL SOURCE: 

lA) ORGAIHSM; Porphyromonas gingivalie 

ti:0 FEATURE: 

(A) HAIIE/KEY: ml sf:_f ea t ure 

(B) LOCATIOf] 1 . . .204 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



™P o A oi e m -Cji kt; lO 1Q O 



WO 99/29870 

429/ 490 

(Xl) SEQUENCE DESCRIPTION: StQ ID NO: 4 80 



Met Lys Lys Met lie Leu Ala Ala Thr Met T.cu l.cu Ala Thr He Gly 
Phe Ala A.,n Ala Cln Ser Arg Fro Ala [eu Arg Leu A^p Ala ^sn Phe 
Val Gly Ser A^n Leu Met Gin Lys Val Ala A.-n Thr Ser Val Asn Asn 
Lyn Mnt Tl« Val Gly Leu Arg Val Gly Ala Ala Ala gL Phe Ala Leu 
ser Asn Asp Gly Phe Tyr Leu Ala Pro Gly Leu IL Tyr Thr Met Arg 



-75 



15 



Gly Ala Ly.^ Met Glu S^r Leu Ser Glu Thr Thr Thr Arg Leu His Tyr 



90 



Leu Gin He Pro V^l Asn Ala Gly Met Arg The S^r Phe Ala Lp A^n 

105 110 
Met Al. T.^. s<.r L-.u Clu Ala Gly Pro Tyr Phe Ala Tyr Gly Val Ala 

20 Gly Thr lie Lys Thr Lys Val Gly Val Thr Ala sll Val A»p Ala 

Dhe Gly A.-P Asn Gly Tyr Asn Arg Phe A^p L.u ely Leu Gly Leu Ser 

2g Ala Ala Leu Ser Tyr Asp Arg Tyr Tyr Val Gin He Gly Tyr Glu Hi^ 
Gly Leu Leu Asn Met Leu Lys Asp Ala pJo Asp Lys Thr Sc-r Arg 



30 



35 



Asn His Asp Phe Phe Val Gly Leu Gly Val Arg Ph- 
195 200 

(2) IHFORIIATIOIJ ETIR SEQ ID HO: 481 

<i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 24 3 amino ac;ids 
(R) TYPE: amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

40 (iii) HYPOTHETICAL: YES 

(Vl) ORIGINAL SOCfRCE: 

(A) ORGAinsH: Porphyromonas gingivalis 

45 { XK ) FEA'J'URE : 

(A) NAME/ KEY: misc feature 

(B) LOCATIOH i , . .2T3 

gQ ("i' SEQUENCE DESCRIPTION: SEQ in M0:4P1 

net Lys Arq He Phe Thr Val Ala Leu Val Leu Leu Ala Scr Val Thr 
Met Ala He Gly Gin Scr Arg Pro Ala Leu Arg Val Asp Ala Asn Phe 

55 val GlY Ser A»„ Gin Ser Met Lys ^g Asp Gly Tyr Val T^p A«p Thr 

Lv^ Met Asn Val Gly Leu Arg Val Gly Ala Ala Ala Glu Phe Met He 

55 gQ 

60 65^' ^" ""^ 7^ ^" "^y^ "'^■^ '•i-' 

Gly Ser Lys Thr Glu Trp Asp He Pro Glu llet Val Pro Gly Thr ^^r 

qo ^ 



He Thr Het Val Ser Thr Arg Leu His Tyr Leu Gin Leu Pro l" 

Cc 105 iin 
DO Ala Glv Mot Ara Phe A.*r» t... . 



Asn 



Ala Gly Met Arg Phe Asp Leu Met Asn Asp Met Ala Val Ser He Glu 

110 



Ala Gly Pro Phe Leu Ala Tyr G^y He Tyr Gly The ^yr Arq Gin Lys 



70 14S "^^^^ [so 

Thr Leu Gly Gly Pro Thr Asn He Arg Trp Asp He Gly Ala A..n n'e 

165 i-jQ _ 

rie Ala Ala Phe His Tyr Lys Arg Tyr Tyr He Gin He Gly Tyr Glu 

75 His Gly Pho val A.,p T,e V.l Sec Gl'y Gly Gly Ser Asp \il ?ro Arg 



PCT/AU98/01023 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



lUli "z?^ »«i3 .«iL jilZ O ias uj -lUii «3 'sSZ:^ O iUi ^Ui 



wo 99/29870 

430/ 490 

200 205 
Leu Asn Asp Asn Arg Gin Ser Ser Ser Thr Thr Ala Leu Ary Glu Lyr 

Gly Asn Acn Glu Tyr Ala Tyr A.-n Arg A--.p Phe Ph^ Val Gly He Gly 

I « r,u 235 2^0 

Tyr Arg Phe "^^-^ 

(2) IHt*ORi'IATIOn FOR SEQ ID HO: 4 82 

(i) SF.QUEIICE CHARACTERISTICS: 

(A) LEHGTH: 207 amino acid*^ 

(B) TYPE: amino acid 
(P) TOPOLOGV: linear 

(ii) HOLECULE TYPE: protein 

(iii) nypO-i'HETlCAL: YES 

(Vi) ORIGIIIAL SOURCE: 

(AJ ORGAHISM: Porphyromonas ginqt vails 

iiil) FEATURE: 

(A) IIAIIE/KKV; misc feature 
(Bl LOCATIOH 1. . .207 

(Xi) SEQUENCE DESCRIPTlOtl: SEO ID MO: 482 

Met Lys Arg Met Leu Leu Leu Leu Val Val Lj^u Leu Tyr Gly He Ala 

Gly Arg Leu Ala Ala Gin Asp Val He Arg Pro Trp Ser Leu Gin Val 

25 30 
Gly Ala Gly Tyr Ser Acp Thr Gin A.sr. He Pre Gly Gly Phe Thr Tyr 

Gly Phe Tyr Leu Gly Lys Arg Met Gly Ser Phe Leu Glu Val Gly Leu 

55 (^Q 
Ser Met Tyr Asn Ser Thr Arg Gin Thr Ala Asn Asn Ala Asp Ser Phe 

Ala ser Asn Glu Gly Asp Gly Ser Phe Gin Val Asn Met Ser Ser Pro 

°S 90 ^5 

Asn Glu Lys Trp Ser Phe Phe Asp Ala Gly Ser Ala Asn Cys iyr M-t 

105 

He Val Val Gly Val Asn Pro Leu His Leu Phe Trp Gin Asn Ser Arg 

His Asn Leu Phe Leu Ala Va) Gin Ala Cly Leu Ser Lys His Asn 

lie Hlr Ph^ Ho Tyr Gly Asp Lys Gly Ala Lys vtl Ser He Tyr Thi 

Asn Ser Asn Thr Tyr He Gly Tyr Gly Ala ^rg Val Ala Tyr Glu Tyr 

165 j^t 
Gin He His Lys Asn Val Gly Ala Gly Ala Ala Vsl ilet Tyr Asp His 

, , 185 ton 

^ly Asn Lys flet Leu Thr Ala Met Ala Thr Leu Ser Thr his Phe 
195 200 205 

<2) IHFORHATION FOR SEQ ID NO: 4 83 

(1) SEQURUCE CHARACTERISTICS- 

(A) LENGTH: 951 amino acids 
tB) rx-pS: amino acid 
(D) TOPOLOGY: linear 

(ii» MOLECULE TYPE; protein 

(HI) HYPOTHETICAL: VES 

(vl) ORIGINAL SOURCE: 

(A) ORGAiUSH: Porphyron.jnas gingivalis 

lix) FEATL'RE: 

(AJ NAflE/KEY: misc featur- 

(B) LOCATIOH 1 ... 951 

(>:i) SEQUENCE DESCRIPTION: SEQ ID NO: 483 



PCT/AU98/01023 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



iP=^ ia~Hi W"^ ii"»Jl «n 



WO 99/29870 



PCT/AU98/01023 



431/490 



Met Arg 
1 

He Lou 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Met Aia 

Asp Ssr 

50 
Ser Ala 

Gin A£p 

Tyr Glu 

Leu Phe 

A5n Phe 
130 
Val L-u 

Glu Lys 

Leu Thr 

Gin Gly 
210 
Tyr He 
225 

TKr Gly 



Vai Ser 

Val Gly 

20 
Gly Ala 
35 

Thr Ala 
Phe Arg 

Ser Val 

Ala Ala 

100 
Gly Lys 
115 

Met Tyr 

Asp Thr 

Gin Ser 

Gly He 
180 
Ala Gly 

Gly Arg 



Asp Leu Cyc 
5 

Leu Leu Cys 

Val Gly Leu 

Lya fKsp Thr 
55 

Asp Ser Leu 
70 

Tyr Asp A«p 
85 

Asp Ser He 

Ser Tyr Val 

Leu Asn Thr 
135 

Ala Gly Tyr 

150 
Phe Glu Ala 
165 

He Ser Gly 
Lys Thr Lys 



Ser Arq 

Ala Thr 

25 
His His 
40 

Val Pro 

Pro Ala 

Glu Phe 

Val Leu 
105 
Ser Tyr 
120 

Asp Scr 
Pro Met 
T.ys A.?n 



Asn Leu 
Pro Val 



Leu Pro 

Leu tl£t 

Asn Gly 
290 
Arg Gly 
305 

Lys Tyr 

Leu Val 

Thr Thr 

A^n Fro 
370 
Tyr Fhe 
385 

Ala Thr 

Thr Pro 

Asp Thr 

Thr Arg 
450 
Tyr Glu 
465 

Leu Thr 



Tyr Thr Thr 
215 

Ser Lys Ala 

230 
Asn Leu Val 
245 

Tyr Phe Pro 



Phe Gly 
260 

Pro Thr Tyr Gly Glu 
275 

Gly Tyr 



Glu He 

Lys Lys 

Ser Lys 
3^0 
Ser Leu 
355 

Leu Gin 

Gin Asn 

Thr Arg 

Phe Ser 
420 
Thr Vai 
435 

Tyr Pro 
Lys Leu 
Lyc Glu 



Lys hsn 

Leu Asp 

Tyr Thr 
530 
Leu Pro 
545 

Leu Ser 
Lyt- Pro 
Thr Pro 



Gly Itet 
500 
Tyr He 
515 

Lys Gly 

Ser Aap 

Ala Gly 

Phe Ser 
580 
Thr Val 



Tyr Phe Ala 
295 

Phe Ser Lys 

310 
Arg Tyr Lys 
32b 

Ser Gly Asp 

Asn He Arg 

Thr Leu Ser 
375 

Ser Lsu Asn 

390 
Ser Ser Ala 
405 

He Thr Gly 

Ser Leu Thr 

Phe Lys Arg 
455 

Ser Val Gly 

470 
Lys Asp Leu 
485 

Arg His Str 



Vai He 
105 
Lys Met 
200 

Oys Asp 

Lys Val 

He Ala 

Phe Sor 
265 
Asp Asn 
280 

Phe Ssr 

Gly Ser 

Tyr Asn 

Lys Tyr 
345 
Trp Thr 
360 

Ala Asn 
Thr Thr 
Val Ser 



Leu 
10 
Leu 

Arg 

Leu 

Asp 

Glu 
90 
Leu 

Gin 

Scr 

Ala 

Phe 
170 
Thr 



Ser 

Val 

Arg 

Ala 

Ser 
75 
Leu 

Gly 

Lys 

Thr 

Phe 
155 
Thr 



Trp Leu Leu 
Ala 
His 



Lys 

60 

Tht 

Clu 

Gin 

Ser 

Val 
140 
Pro 



Ala Glu 
30 

Ala Ala 
45 

Pro He 



Gly Ser 

Asp He 

Asi: Arg 
110 
Arq Leu 
125 

Tyr Thr 



Pre 
Asn 
His 



Asp 
250 
Asn 

Arg 

Asp 

Trp 

Gly 
330 
Val 

Hlr; 

Val 

Tyr 



Gin 

Asp 

His 

Pro 
235 
Het 

Lys 

Tyr 

Tyr 

Gly 
315 
Ser 

Pro 

Ser 

Asn 



Tyr 

Gin 

Asn 

Asp 
220 
Glu 

Pro 

Tyr 

Gly 

He 
300 
He 

Phe 

Gly 

Gin 



Val Phe 
Asn Phe 



Gly Glu 
190 
He Met 
205 

His Pro 

Lys Asp 

Leu Pro 

Ser Ser 
270 
Phe Tyr 
205 

Asp Leu 
Ser Ala 
Glu Ala 



Ser Met 
4 25 
Leu Pro 
440 

Lys Thr 
Tyr Ser 
Leu Gin 



Tyr 
410 
Asp 



Asn Leu Thr 

He Arg Lys 

535 

Thr Thr Tyr 

550 
Leu Ser Thr 
565 

Phe Gly Gly 
Sar P!ie Ser 



Val Pro 
505 

Met Gly 
520 

Ser Trp 

L/a Phe 

Thr Leu 

Asn Leu 
585 
Tyr Met 



Asn 

Arg 

Gly 

Ser 
4 90 
He 

Val 

Asn 

Arg 

Tyr 
570 
lie 

Pro 



Asp 
395 
Ser 

He 

Leu 

Val 

Gin 
475 
Asn 

Ser 

Asn 

Glu 

Arg 
555 
Gly 

Met 

Asp 



Phe 
380 
Val 

Arg 

Ser 

Ser 

Gly 
460 
Leu 

Leu 

Leu 

Tyr 

Asp 
540 
Leu 

Met 

He 

Phe 



Asp Tyr 
350 
Asp Pro 
365 

Ala Thr 



Asn Ala 

Lys Phs 

Gin Asn 
430 
He Asn 
445 

Pro Glu 

Arg Asn 

Val Arg 

Thr Val 
510 
Asn Glu 
525 

Lys Lys 
Tyr Asp 
Phe Lys 



Pro Val 
15 

Arg Pro 

Leu Ser 

Pro Asp 

>lec Arg 

80 
Val Glu 
95 

Ala Tyr 

Glu Ala 

Arg Tyr 

Lys Asp 
IGO 
Arq Thr 
175 

Gly Tyr 

Phe Met 

His Phe 

He Val 
;¥40 
He Gly 
255 

Gly He 

Leu Arg 

Ala Leu 

Gin Ser 
320 
Asn Tyr 
335 

Ser Lys 

Lys Ala 

Gly Stir 

Arg Thr 
4 00 
Pro Gly 
415 

Met Arg 

Met Ser 

Arg Trp 

Ser He 
480 
Asp Trp 
495 

Pro Leu 



Arg Hlfc- 
590 
Thr Lys 



Trp Trp 

Thr Phe 

Tyr Ser 
560 
Pro Trp 
575 

Arg Phe 



Arg Arg 



SUBSTTTUTE SHEET (Rule 26) (RO/AU) 



wo 99/29870 PCT/AU98/01 0Z3 

432/490 



5?5 500 605 



Tyr 


Gly 


Phe 


Trp 


Gltj 


Leu 


L-u 


Glu 


His 


Thr 


Asp 


Gin 


Asn 


Gly 


Lys 


Leu 


610 










615 










620 










His 


The 


Leu 


Leu 


Tyr 


Sec 


Pco 


Tyr 


Phe 


Glu 


Gin 


lie 


Phe 


Gly 


Ala 


Pro 


62& 










630 










€35 










640 


Sec 


Met 


Gly Asn 


Ala 


Gly 


Ser 


Val 


Asn 


Phe 


Ser 


Phe 


Asp 


Asn 


Asn 


Leu 




















650 










655 




Glu 


Aia 


Lys 


He 


Lys 


Ser 


Lys 


Ser Asp 


Str 


Thr 


Gly 


He 


Lys 


Lys 


He 








660 










665 










6V0 






Ser 


Leu 




Acp 


Gin 


Phe 


Thr 


Trp Sec 


Thr 


Ser 


Tyr 


Asn 


Met 


Phe 


Ala 






67 S 










680 










685 








Affp 


Ser 


He 


Arq 


Trp 


Ser 


Afn 


He 


Ser 


Ala 


Ser 


L«u 


Ala 


Leu 


Arg 


Leu 














655 










700 










Ser 


Lys 


Ser 


Phe 


Thr 


Leu Arg 


Leu 


Ser 


Gly Leu 


Phe 


Asp 


Pro 


Tyr 


Leu 


705 










710 










715 










720 


Thr 


Lys 


Tyr 


Tyr 


Glu 


Gly Glu 


Asp 


Gly 


Lys 


He 


He 


Pro 


Tyr 


Lys 


Ser 










725 










7 30 










735 




Aen 


Af p 


Leu 


Arg 


He 


Phe 


Asn 


Gly Lys 


Gly 


Leu 


Ala 


Arg 


Leu 


He 


Ser 








7-JO 










745 










750 






Thr 


Giy 


Thr 


Ser 


Phe 


Ser 


Tyr 


Thr 


Leu 


Asn 


Ly.- 


Glu 


Ser 


Leu 


Ser 


Gly 






755 










760 










7 55 








LiU 


He 


Ala 


Leu 


Phe 


Ser 


Gly 


Lys 


Lys 


Glu 


A,cg 


Arg 


Asp Glu 


Lyc 


Lye 




770 










77S 










780 










Asn 


Thr 


Gly 


Ala 


Thr 


Pro 


Hi? 


Glu 


Gly Asp 


Asp 


Ala 


Ala 


Asp 


He 


Leu 


705 










7 90 










795 










900 


G)M 


Gly 


51 y 


Arg 


Pro 


Gin 


Aj?n 


Glu 


Ser 


'3iy 


Gly 


Ser 


Leu 


Leu 


Glu 


Arg 










805 










010 










815 




Asn 


Arg 


Gin 


Gly 


Gly 


Ala 


Val 


Asp 


Gin 


Asp 


Gly 


Tyr 


Phe 


Ala 


Tyr 


Ser 








820 










825 










830 






He 


Pro 


Trp 


Ser 


Leu 


Ser 


Phe 


Asp 


Tyr 


Ser 


Trp 


A^n 


He 


Ala 


Thr 


Aiip 






835 










840 










845 








Tyr 


Asn 


Arg 


Tyr 


Asn 


Val 


Affn 


Lys 


Met 


Glu 


His 


Tyr 


Tyr Arg 


Val 


Thr 




850 










855 










660 










Gin 


Asn 


Leu 


Ser 


Phe 


Arq 


Gly Asn 


He 


Gin 


Pco 


Thr 


Pro 


Asn 


Tcp 


Ser 


865 










870 










875 










880 


Phfi 


Gly 


Phe 


Asn 


Ala 


Asn 


Tyr Asn 


Phe 


Arp 


Leu 


Lys 


Lys 


li- 


Thr 


Ser 










895 










8 90 










8 95 




Leu 


Thr 


Cys 


Asn 


Val 


Thr 


Arg 


Asp 


Met 


His 


Cys 


Trp 


Ala 


ne 


Ser 


Ala 








900 










905 










910 






Ser 


Phe 


He 


Pro 


He 


Gly 


Ala 


Tyr 


Lys 


Sor 


Tyr 


Asn 


Phe 


Val 


He 


Ser 






915 










920 










925 








Vai 


Lys 


Scr 


Ser 


Leu 


Leu 


Gin 


Asp 


Leu 


Lys 


Tyr 


Gin 


Gin 


Ser 


Asn 


Ary 




930 










935 










940 










Pro 


He 


Thr 


Asn 


Thr 


Txp 


Tyr 




















945 










950 






















(2) 


Id FORI UKVlOU 


FOR 


SEQ 


ID (10:484 

















ti) SEQUEIJCE CHARACTERISTICS: 

{A} LENGTH: 1226 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



Ul) MOLECULE T^'PE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAIUSH: Porphyromonaa gingivalis 

FEATURE: 

(A) NAIIE/KEV: mii-c_£ea ture 

(B) LOCATION 1. . .1226 

(xil SEQ'JEUCE DESCRIPTIOti: SZQ ID 110:484 

net net Lyr Arg Tyr Thr He He . Leu Ala Val Phe Leu Leu Phe Cy^- 

IS 10 15 

Thr Val Phe Thr Phe Gin He Lys Aia Arg Pre Tyr-Glu Arg Phe Ala 

20 25 30 

Asp Val Glu Lys Pro Trp He Gin Lys His Sec Met Asp Ser Lyc Leu 

35 40 45 

Val Pro Ala Asn Lys Gly A^n Leu He Gin AU Glu He Val Tyr Gin 

50 55 60 

Ser Val Ser Glu His Ser A^p Leu Val II- Ser Pro Val Arn Glu He 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



65 










70 










75 










80 


Arg 


Pro 


Ala 


Asn 


Arg 


Phe 


Pro 


Ser 


His 


Arg 


Lys 


Ser 


Phe 


Phe 


Ala 


Glu 








85 










90 










95 




Asn 


Leu 


Arg 


Ala 


Ser 


Pro 


Pro 


Val 


Val 


Pro 


Val 


Ala 


Val 


Asp 


Lys 


Tyr 






100 










105 










110 






Ala 


Val 


Pro 


Vai 


Ala 


Asn 


Pro 


Hot 


A#fp 


Pro 


Glu 


Asn 


Pro 


Asn 


Ala 


Trp 






lib 










120 










125 








Asp 


Val 


Thr 


Leu 


Lys 


He 


Thr 


Thr 


Lys 


Ala 


Val 


The 


VaL 


Pro 


Vai 


Asp 


13Q 










135 










140 










VaL 


Val 


Met 


Val 


lie 


Asp 


Gin 


Ser 


Ser 


Ser 


Met 


Gly Gly Gin 


Asn 


He 


145 










150 










155 










160 


Ala 


Arg 


Leu 


Lys 


Ser 


Ala 


He 


Ala 


Ser 


Gly Gin 


Ary 


Phe 


Val 


Lys 


Lys 










165 










170 










175 




Met 


Leu 


Pro 


Ly? 


Gly Thr Ala 


Thr 


Glu 


Gly Val 


Arg 


He 


Ala 


Leu 


Val 








180 










185 










190 






Ser 


Tyr 


Asp 


His 


Glu 


Pro 


His 


Arg 


Leu 


Ser Asp 


Phe 


Thr 


Lys 


Asp 


Thr 






195 










200 










205 








Ala 


Phe 


Leu 


cyr 


Gin 


Lys 


He 


Arq 


Ala 


Leu 


Thr 


Pro 


He 


Trp 


Gly 


Thr 




21P 










215 










220 










His 


Thr 


Gin 


Gly Gly 


Leu 


Lye 


Hut 


Ala 


Arg 


Acn 


lis 


llet 


Ala 


Thr 


Ser 


225 










230 










235 










240 


Thr 


Ala 


Val 


Asp 


I.ys 


HI.- 


He 


lie 


Leu 


llet 


S-sr 


Asp 


Gly 


Leu 


Aia 


Thr 










245 










250 










255 




Glu 


Ciln 


Tyr 


Fro 


Val 


Lys 


Asn 


Val 


Thr 


Thr 


Ala 


Asp 


Phe 


13- 


Gly 


Lys 






260 










2G5 










270 






Thr 


Gly Asn 


Al a 


Asn 


Asp 


Pro 


He 


Asp 


Leu 


Val 


He 


Gin 


Gly 


Ala 


He 






275 










280 










2G5 








Affn 


Phe 


Pro 


Thr 


Asn 


Tyr 


Val 


Ser 


Asn 


Asn 


Pro 


Scr 


Thr 


Pro 


Leu 


Thr 




290 










295 










300 










Pro 


Asn 


Tyr 


Pro 


Thr 


His 


Ser 


Sec 


Lys Val 


Gly 


Arg Arg Asn 


Leu 


Pro 


305 










310 










315 










320 


Glu 


Ser 


Ly3 


Phe 


Asp 


Tyr 


Ser 


Asn 


Leu 


Sor 


Ala 


Arg 


He 


Thr 


Phe 


Asp 








325 










330 










335 




Gly Val 


Ala 


Gly 


Ala 


Leu 


Val 


Tyr 


Glu 


Pto 


Arq 


Phe 


Pro 


His 


Pro 


Tyr 








340 










345 










350 






Tyr 


Tyr 


Tyc 


Phe 


Pro 


Cys 


Asn 


Ala 


Ala 


He 


Asn 


Glu 


Ala 


Gin 


Phe 


Ala 






355 










360 










365 








Lys 


Asn 


Ssr 


Gly Tyr 


Thr 


He 


His 


Thr 


He 


Gly 


Tyr 


Asp 


Leu 


Gly 


Asp 




370 










375 










380 










Phe 


Ala 


Leu 


Ala 


Asn 


Asn 


Ser 


Leu 


Lys 


Leu 


Thr 


Ala 


Thr Asp 


Glu 


Asn 


385 










390 










395 










4 00 


His 


Phe 


Phe 


Thr 


Ala 


Thr 


Pro 


Ala 


Ajsn 


Leu 


Ala 


Ala 


Ala 


Phe 


Asp 


Asn 










405 










410 










415 




lis 


Ala 


Gin 


Thr 


He 


Asn 


He 


Gly 


He 


Gin 


Arg 


Gly Glu Val 


Thr 


Asp 








420 










425 










430 






PJie 


Val 


Ala 


Pro 


Gly 


Phe 


He 


Val 


Lys 


Asn 


Leu 


Thr 


Gin 


Ser 


Gly 


Asp 






435 










440 










445 








Val 


Thr 


His 


LfeU 


Leu 


Asn 


Val 


Ser 


Asn 


Gly Thr 


Val 


His 


Tyr 


Asp 


Val 




450 










455 










460 










Ser 


Thr 


Lys 


Lys 


Leu 


Thr Trp 


Thr 


TUr 


Gly 


Thr 


He 


Leu 


Ser 


Ser 


Ssr 


4 65 










470 










475 










480 


GXu 


Ala 


Thr 


lie 


Thr 


Tyr 


Arg 


He 


Tyr Ala 


Asp 


Leu 


Asp 


Tyr 


He 


Gin 










4 85 










490 










495 




Ai"n 


Asn 


Asp 


He 


Pre 


Val 


Asn 


Thr 


Thr 


Ser 


Ala 


He 


Gly Fro 


Asp 


Leu 








500 










505 










510 






Gly 


Gly 


Phe 


Asp 


Thr 


Asn 


Thr 


Glu 


Ala 


Lys 


Leu 


Thr 


Tyr 


Thr 


Asn 


Ser 






515 










520 










525 








Pisn 


Gly 


Glu 


Pro 


Asn 


Gin 


Gin 


Leu 


H« 


Phe 


Pro 


Arg 


Pro 


Thr 


Val 


Lys 




530 










535 










540 










Leu 


Gly 


Tyr 


Gly Val 


He 


I.ys 


Arg 


His 


Tyr Val 


Leu 


Val 


Asn 


Lys 


Asp 


545 










550 










555 










560 


Gly 


Gin 


Pro 


lie 


Gin 


Ala 


Asn 


Gly Thr 


Val 


Val 


Ser 


Ser 


Leu 


Ser 


Glu 








565 










570 










575 




Ala 


His 


Val 


Leu 


Gin 


Ser 


Gin 


P^p 


Phe 


Phe 


Leu 


Pro 


Ser 


Gly Gly 


Gly 








580 










585 










590 






His 


lie 


Val 


Pro 


Lys 


Trp 


He 


Lys 


Leu Asp Lys 


Thr 


Thr 


Glu 


Ala 


Leu 






595 










600 










605 








Gin 


Tyr 


Tyr 


Ser 


Val 


Pro 


Pro 


Thr 


Asn 


Thr 


Val 


He 


Thr 


Thr 


Ala 


Asp 




610 








615 










620 








Gly Lys 


Arg 


'fyr 


Arg 


Phe 


Val 


Glu 


Val 


Pro Gly 


Ser 


Thr 


Pro 


A.^n 


Fro 


62S 










630 










635 










6^10 


Gly Gin Tie 


Gly 


He 


Ser 


Trp 


Lys 


Lys 


Pro 


Ala 


Gly Asn Ala 


Tyr 


Phe 










645 










650 










655 




Ala 


Tyr 


Lys 


Leu 


Leu 


Asn 


Tyr 


Trp 


Met 


Gly 


Gly 


Thr Thr Asp Gin 


Gin 








660 










665 










S70 
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Ser Glu Trp Asp V;il Thr Ser Asn Trp Thr Gly Ala Gin Val Pro Leu 

675 680 685 

Thr Gly CSlu Asp Val Glu Phe Ala Thr Thr Glu Asn Phe Gly S«r Pro 

690 695 TOO 

Ala Val Ala Asp Lsu His Val Pro The Thr Asa Pro Lys lie lie Gly 
705 110 7x5 720 

A^n Leu Ilfl Asn Asn Sec Asp Lys Asp t>eu Val Val Thr Thr Ser Ser 

725 730 T35 

Gin Leu Thr He Asn Gly Val Val Glu Asp Asn Acn Pro Asn Val Gly 

740 745 750 

Thr He Val Val Lys 5er Ser Lys Arp Asn Pro Thr Gly Thr Leu Lsu 

7S5 760 765 

Pli» Ala Asn Pro G] y Tyr Asn Gin Asn Val Gly Gly Thr Vai Glu Phe 

770 775 7 80 

Tyr Asn Gin Gly Tyr Asp Cyc Ala Asp Cys Gly Met Tyr Arg Arg Ser 
785 790 795 800 

Trp Gin Tyr Phe Gly He Pro Val Asn Glu Ser Gly Phe Pro He Asn 

805 810 815 

Asp Val Gly Gly Asn Glu Thr Val Asn Gin Trp Val Glu Pro Phe Atii 

920 825 830 

Gly Asp L/^ Tcp Arg Pro Ala Pro Tyr Ala Pro Asp Thr Glu r.eu Gin 

835 840 815 

Lys Phe Lys Giy Tyr Gin He Thr Asn Asp Val Gin Ala Gin Pro Thr 

850 855 860 

Gly Val Tyr .<^er Phe Lys Gly Mat He Cys V.:il Cys Asp AT n Phe Leu 
865 870 675 880 

Asn Leu Thr Arg Thr Ser Gly Val A.sn Tyr Ser Gly Ala Asn Leu He 

805 890 895 

Gly Asn Scr Tyr Thr Gly Ala lis Asp He Lys Gin Gly He Val Phe 

900 905 910 

Pro Pro Glu Val Glu Gin Tlir Val Tyr Leu Phe Asn Thr Gly The Ary 

915 920 925 

Asp Gin Trp Arg Lys Leu Asn Gly Ser Thr Val Ser Gly Tyr Acg Ala 

930 935 94 0 

Gly Gin Tyr Leu Sec Val Pro Lys Asn Thr Ala Gly Gin Asp Asn Lsu 
945 950 955 960 

Pro Asp Arg He Pro Ser Met His Ser Phe Lcmi Val Lys Met Gin Asn 

965 970 97 5 

Gly Ala Ser Cys Thr Leu Unk He Leu Tyr Asp Lys Leu Leu Lys Asn 

900 985 990 

Thr Thr Val Asn Asn Gly Asn Gly Thr Gin He Thr Trp Arg Ser Gly 

995 1000 1005 

Asn Sex Gly Ser Ala Asn Met Pro Ser Leu Val Met Asp Val Leu Gly 

1010 1015 1020 

Asn Glu Ser Ala Asp Arg Leu Trp He Phe Thr Arsp Gly Gly Leu Ser 
1025 1030 1035 1040 

Phe Gly Ph« Asp Asn Gly Trp Asp Gly Arg Lys Leu Thr Glu L/s Gly 

1045 1050 1055 

Leu Ser Gin Leu Tyr Ala Het Set Asp He Cly Asn Asp Lys Phe Gin 

1060 1065 1070 

Vai Ala Giy Val Pro Glu Leu Asn Asn Leu Leu He Gly Phe Asp Ala 

1075 1080 1085 

Asp Lys Asp Gly Gin Tyt Thr Leu Glu Phe Ala Leu Ser Asp His Phe 

1090 1095 1100 

Ala Lys Gly Ala Val Tyr Leu His Asp Leu Gin Scr Gly Ala hys Ills 
1105 1110 1115 1120 

Arg He Thr Asn Ser Thr Ser Tyr Ser Phe Asp Ala Ly= Arg Gly Asp 

1125 1130 1135 

Ser Gly Ala Arg Phft Arg Leu Ser Tyr Gly Cys Asp Glu Asn Val Asp 

1140 1145 1150 

Asp S-r Ilts Val Val Sec Thr Asn Gly Arg Glu He lie He Leu Asn 

J. 155 1160 1165 

Gin Asp Ala Leu Asp Cy.-^ Thr Val Thr Leu Phe Thr He Glu CJ y Lys 

1170 1175 1180 

Leu Lou Arg Arg Lau Lys Val Leu Alo Gly His Arg Glu Val Met Lys 
1185 1190 1195 1200 

Val Gin Thr Gly Gly Ala Tyr He Val His Leu Gin Arn Ala Ph* Thr 

1205 1210 1215 

Asn Asp Val His Lys Val Leu Val Glu Tyr 
1220 1225 

{21 IKrORJIATIOIvI FOR SF.Q ID NO: 4 85 

(1) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 1225 amino acids 
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wo 99/29870 



PCT/AU98/01023 



435/ 



490 



5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHIST-1: Po tphyromonas gingivalif 

lix) b-EATURE: 

tA) HAi-l£/KEY; inisc_f eature 

(B) LOCATiai 1 . , .1225 



txi) SEQUEMCE DBSCftlPTIOM: SEg ID »0:485 



Met 


Lys 


Arq 


Tyr 


Thr 


He 


Il-i 


Leu 


Ala 


Val 


Phe 


Leu 


Leu 


Phe 


Cys 


Thr 


1 








5 










10 










lb 




Val 


Phe 


Thr 


Phe 


Gin 


He 


Lys 


Ala 


Arg 


Pro 


Tyr 


Glu 


Arg 


Phe 


Ala 


Asp 








20 










25 










30 






Val 


Glu 


Ly? 


Pro 


Trp 


He 


Gin 


Lys 


His 


Ser 


Met 


Asp 


Ser 


Lys 


Leu 


Val 






35 










40 










45 








Pro 


AlA 


A.^n 


hys 


Gly 


Afrn 


Leu 


He 


Gin 


Ala 


Glu 


He 


Val 


Tyr 


Gin 


Ser 




50 








55 










60 










Vnl 


Scr 


Glu 


His 


Ser 


Asp 


Leu 


Val 


He 


Ser 


Pro 


Val 


Asn 


Glu 


He 


Ary 


65 










70 










75 










80 


Pro 


Ala 


Asn 


Arg 


Phe 


Pro 


Ser 


His 


Arg 


Lys 


Sor 


Phe 


Phe 


Ala 


Glu 


Asn 










05 










90 










95 




Le.u 


Arg 


Ala 


Ser 


Pro 


Pro 


Val 


Val 


Pro 


Val 


Ala 


Vai 


Asp 


Lys 


Tyr 


Ala 








100 










105 










110 






Val 


Pro 


Val 


Ala 


Asn 


Pro 


Met 


Asp 


Pro 


Glu 


Asn 


Pro 


Asn 


Ala 


Trp 


Asp 






115 










120 










125 








Val 


Thr 


Leu 


Lys 


He 


Thr 


Thr 


Lys 


Ala 


Val 


Thr 


Val 


Pro 


Val 


Asp 


Val 




130 










135 










140 








Val 


Het 


Val 


He 


A£p 


Gin 


Scr 


Ser 


Ser 


Met 


Gly 


Gly 


Gin 


Asn 


He 


Ala 


145 










150 










155 










150 


Arg 


L-u 


Lys 


Ser 


Ala 


He 


Ala 


Ser 


Gly 


Gin 


Arg 


Phe 


Val 


Lys 


Lyc 


Mot 










165 










170 










1/b 




Leu 


Pro 


Lys 


Gly 


Thr 


Ala 


Thr 


Glu 


Gly 


Val 


Arg 


He 


Ala 


Leu 


Val 


Ser 








180 










195 










130 






Tyr 


Asp 


His 


Glu 


Pro 


Kis 


Arg 


Leu 


Ser 


Asp 


Phe 


Thr 


Lys 


Asp 


Thr 


Ala 






195 










200 










205 








Phe 


L«u 


Cys 


Gin 


Lys 


He 


Arq 


Ala 


Leu 


Thr 


Pro 


He 


Trp 


Gly 


The 


His 




210 










215 










220 










Thr 


Gin 


Gly 


Gly 


Leu 


Lys 


Met 


Ala 


Arg 


Asn 


He 


Met 


Ala 


Thr 


Ser 


Thr 


225 










230 










235 










240 


Ala 


Val 


Asp 


Lyc 


His 


Tie 


He 


Leu 


Met 


Ser 


Asp 


Gly 


Leu 


Ala 


Thr 


Glu 










245 










250 










255 




Gin 


Tyr 


Pro 


Val 


Lys 


As-n 


Val 


Thr 


Thr 


Ala 


A^p 


Phs 


He 


Gly 


Lys 


Thr 








260 










26b 










270 




Gly 


As-n 


Ala 


Arn 


Ar«p 


Pro 


He 


Asp 


Leu 


Val 


He 


Gin 


Gly 


Ala 


He 


Asn 






2*7 5 










280 










285 








Phe 


Pro 


Thr 


Asn 


Tyr 


Val 


Ser 


Asn 


Asn 


Pro 


Sqt 


Thr 


Pro 


Leu 


Thr 


Pro 




290 










295 










300 










Asn 


Tyr 


Pro 


Thr 


His 


Ser 


S«r 


Lys 


Val 


Gly 


Arg 


Arg 


Asn 


Leu 


Pru 


Glu 


305 










310 










315 










320 


Ser 


Ly5 


Phe 


Asp 


Tyr 


Ser 


Pisn 


Leu 


Ser 


Ala 


Arg 


Tic 


Thr 


Phe 


Asp 


Gly 










325 










330 










335 




Val 


Ala 


Gly 


Ala 


Leu 


Val 


Tyr 


Glu 


Pro 


Arg 


Phe 


Pro 


His 


Pro 


Tyr 


Tyr 








340 










345 










350 


Tyr 


Tyr 


Phe 


Pro 


Cys 


Asn 


Ala 


Ala 


He 


Acn 


Glu 


Aln 


Gin 


Phe 


Ala 


Lys 






35S 










360 










365 






Asn 


S«r 


Gly 


Tyr 


Thr 


He 


His 


Thr 


He 


Gly 


Tyr 


Asp 


Leu 


Gly 


Asp 


Phe 




37 0 










37 5 










380 










Ala 


Leu 


Ala 


Asn 


Asn 


Ser 


Leu 


Lys 


L«u 


Thr 


AlA 


Thr 


Asp 


Glu 


Asn 


His 


385 










390 










395 










400 


Phe 


Phe 


Thr 


Ala 


Thr 


Pro 


Ala 


Asn 


Leu 


Ala 


Ala 


Ala 


Phe 


Asp 


Asn 


lis 










4 05 










410 










415 




Ala 


GLn 


Thr 


He 


Asn 


He 


Gly 


He 


Gin 


Arg 


Gly 


Glu 


Val 


Thr 


Asp 


Phe 








420 










425 










430 




Val 


Ala 


Pro 


Gly 


Phe 


He 


Val 


Lys 


Asn 


Leu 


Thr 


Gin 


Ser 


Gly 


Asp 


Val 






435 










440 










445 






Thr 


His 


Leu 


Leu 


Asn 


Val 


Ser 


Asn 


Gly 


Thr 


Val 


His 


Tyr 


Ai-p 


Val 


Ser 




450 










455 










460 











Thr Lyj Lys Leu Thr Trp Thr Thr Gly Thr He Leu Ser Ser Ser Glu 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



«— - jini. _^ ^"-li fl«=ii jtJ' ir=^i .ti*' ^==ii il"^ ■a"^! tr^h 



WO 99/29870 PCT/AU98/01023 

436/490 



4 65 










47 0 










47 5 










4 80 


Ala 


Thr 


lie 


Thr 


Tyr 


Arg 


lie 


Tyr 


Ala 


Ajp 


Leu 


Asp 


Tyr 


He 


Gin 


Asn 










4 85 










4 90 










495 




Asn 


Astp 


lie 


Pro 


Val 


Asn 


Thr 


Thr 


Scr 


Al n 


1 1 n 


Gly 


Pro 


Asp 


Leu 


Gly 








500 










505 










510 






Gly 


Phe 


Asp 


Thr 


Asn 


Thr 


Glu 


Ala 


Lys 


Leu 


Thr 


Tyr 


Thr 


Asn 


Ser 


Asn 






515 










520 










525 








Gly 


Glu 


Pro 


Asn 


Gin 


Gin 


Leu 


He 


Phe 


Pro 


Axg 


Pre 


Thr 


Val 


Lys 


Leu 




530 










535 










5'JO 










Gly 


Tyr 


Gly 


Val 


He 


Lys 


Arg 


His 


Tyr 


Val 


Leu 


Val 


Asn 


Lys 


Asp 


Gly 


54S 










550 










SS5 










560 


Gin 


Pro 


lie 


Gin 


Ala 


Asn 


Gly 


Thr 


Val 


Val 


Ser 


Ser 


Leu 


Ser 


Glu 


Ala 










565 










570 










575 




His 


Val 


Leu 


Gin 


Ser 


Gin 


Asp 


Phe 


Phe 


Leu 


Pro 


Ser 


Gly 


Gly Gly 


His 








580 










595 










590 






lis 


Val 


Pro 


Lyc 


Trp 


lie 


l.y.^ 


Leu 


Asp 


Lys 


Thr 


Thr 


Glu 


Ala 


Leu 


Gin 






595 










600 










605 








Tyr 


Tyr 


Ser 


Val 


Fro 


Pro 


Thr 


Asn 


Thr 


Val 


He 


Thr 


Thr 


Ala 


Asp 


Gly 




610 










615 










620 










Lys 


Arg 


Tyr 


Arg 


Phc 


Val 


Glu 


Val 


Fro 


GlV 


Ser 


Thr 


Pro 


Asn 


Pro 


Gly 












630 










635 










640 


Gin 


Il« 


Gly 


He 


Ser 


Trp 


Lys 


Lys 


Pro 


Ala 


Gly 


Asn 


Ala 


Tyr 


Phc 


Ala 










645 










650 










655 




Tyr 


Lys 


l.eu 


Lsu 


Ai?n 


Tyr 


Trp 


Met 


Gly 


Gly 


Thr 


Thr 


Asp 


Gin 


Gin 


Ser 








660 










665 










670 






Gla 


Trp 


Arp 


Val 


Thr 


Ser 


Atn 


Trp 


Thr 


Gly Ala 


Gin 


Val 


Pro 


Leu 


The 






675 










680 










ees 








Gly 


Glu 


Asp 


Val 


Glu 


Phe 


Al a 


Thr 


Thr 


Glu 


Asn 


Phe 


Gly Ser 


Pro 


Ala 




690 










695 










700 










Val 


Ala 


Asp 


Leu 


His 


Val 


Pro 


Thr 


Thr 


Asn 


Pro 


Lye 


He 


He 


Gly 


Asn 


705 










710 










715 








720 


Leu 


lie 


Asn 


Asn 


Scr 


Asp 


Lys 


Asp 


Leu 


Val 


Val 


Thr 


Thr 


Ssr 


Ser 


Gin 










725 










730 










735 




Leu 


Thr 


He 


Ajsn 


Gly Val 


Val 


Glu 


Asp 


Asn 


Asn 


Pro 


Asn 


Val 


Gly 


Thr 








740 










745 










750 






Il« 


Val 


Val 


Lys 


Ser 


Scr 


Lys 


Asp 


Asn 


Pro 


Thr 


Gly 


Thr 


Leu 


Leu 


Phe 






755 










760 










765 








Ala 


Asn 


Pro 


Gly Tyr 


Asn 


Gin 


Asn 


Val 


Gly 


Gl y 


Thr 


Val 


Glu 


Phe 


Tyr 




770 










775 










780 










Asn 


Gin 


Gly 


Tyr 


Asp 


Cys 


Ala 


Asp 


Cys 


Gly Met 


Tyr 


Arg 


Arg 


Ser 


Trp 


785 










790 










795 










800 


Gin 


Tyr 


Phe 


Gly 


lis 


Pro 


Val 


Acn 


Glu 


Ser 


Gly 


Phe 


Pro 


He 


Asn 


Asp 










805 










810 










83 5 




Val 


Gly 


Gly 


Asn 


Glu 


Thr 


Val 


Asn 


Gin 


Trp Val 


Glu 


Pro 


Phe 


Asn 


Gly 








aao 










825 










830 






Asp 


Lyf 


Trp 


Arg 


Pro 


Ala 


Pro 


Tyr 


Ala 


Pro 


Asp 


Thr 


Clu 


Leu 


Gin 


Lys 






035 










840 










845 








Phe 


Lyf 


Gly 


Tyr 


Gin 


He 


Thr 


Asn 


Asp 


Val 


Gin 


Ala 


Gin 


Pro 


Thr 


Gly 




850 










855 










860 










Val 


Tyr 


Ser 


Phe Lys Gly 


Met 


He 


Cys 


Val 


Cy.T 


App 


Ala 


Phe 


Leu 


Asn 


865 










870 










075 










830 


Leu 


Thr 


Arg 


Thr 


Ser Gly 


Val 


Asn 


Tyr 


Ser 


Gly 


Ala 


Asn 


Leu 


H« 


Gly 










885 










8 90 










895 




Ajrn 


S-r 


Tyr 


Thr 


Gly 


Ala 


He 


Asp 


He 


Lys 


Gin 


Gly 


He 


Val 


Phc 


Pro 








900 










905 










910 






Pro 


Glu 


VdI 


Glu 


Gin 


Thr 


Val 


Tyr 


Leu 


Phe 


Asn 


Thr 


Gly 


Thr 


Arg 


Asp 






915 










920 










925 






Gin 


Trp 


Arg 


Lys 


Leu 


Asn 


Gly 


Ser 


Thr 


Val 


Ser 


Gly Tyr Arg Ala 


Gly 




930 










935 










940 










Gin 


Tyr 


Leu 


Ser 


Val 


Pro 


Lys 


Asn 


Thr 


Ala 


Gly 


Gin Asp Asn 


Leu 


Pro 


945 










950 










955 










960 


A£p 


Arg 


lie 


Pro 


S^r 


Met 


His 


S-r 


Phe 


Leu 


Val 


Lys 


Met 


Gin 


Asn 


Gly 










965 










97 0 










975 




Ala 


Ser 


C^-s 


Thr 


Leu 


Unfc 


He 


Leu 


Tyr 


Asp 


Lys 


Leu 


Leu 


Lys 


Asn 


Thr 








980 










985 










990 






Tht 


Vil 


Asn 


Asn 


Gly Asn 


Gly 


Thr 


Gin 


He 


Thr 


Trp 


Arg 


Ser 


Gly 


Asn 






995 










1000 








1005 






Ser 


Gly 


Ser 


Ala 


Asn 


Met 


Pro 


Ser 


Leu 


Val 


Kct 


Asp 


Val 


Leu 


Gly 


Asn 




1010 








1015 








.1020 








Glu 


Ser 


Ala 


A^p 


Arg 


Leu 


Trp 


He 


Phe 


Thr 


Asp 


Gly Gly 


Leu 


Ser 


Phe 


1025 








1030 








1035 








1040 


Gly 


i?he 


Asp 


Asn 


Gly 


Trp Asp 


Gly Arg 


Lys 


Lau 


Thr 


Glu 


Lys 


Gly 


Lsu 










1045 








1050 








1 055 


Ser 


Gin 




Tyr 


Ala 


Met 


Ser 


Asp 


He 


Gly 


Asn 


Asp 


Lys 


Phe 


Gin 


Val 



1060 1065 10/0 
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Ala Gly Val Pro Glu Leu Asn Asn L«u Lau lift Gly Phe A5!p Ala Asp 

1075 1080 1085 

Lys Asp Gly Gin Tyr Thr Leu Glu Phe Ala Leu Ser Aep His Phe Ala 
1090 1095 1100 

5 Ly^; Gly Ala Val Tyr Leu His A5tp Leu Gin Ser Gly Ala Lys His Arg 

1105 1110 1115 1120 

lie Thr A*n Ser Thr Ser Tyr Ser Ptie Asrp Ala hya Arg Gly Asp Sec 

1125 1130 1135 

Gly Ala Acq Fho Acq Leu Ser Tyr Gly Cys A;?p Glu Asn Val Asp Asp 
10 1140 1145 IISO 

Ser His Val Val Ser Thr Asn Gly Arg Glu lie lie Ii« Leu Asn Gin 

1155 1160 1165 

Affp Ala Leu Aep Cys Thr Val Thr Leu Phe Thr lie Glu Gly Lys Leu 
1170 1175 IIBO 

15 Leu Arg Arg Leu Lys Val Leu Ala Gly His Arg Glu Val Met Lys Val 

1185 1130 1195 1200 

Gin Thr Gly Gly Ala Tyr He Val HI.-; Leu Gin Asn Ala Phe Thr Asm 

1205 1210 1215 

Arp Val His Lys Val Leu Val Glu Tyr 
20 1220 1225 

(2) iriFOWIATIOH FOR SEQ ID MO: 4 86 

a) SEQUEJ/CE CHARACrrERISTICS: 
25 (A| LEIIGTll: ^25 amino acld.=? 

fU) TYPE: amino acid 
(D) TOPOLOGi': linear 



(ii) MOLECULE TYPE: prot.ein 
Uii) HYPOTHETICAL: YES 

(vi) ORIGIHAL SOURCE: 

(A) ORGAl-HS]-!: Porphyromonn.T gingival 

(iJt) FEATURE: 

(IK) KAWE/KEY: misc_feature 

(B) LOCATIOM 1. . . 425 



40 ixi) SEQUEirJE DESCRIPTION: SEQ ID NO: 486 



45 



Met 


Glu 


Val 


Lys 


Lys 


Asn 


Thr 


Val 


Val 


Leu 


Arg 


Leu 


Leu 


He 


Trp 


Phe 


1 








S 










10 










15 




Val 


Ala 


He 


Leu 


Leu 


Phe 


His 


Ser 


Ser 


Arg 


Lou 


Trp 


Gly Gin 


Glu 


Gly 








20 










25 








30 






Glu 


Gly 


Ser 


Ala 


Axg 


Tyr 


Arg 


Phe 


Lys 


Gly 


Phe 


Val 


Asp 


Thr 


Tyr 


His 






35 










40 










45 








Ala 


Val 


Arg 


Ser 


Ser 


Ser 


Pro 


Phe 


Asp 


Phe 




Ser 


Ser 


Arg 


Thr 


Arg 




50 










55 










60 










Val 


Arg 


Gly 


Glu 


Leu 


Glu 


Arg 


Sor 


Phe 


Gly 


Asn 


Ser 


Lys 


Val 


Ala 


Val 


65 




















75 










80 


Ser 


Val 


Asn 


Ala 


Thr 


Tyr 


Asn 


Ala 


Leu 


Leu 


Lys 


TUp 


Glu 


Thr 


Gly 


Leu 










85 










50 










9b 




Arg 


Leu 


Arg 


Glu 


Ala 


Phe 


Phe 


Glu 


His 


Gin 


Glu 


Glu 


His 


Trp 


Gly 


Leu 








100 










105 










110 






Acq 


Leu 


Gly 


Arg 


Gin 


He 


Val 


He 


Trp 


Gly 


Ala 


Ala 


Asp Gly 


Val 


Arg 






LIS 










120 










125 








He 


Thr 


Acp 


Leu 


Tie 


Ser 


Pro 


Met 


Asp 


ttet 


Thr 


Glu 


Phe 


Leu 


Ala 


Gin 




130 










135 










140 










Asp 


Tyr 


Asp 


Asp 


He 


Arg 


Het 


Pro 


Val 


Asn 


Ala 


Leu 


Arg 


Phe 


Ser 


Val 


145 










ISO 










155 










160 


Phe 


Attii 


Glu 


Ser 


list 


Lys 


Val 


Glu 


Val 


Val 


Val 


Leu 


Pro 


Val 


Phe 


Glu 










165 










170 










175 




Gly 


Tyr 


Arg 


Leu 


Pro 


Val 


Asp 


Pro 


Arg 


TVen 


Pro 


Trp 


Asn 


He 


Phe 


Ser 








180 










185 










190 






Lsu 


Ser 


Pro 


He 


Ala 


Gin 


Gly 


Het 


A3n 


He 


Val 


Trp 


Lys 


Glu 


Glu 


Ala 






195 










200 










205 








Gly 


Lys 


Pro 


Ala 


Phe 


Lys 


Val 


Ala 


Asn 


He 


Glu 


Tyc 


Gly Ala 


Arg 


Trp 




210 










215 










220. 










Ser 


Thr 


Thr 


Leu 


Ser 


Gly 


He 


Asp 


Phe 


Ala 


Leu 


Ala 


Ala 


Lou 


His 


Thr 


225 










230 










235 










24 0 


Trp 


Aen 


Lyff 


net 


Pro 


Val 


He 


Glu 


Val 


Gin 


Gly 


He 


Val 


Pro 


Thr 


Giu 










24 5 










250 










255 




He 


He 


Val 


Ser 


Pro 


Arg 


Tyr 


Tyr 


Arg 


Het 


Gly 


Phe 


Val 


Gly 


GlY 


Asp 








260 










265 










270 
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Leu Ser Vai Pro Vai Giy Gin Ph« Val Phe Aig Giy Glu P.ia Ala Phe 

275 280 285 

Asn lie Asp Lys His Phe Thr Tyr Lys Sec His Ala Glu Gin Glu Gly 
290 ?.95 300 

5 Phe Glrv The Il-e Asn Trp Leu Ala Gly Ala Asp Trp Tyi: Ala Pro Gly 
305 310 315 320 

Glu Trp Met He Ser Gly Gin Phe Sec l-tet Glu Ser He Phe Arq Tyr 

325 330 335 

Arq Asp Phe He Sec Gin Art} Gin His Ser Thr L^iU He The Leu Asn 
10 340 345 350 

Vai Ser Lys Lys Phe Phe Gly Ser Thr Leu Gin Leu Ser Asp Phe Thr 

355 360 365 

Tyr Tyr Any Leu The Gly Lys Gly Trp Phe Ser Arg Phe Ala Ala Asp 
370 37b 380 

15 Tyr Ala T.eii A.5n A^p Gin He His Leu Met Ala Gly Tyr Asp Trp Phe 
385 390 396 400 

Ser Ser Lys Giy Ser Gly He Phe Asp Arq Tyr L/s Asp Asn Ser Glu 

405 410 415 

Leu Trp Phe Lys Ala Arg Tyr Ser Phe 
20 4 20 4 25 

(£J rUFORriATIon for SEQ id NO:4 07 

(i) SEQUEIICE CHARACTERISTICS: 
25 (A) LEJJGTH: 104 amino acidi- 

(B) TYPK: amino acid 
(D) TOPOLOGY: linear 



30 



35 



50 



55 



60 



65 



70 



75 



(ii} MOLECULE TYPE: protein 

(ill) HYPOTHETICAL: YES 

{vij ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonae gingival is 

( i X ) FEATURE : 

(A) HA>1E/KEY: mtsc_f eature 
(BJ LOCATION 1. , .404 



40 (xi) SEQUENCE DESCRIPTION; SEQ ID NO:487 



45 



Met 


Ser 


Ser 


Cys 


Glu 


Val 


Ala 


Tyr 


Phe 


Ser 


Leu 


Lys 


Pro 


He 


Asp 


Leu 


1 








5 










10 










IS 




Gin 


Asn 


He 


Arn 


Glu 


Arg 


Asn 


His 


Ser 


Ser 


Asp 


He 


Ala 


Leu 


Ser 


Asn 








20' 










25 










30 






Leu 


Leu 


Asp 


Asn 


Ser 


Asn 


Gin 


Leu 


Leu 


Ala 


Thr 


He 


Leu 


He 


Gly Aen 






35* 










40 










45 








Asn 


Vai 




A.<?n 


Val 


Ala 


He 


Val 


He 


Leu 


Ser 


Asn 


Tyr Ala 


He 


Glu 




50 










55 










60 










Gin 


thr 


Phe 


Vai 


Phe 


Ser 


Ser 


Pro 


He 


He 


Gly 


Phe 


Leu 


He 


Gin 


Thr 


65 










70 










7i, 










80 


He 


Leu 


I.eu 


Thr 


Thr 


Val 


Leu 


Leu 


Leu 


Phe 


Gly 


Glu 


He 


Leu 


Pro 


Lys 










85 










90 










05 




Vai 


Tyr 


Ala 


Arg 


Lys 


Asn 


Pto 


Leu 


Gin 


Tyr 


Ser 


Arg 


Phe 


Ser 


Ala 


Ma 








100 










105 










110 






Ala 


Met 


Ser 


Vai 


He 


Tyr 


Lys 


He 


Leu 


Ser 


Pro 


Phe 


Ser 


Lys 


Leu 


Leu 






115 










120 










125 








Vai 


Lys 


Ser 


Thr 


Gly 


H^ 


Val 


Thr 


Arg 


Gly 


He 


Ser 


Lys 


Lys 


T.ys 


Tyr 




130 










135 










140 










Asp 


Ur.t. 


Ser 


Val 


Asp 


Glu 


Leu 


Ser 


Lys 


Ala 


Val 


Ala 


Leu 


Thr 


Thr 


Thr 


145 










150 










155 










160 


Glu 


Gly 


Glu 


Pro 


Glu 


Glu 


Lys 


Glu 


Met 


He 


Asn 


Glu 


He 


He 


Lys 


Phe 










165 










170 










T75 




Tyr 


A^n 


Lys 


Thr 


Ala 


Cys 


Glu 


He 


Met 


Val 


Pro 


Arg 


He 


Asp 


He 


Val 








180 










185 










190 






Asp 


Vai 


Arp 


Leu 


Ser 


Trp 




Phe 


Arg 


Lys 


Met 


Leu 


Asp 


Phe 


Val 


Val 






IPS 










200 










205 








Ser 


Ser 


Gly 


Tyr 


Ser 


Arg 


Leu 


Pro 


Vai 


Ser 


Glu 


Gly 


Ser 


Glu 


Asp 


Asn 




210 










215 










220- 










He 


Lys 


Gly 


Val 


lift 


Tyr 


He 


Lys 


Asp 


Leu 


He 


Pro 


His 


Met 


Asp 


Ly^ 


225 










230 










23S 










240 


Gly 


Asp 


Glu 


Phe 


Asp 


Trp 


His 


Pro 


Leu 


He 


Arg 


Lys 


Ala 


Tyr 


Phe 


Vai 










245 










250 










255 




Pro 


Glu 


Asn 


Lys 


Arg 


He 


Asp 


Asp 


Leu 


Leu 


Glu 


Glu 


Phe 


Arg 


Ala 


.Asn 








25C 










265 










270 
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Lys 


Val 


His 


Val 


Ser 


He 


Val 


Val 


Asp Glu 


Phtt 


Gly 


Gly 


Thr 


Cys 


Gly 
















280 










285 








Leu 


Il« 


Thr 


Met 


Glu 


Asp 


Tie 


Leu 


Glu 


Glu 


He 


Val 


Gly Glu 


He 


Thr 




290 










295 










300 










Asp 


Glu 


Tyr 


Asp 


Glu 


Glu 


Glu 


Leu 


Pro 


Phe 


Lye 


Val 


Leu 


Gly Asp Gly 


3Q5 










310 










315 










320 


Ser 


Tyr 


Leu 


Phe 


Glu 


Gly 


Lys 


Thr 


Ser 


Leu 


Ser 


Asp 


Val 


Arg 


His 


Tyr 










325 










330 










335 




Leu 


Asp 


Leu 


Pro 


Glu 


Asn 


Ala 


Phe 


Gly 


Glu 


Leu 


Gly 


Aep 


Glu 


Val 


Asp 








340 










345 










350 






Thr 


Leu 


Ser 


Gly 


Leu 


Phe 


Leu 


Glu 


He 


Lys 


Gin 


Glu 


Leu 


Pro 


His 


Val 






355 










360 










365 








Gly 


Arp 


Thr 


Ala 


Val 


Tyr 


Glu 


Pro 


Phe 


Ary 


Phe 


Gin 


Val 


Thr 


Gin 


net 




37 Q 










37 5 




















Asp 


Lys 


Acg 


Arg 


He 


n- 


Glu 


He 


Lys 


He 


Phe 


Pro 


Phe 


Glu 


Arg 


Thr 


3QS 










390 










395 










400 


Tcp 


Glu 


Val 


Glu 



























(2) IHFORJ-IATION FOR SEQ ID 110:488 

(1) SEQUENCF. CHARACTERISTICS: 

(A) LENGTH: 260 amino acid.'? 

(B) TYPE: amino acid 
(D) TOpaiX)GV: linear 

lii) MOLECULE TYPE: protein 

[Hi J HYPOTHETICAL: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGANISTi: Porphyromonaa gingivaliff 

(Ix) F-EATURE: 

(A) HAME/KEY: misc_feature 

(B) LOCATIOlf 1 . . ,260 



{xi) SEQUENCE DESCRlPTIOtl: SEQ ID HO: -188 



Met 


hys 


Leu 


Leu 


Leu 


T-yr 


Leu 


Leu 


Leu 


Val 


Leu 


Ser 


Thr Leu 


Ser 


Pro 


1 






5 










10 








IS 




Met 


Tyr 


Ser 


Gin 


Met 


Leu 


Phe 


Ser 


Glu 


Asn 


Lgu 


Thr 


Met A-rn 


Ho 


A£!p 








20 










25 








30 






Sei 


Thr 


hys 


Thr 


He 


Gin 


Gly 


Thr 


He 


Leu 


Pro 


Val 


Leu Asp 


Phe 


Lys 






35 










40 










4S 






Thr 


Glu 


I.y.» 


Glu 


Asn 


Val 


Phe 


Thr 


Phe 


Lyi.- 


Ai'n 


Thr 


Ala Asn 


Leu 


Asn 




50 










55 


















Leu 


Leu 


He 


Lys 


hLa 


Gly 


Gin 


Val 


71 e 


Asn 


Leu 


He 


Asn Lys 


Leu 


Glu 


65 










70 










7 5 








80 


Phe 


Scr 


Thr 


Tyr 


Gly 


A5n 


Lys 


Val 


Thr 


Val 


Ser 


Gly 


Giy Tyr 


Val 


Hie 










85 










^o 








95 




Thr 


Glu 


Tyr 


Arg 


Tyr 


Leu 


Leu 


His 


His 


Val 


Phe 


Glu 


Val Tyr 


Pro 


Tyr 








100 










105 








110 






Val 


Glu 


Ser 


Gin 


Trp 


Ala 


Glu 


Ser 


Arg 


Gly 


Met 


Lys 


Tyr Lys Val 


Ssr 






lis 










120 










125 






Thr Gly 


Leu 


Gin 


Ser 


Arg 


Tyr 


Arg 


Lsu 


Val 


Asn 


Ser 


Asp Asn 


Cys 


Leu 




130 










135 










140 








Met 


Phe 


Ala 


Thr 


L«iu 


Gly 


Val 


Phe 


Phe 


Glu 


Phe 


Glu 


Ly? Trp 


Glu 


Gin 


145 










150 










155 








160 


Pro 


Als 


Thr 


Ser 


Leu 


Phe 


Ala 


Gly 


Thr 


Tyr 


Ali 


Tyr 


Ser Arg 


Ser 


He 










165 










no 








175 




Lys 


Ser 


His 


Leu 


Ser 


He 


Ser 


Phe 


Arg 


His 


Arg 


Leu 


Gly Glu 


His 


Trp 








180 










185 








190 






Glu 


Phe 


Thr 


Thr 


Thr 


Ala 


He 


His 


Gin 


Gly 


Lys 


Pro 


Asp Ser 


Tyr 


Phe 






195 










200 










205 




Lys 


Ly= 


Ala 


Arg 


Ph« 


Gly 


Gly 


Ala 


He 


Asp 


Leu 


Lys 


Tyr His 


He 


Thr 




210 










215 










220 








Pro 


Thr 


He 


Gly 


He 


Arg 


Gly 


Ala 


Tyr 


Arg 


He 


He 


Tyr Asp Thr 


Ala 


225 










230 










235 








240 


Pro 


He 


Val 


Pro 


Val 


Arg 


Lya 


Arp 


Tyr 


Asn 


Thr 


Vai 


Asp Val 


Gly 


He 










245 










7.50 








255 




Asp 


He 


Ser 


Phe 






























260 
























<21 


IHfOWlATIOH 


FOR 


SEQ 


ID HO:4( 


35 
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10 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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tn SEQUEMCE CHARACTERISTICS: 

(A) LEMGTH; 834 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) MOLECULE TYPE; protein 

Uii) HYPCrrilETlCAL: YES 

(vi) ORIGIIJAL SOURCE: 

|A) ORGANISH; Pcjrphyromonas gingivaiis 



tlx) FEATURE: 
15 (A) HAIIS/KEY: misc_feature 

IB) LOCATION 1 . . ,834 



txi) SEQUCncr. DF.SCRTPTION: SEQ id no: 499 



Met 


Lys 


Arg 


He 


Val 


Leu 


Ser 


Ser 


Phe 


Leu 


Phe 


Val 


Leu 


Ser 


He 


Leu 


I 








S 










10 










15 








Met 


Ala 


Gin 


Asn 


Asn 


Thr 


Leu 


Asp 


Val 


His 


He 


Ser 


Gly 


Thr 








20 










25 










30 






lis 


Lys 


Asp 


Ala 


Ser 


Sec 


•3iy 


Glu 


Pro 


Val 


Pro 


Tyr 


Ala 


Thr 


Val 


Ser 






3S 










40 










45 








He 


Acg 




Thr 


Gly 


Al« 


A5P 


Thr 


Thr 


Gin 


Val 


Phe 


Axg 


Gin 


Val 


Thi 




SO 










55 




















Asp 


Gly 


Asn 


Gly 


Tyr 


Phe 


Val 


lie 


Gly 


Leu 


Pro 


Ala 


Ala 


Pro 


Ser 


Tyr 












70 










75 










80 


His 


Leu 


Thr 


Ala 


Ser 


Phe 


Val 


Gly 


Met 


Lys 


Thr 


His 


Thr 


Met 


Gin 


He 










85 










90 










95 




Ser 


Arg 


Gly 


Asn 


Gly 


Gin 


His 


Asp 


lis 


Lys 


Ser 


He 


Asp 


He 


Ser 


Leu 








100 










105 










HO 






GlU 


Ser 


Glu 


Asp 


Lys 


Gin 


Leu 


Ser 


Thr 


Val 


Thr 


Val 


Ser 


Ala 


Ala 


Arg 






115 










120 










125 






Pro 


Leu 


Val 


Lys 


Met 


Glu 


He 


Asp 


Arq 


Leu 


Ssr 


Tyr 


Asn 


Met 


Lys 


Asp 




130 










135 










140 










A.-P 


Pro 


Ala 


Ala 


Lys 


Thr 


Asn 


Asn 


Leu 


Leu 


Glu 


Met 


Leu 


Arg 


Asn 


Val 


14S 










150 










155 










ISO 


Pro 


L*;u 


Val 


Thr 


Val 


Asp 


Gly 


Gin 


Gly 


Asn 


He 


Gin 


Val 


Lys 


Gly 


Ser 










16S 










170 










175 




Ser 


Asn 


Phft 


Lys 


He 


His 


Leu 


Asn 


Gly 


Arg 


Pro 


Ser 


Thr 


Met 


Val 


Ser 








180 










185 










190 






Ser 


Asn 


Pro 


Lys 


Glu 


Val 


Phe 


Arg 


Ser 


Tlo 


Pro 


Al a 




Thr 


He 


Lys 






195 










200 










205 








Axg 


Val 


Glu 


Val 


He 


Thr 


Asp 


Pro 


Gly 


Val 


Lys 


Tyr 


Asp 


Ala 


Glu 


Gly 




210 










215 










220 










Thr 


Ser 


Ala 


He 


Leu 


Asp 


He 


Val 


Thr 


G1 u 


Glu 


Gly 


Lys 


Lys 


Leu 


Glu 


225 










230 










235 










240 


Gly 


Tyr 


Ser 


Gly 


Ser 


11- 


Thr 


Ala 


Ser 


Val 


Ser 


Asn 


Asn 


Pro 


Thi- 


Ala 










245 










250 










255 




Asn 


Gly 


Ser 


He 


Phe 


Leu 


Thr 


Ala 


Lys 


Ser 


Gly Lyj? 


Val 


Gly 


Leu 


Thr 








260 










265 










270 






Thr 


Asn 


Tyr 


Asn 


Tyr 


Tyr 


Gly 


Gly 


Lys 


Asn 


Lys 


Gly 


Ser 


Arg 


Tyr 


Phe 






275 










280 










205 








Thr 


Glu 


Arg 


Thr 


Thr 


Ser 


Met 


Leu 


Gin 


Thr 


He 


Glu 


Glu 


Gly 


Lys 


Gly 




290 










295 










300 










Gin 


Giu 


Thr 


Phe 


Gly 


Gly 


His 


Phe 


Gly 


Asn 


Ala 


Leu 


Leu 


Ser 


Phe 


Glu 


305 










310 










315 










320 


He 


Aip 


Ser 


Leu 


Asn 


Leu 


Phe 


Thr 


Val 


Gly 


Gly 


Asn 


Val 


Arg 


Leu 


Trp 










325 










330 










335 




Clu 


Het 


Thr 


Thr 


Asp 


Arg 


Asn 


Sar 


Val 


Glu 


Lys 


Sor 


Phe 


Ala 


Gly 


ser 








340 










345 










350 




Asn 


Leu 


Met 


Ser 


Tyr 


He 


Asp 


Arg 


Lys 


Leu 


Lys 


Thr 


Gin 


Met 


Asp 


Ala 






355 










360 










365 








Gly 


Ser 


Tyr 


Glu 


Leu 


Asn 


Ala 


Asp 


Tyr 


Gin 


His 


ser 


Thr 


Arg 


Leu 


Pro 




3T0 










37 5 










380 










Gly 


Glu 


Lgu 


Leu 


Thr 


Val 


Ser 


Tyr 


Axg 


Phe 


Thr 


His 


Asn 


Pro 


Asn 


Asn 


385 










350 










395 










4C0 


Ser 


Glu 


Thr 


Phe 


He 


Asp 


Gin 


Trp 


Lys 


Arg 


Asp 


Pro 


Leu 


Asn 


Thr 


Ala 










405 










410 










415 




Aan 


Thr 


Ho 


Gin 


Tyr 


Ala 


Gly 


Gin 


His 


Ser 


Lys 


Ser 


Asp 


Ala 


Gly 


Met 








420 










425 










430 






Asp 


Glu 


His 


Thr 


Ma 


Gin 


Val 


Asp 


Tyc 


Thr 


Arg 


Pro 


Leu 


Gly 


Gin 


Ala 






435 










440 










445 
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His 


Ser 


Leu 




Ala 


Cily 


Leu 


Lys Tyr 


He 


Tyc 


Acg 


His 


Ala 


Thr 


sor 




450 










455 








460 










Afp 


Pro 


Leu 


Tyr 


Glu 


He 


Arg 


Pro Ser 


Glu 


Asp 


Ala 


Pro 


Trp 


Gin 


Pro 


4 65 










410 








475 










480 


Cly 


Ser 


Leu 


Tyr 


Ala 


Gin 


Asn 


Pro Ser 


Asn 


Gly 


Lys 


Phe 


Arg 


His 


Asp 








4fl5 








4 90 










4 95 




Gin 


Tyr 


He 


Gly Ala 


Ala 


Tyr Ala Gly Tyr 


Asn 


Tyr 


Arq 


Lys 


Asp 


Gin 








500 








505 










510 






Tyr 


Sr.r 


LC;\l 


Gin 


Thr 


Gly 


Leu 


Arg Val 


Giu 


Ser 


Ser 


Arg 


Leu 


Lys 


Ala 






515 










520 








52 5 








Leu 


i?he 


Pro 


Glu 


Asn 


Ala 


Ala 


Ala Asp 


Phe 


Ser 


His 


Asn 


Ser 


Phe 


Asp 




530 










535 








54 0 










Trp 


Val 


Pro 


Gin 


LRU 


Thr 


Leu 


Gly Tyr Thr 


Pro 


Ser 


Pro 


Met 


Lys 


Gin 












550 








555 










560 


Leu 


Lys 


Leu 


Ala 


Tyr 


Asn 


Phe 


Arg He 


Gin 


Arg 


Pro 


Ala 


He 


Gly 


Gin 










5SS 








57 0 










575 




Leu 


Asn 


Pro Tyr 


Arg 


Leu 


Gin 


Thr Asn 


Asp 


Tyr 


Gin 


Vai 


Gin 


Tyr 


Giy 








580 








505 










590 






Asn 


Pro 


Asp 


Leu 


Lys 


Ser 


Glu 


Lys Arg 


Hie 


His 


Val 


Gly T.eu 


Ser 


Tyr 






590 










500 








605 








PiSn 


filn 


Tj-r 


Gly 


Ala 


Lys 


Val 


Het Leu 


Thr 


Ala 


Ser 


Leu 


Asp 


Tyr 


Asp 




610 










61S 








620 










Phe 


eye 


Asn 


Asn 


Ala 


He 


Gin 


Asn Tyr 


Thr 


Phe 


Ser 


Asp 


Pro 


Ala 


Asn 


625 










630 








635 










640 


Pro 




Leu 


The 


His 


Gin 


Thr 


Tyr Giy 


Asn 


He 


Gly 


Aru 


Glu 


Hir 


Ser 










645 








650 










655 




Pho 


Ser 


Leu 


Asn 


Ttir 


Tyr 


Ala 


Het Tyr 


Thr 


Pro 


Ala 


Val 


Trp 


Val 


Arg 








650 








665 










67 0 






He 


Met 


Leu 


Asn 


Gly 


Asn 


He 


Asp Arg 


Thr 


Phe 


Gin 


Lys 


Ser 


Glu 


Ala 






67 5 










680 








685 








Leu 


Gly 


lie 


Asp 


Val 


Asn 


Ser 


Trp Ser 


Gly Met 


Val 


Tyr Ser Gly Leu 




690 










695 








700 










Met 


Phe 


Thr 


Leu 


Pro 


Lys 


Asp Trp Thr 


Val 


Asn 


Leu 


Phe Gly Gly 


Tyr 


70S 










710 








715 










720 


Tyr 


His 


Gly 


Gly Arg 


Sor 


Tyr Gin Thr 


Lys 


Tyr 


Asp Gly Asn Val 


Phe 










725 








730 










735 




Asn 


Asn 


He 


Gly 


He 


Ala 


Lys 


Gin Leu 


Phe 


Asp 


Lys 


Lys 


Leu Arg 


Val 








740 








745 










750 






Ser 


Leu 


Ser 


Ala 


Asn 


Asn 


Hs 


His Ala 


Lys 


Tyr 


Ser 


Thr Trp Ly^ 


Ser 






755 










760 








7 65 








Arg 


Thr 


He 


Gly 


Asn 


Gly 


Phe 


Thr He 


Tyr 


Ser 


Glu 


Asn Ala Gly 


H = 




770 










775 








7B0 










Gin Arg 


Ser 


Val 


Ser 


Leu 


Ser 


Leu Thr Tyr 


Ser 


Phe 


Gly Lys Het Asn 


785 










7 90 








7 55 










000 


Thr 


Gin 


Val 


Arg 


Lys 


Val 


Glu 


Arg Thr 


He 


Val 


Asn 


Asp Asp 


Leu 


Lys 










805 








810 










815 




Gin 


Thr 


Ser 


Scr 


Gin 


GJy 


Gin 


Gin Gly 


Gly 


Giy 


Gin 


Gly Asn 


Pro 


Thr 








820 


















830 






Gly 


Asn 





























(2) INFORJ-IATIOM FOR SEQ ID MO: 4 90 

il) SKQUEltCE CIIARACTEKISTICS: 

[A) LEMfPTH: 3 99 amino acids 
(Bt TYPE: amino acid 
(D| TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(VI) ORIGIIJAL SOURCE: 

(A} ORGAtUSN: Porphycomonas gingival is 

FEATURE: 

(A) HAJIE/KEY: misc featur*? 

(B) LOCATIOrr i...399 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 490 

Met Ary hnu Sec Ala He Leu He Ala Leu He Val Met Leu Pco Ala 
15 10 15 

Val Leu Sec Gly Gin His Tyr Tyr Ser Met Ala Giy Glu Arg Leu Glu 
20 25 30 
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5 
10 
15 
20 
25 
30 
35 
40 
45 
50 
55 
60 
65 
70 
75 



Th r 






Pro Asr\ Glu 


L«u 


Ser 


Ala 


S%r 


rie 


Rxq 


Ser 


Ala 






35 


40 










45 










Phe 


PKa &m &<?rk 
C^fl? r\SXi\ 


Affn Glu Tyr A.Tn 


Ala 


Arg 


St 


Val 


Lys 


Gly 


Tyr 




SO 




55 


















Thr 


Lc\» 


D I V/ Ala 

It K x> u X J t\X o 


Arq Val Ser 


Ala 


Phe 


Al a 


Ser 


Tyc 


Ser 


Leu 


Pro 


65 






■70 






7 5 










80 


Ala 


Ala 


His Gly Val 


Lys Leu Ser 


Lou 


Gly 


V'sl 


Ser 


Thr 


LfiU 


Asn 


Tyr 






05 






90 










95 




Tro 


Gly 


Ala Ser Arfj 


Tyr Pro Ala 


Gly lie 


Ala 


I yr 


Ser Asp 


Leu 


Pro 






100 




105 










110 






Tvr 


Tro 


Thr Asp Tyr 


A-sn Asp Tyr 


Val 


Ar-j 


Leu 




lie 


Lou 


Pro 


Tyr 






US 


120 










125 








VaX 


Gin 


Ala Hst Lsii 


Lys Pro Thr 


Ala 


Thr 


Thr 




Leu 


Met 


Leu 


Gly 




1 30 




13fi 








I jn 










Arn 


lie 


f\i.€k VaXj 


Thr Ala His 


Gly 


Leu 


11* 


r 1 


Pro 


lie 


Tyr Asn 


1 J J5 






150 






155 










160 


Pro 




LtSU Asp i^eu 


Thr Ala Asp 


Pro 


Glu 


Ala 


Gly 


Val 


Gin 


Phe 


Arg 






165 






170 










175 




Gly 


Asp 


Trp Thr Arc) 


Phe Arg Met Asp Val 


Trp 




Asn Trp 


Met 


Ser 






160 




185 
















Met 


Z is 


Prve l.ys Af^n 


Asp Asn His 


Gin 


Glu 


Ssr 


Phe 


Val 


Phe 


Gly 


L^ju 






1 95 


200 










205 








S«r 


Thr 


Thr Ssr Lys 


Leu Leu Ser 


Gly 


Glu 


Gl y 


Lys 


Trp 


Arg 


Leu 


Glu 




210 




215 








220 










Lsu 


Pro 


IjSU (t.in Aia 


lie Ala Thr 




Arg 


Gly 


Gly 


Glu 


Tyr 


A£^n Tip 


225 






330 






235 










2'10 


Ala 


Gin 


Gin Asp Thr 


Val His Thr 


Trp Val 


Asn 


Gly 


Ala 




Gly 


Leu 






2']5 






250 










255 






Leu 


Ser Tyr Arg 


Pro Arg Thr Ai-p 


Lyr 


Pro 


Itc-t 


Gin 


He 


Trp 


Gly 






2S0 




265 










270 






Ssr 


Ala 


Tyr Gly Val 


Ala Ala Leu 


Ser 


Ser 


Gly 


Gly 


Tyr 


Phe 


Pro 


Tyr 






275 


280 








265 








Gill 


Arq 


Gly Trp Gly 


Gly Tyr Leu 


Ser 


Leu 


Gly 


net 




Leu 


Glu 


His 




290 




295 








300 










Phe 


Ala 


Phe Arg Tlir 


Asp Tyr Trp Tyr Gly 


Arg 


His 


Tyr 


VbI 


Ser 


Pro 


305 






310 






315 










320 


Phe 


Ala 


Ala Pro Phe 


Ala Asn Sec 


Leu 


Thr 


Tyr 


Asp 


Lys 


Gin 


Pro 


Leu 






325 






330 










335 




Thr 


Asn 


Gly Trp Gly 


Asp Tyr lie Arg 


Leu 


Tyr 


Ala 


Asp 


Tyr 


Ser 


Trp 






340 




345 










350 






Arg 


net 


Ala Arg Ser 


Val S«r Leu 


Ala 


Ala 


Val 


Ala 


Arg Yal 


Trp 


Phe 






355 


360 










365 






Gin 


Pro 


Ser Afp Arg 


Phe Ala Met 


Ser 


His 


Ala 


Leu 


Glu 


Leu 


Thr 


Het 




370 




37 5 








380 










Ara 


Tie 


7\sp Pro Lys 


Phe Pro lie 


Ala 


Phe 


Leu 


Lys 


Gly Ksn 


His 




355 






390 






395 












(2) 


IMFORHATIOH FOR 


SEQ ID NO: 4 91 

















U) SEQUEUCR CHARACTERISTICS: 

(A) LENGTH: 382 amino acids 

(B) TYPE: amino acid 
(DJ TOPOLOGY: linear 

(11 > MOLECULE TYPE: protein 

Uil) HYParHETICAL: YES 

(vi> ORTGIHAL SOURCE: 

(h) OKGPHISH'. Porphyromonas gingival is 

(Ix) FEATURE: 

(A> HAME/KEi: misc_featuce 
IB) LOCATlOI-1 1 . . .382 

(xil SEQUEilCE DESCRIPTIOH; SEQ ID IIO:49l 

I let Asn Lys Ser Leu Lou Ser Leu Ala Cys Leu lie Leu Cys Gly I let 

15 10 15 

Pro Ala He Ala Gin Gin Thr Gly Pro Ala Glu Arg Ser Gly Glu Pro 

20 2•^ 3C 

Ser Leu Ala Glu Arq Val Phe Gly Leu Glu Gin L/s Gin Lys Lye Leu 

35 40 45 

Ly# Val Tyr Leu Gly lie Gin Ser Phe Tyr A^p Gin Pro Leu Val Asp 
50 5b 60 
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Asp 


Glu 


lier 


His 


He 


Gly 


His 


Phe 


Lys- 


Val 


Gin 


Glu 


Leu 


Arg 


Met 


Ser 


65 










70 










75 










80 


Ala 


His 


Gly 


Glu 


Leu 


Asn 


Arg 


His 


I,*iU 


Ser 


Phi- 


Asp 


Trp 


Arg 


Gin 


Arg 










85 










90 










95 






Asn 


Arg 


Ala 


Ala 


Asp 


Gly Thr 


Ser 


Phe 


Ala 


Aep 


Asn 


Leu 


Ser 


Asn 








100 










105 










110 






Ala 


Il« 


Asp 


lie 


Ala 


Gly Val 


Asp 


Trp 


His 


Pro 


Asn 


Asp 


Lys 


Val 


Ser 






116 










120 










125 








Phe 


Phe 


Phe 


Gly Arg 


Gin 


Tyr 


Ala 


Arg 


Phe 


Gly 


Gly 


He 


Glu 


Tyr 


Asp 




130 










135 










140 










net 


Ai-n 


Pro 


Val 


Glu 


lie 


Tyr 


Gin 


Tyr 


Ser Asp 


Leu 


Val 


Asp 


Tyr 


Met 


145 










150 










155 










160 


Thr 


Cy.5 


Tyr 


Thr 


Ser 


Gly 


Val 


Asn 


Phe 


Ala 


Trp 


Asn 


Phe 


His 


Pro 


Glu 








165 










170 










175 




Gin 


Gill 


Leu 


Gin 


Leu 


Gin 


Vsl 


Leu 


Asn 


Ala 


Tyr 


Asn 


Asn 


Arg 


Phe 


Ala 








1»0 










185 










190 






Asp 


Arg 


Tyr 


His 


Val 


Thr 


Pro 


A-p 


Val 


Ala 


Thr 


Ala 


Thr 


Ser 


Tyr 


Pro 






195 










200 










205 








Leu 


Leu 


Tyr 


Ser 


Ala 


Gin 


Trp Asn 


Gly Thr 


Leu 


Leu 


Gly 


Gly 


Ala 


Leu 




210 










215 










220 










Hi5 


It-et 


Arg 


Tyc 


Ala 


Val 


Sec 


Met 


Ai- 


Hla 


Gin 


Ala 


Gin 


Glu 


Arg 


A3n 


22S 










230 










235 










240 


n^t 


Trp 


Tyr 


Phe 


Thr 


Ala 


Gly Asn 


Leu 


Phe 


Asn 


Pro 


Gly 


Lys 


Arg 


He 










245 










250 










25S 




A5n 


Gly 


Tyr 


Leu 


Asp 


Leu Thr Tyr 


Ser 


He 


Glu 


Gly 


Leu 


Asp 


A.<jp 


Lys 








260 










265 










270 






Gly 


rie 


mt 


Thr 


Ala 


Arg Tyr Gly hys 


Gly Lys 


Thr 


Leu 


Thr 


A^'p 


Val 






215 










280 










285 








Lys 


Tyr 


Tyr 


Ala 


Leu 


Vdl 


Ser 


Lys 


Trp 


Asn 


Phe 


Arg 


He 


Phe 


Asp 


Gin 




290 










29i> 










300 










Vol 


Asn 


Leu 


Ph» 


Lpvi 


Lys 


Gly 


Met 


Tyr 


Glu 


Asn 


Gly 


•l*yr 


Ala 


Pro 


Ala 


305 










310 










315 










320 


Gin Tyr 


Gly 


Glu 


Set 


Ser 


Hib 


Thr 


Arg 


His 


Scr 


Tyr 


Gly 


Tyr 


Met 


Gly 










325 










330 










335 




Gly 


Val 


Glu 


Tyr 


Tyr 


Pro 


Thr 


Glu 


Thr 


Asn 


Phe 


Arg 


Leu 


Phe 


Val 


Thr 






340 








345 










350 






Tyr 


He 


Gly 


Arg 


His 


Tyr Arg 


Tyr 


Ser 


Ala 


Thr 


Glu 


Thr 


Glu 


Ser 


Thr 






355 










360 










365 








Asn 


Ala 


Leu 


Arg 


Ala 


Gly 


Leu 


He 


Tyr 


Gin 


He 


Pro 


Phe 


Leu 







370 375 380 



(2) IHFORJ-lATiaJ FOR SEQ ID NO: 492 

ti) SEQUEJiCE CHARACTERISTICS: 

(Al LENGTH: 222 amino acids 
(BJ TYPE: amino acid 
<DJ TCPOLOGV: linear 

(ii) MOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISrt: Porphyromonas gingivalis 

iix) FEATURE: 

(A) MAME/KEY: mi.-sc feature 

(B) LOCATIOil 1. . .222 



(xU SEQUENCE DESCRIPTION: SEQ ID HO: 4 92 



I let 


Lys 


His 


Leu 


Phe 


Lys 


Ser 


Thr 


r.eu 


Val 


Leu 


L-u 


Cys 


Ala 


Leu 


Ser 


1 








5 










10 










15 




Phe 


Ser 


Gly 


Thr 


Tyr 


Thr 


Phe 


Ala 


Gin 


Glu 


Asn 


Asn 


Thr 


Glu 


Lys 


Ser 








20 










25 










30 






Arg 


Phe 


A£p 


Phe 


Ser 


Val 


Arg 


Leu 


Gly 


Gin 


Gly 


Tyr 


He 


Ala 


Gly 


Ser 






35 










40 










45 








Thr 


Thr 


Asn 


Leu 


Met 


Tyr 


Gly 


Tyr 


Thr 


Ser 


Ala 


Asn 


Asp 


Arg 


Leu 


Leu 




50 










55 










60 










Ser 


Gly 


Ala 


He 


Tyr 


Leu 


Gly 


Leu 


Thr 


Pro 


Ser 


Lys 


L^'s 


Glu 


Asn 


Ala 


65 










70 










75 










80 


Thr 


Gly Val 


Ala 


Phe 


Arg 


Phe 


Leu 


Ser 


Pro 


Ser 


Pro 


Gly 


Tyr 


Tyr 


Val 










85 










90 










95 




Asp 


He 


Ser 


Gly 


Lys 


Glu 


Asn 


Thr 


Leu 


Asn 


Tyr 


Ala 


Phe 


Tyr 


Val 


Val 



100 105 110 



SUBSTITUTE SHEET (Rul& 26) (RjO/AU) 



-if^nj iP'~ iri»^i 

lUl «3i Oi wL, 



jfl! i'^' 'P'^i 

ai u o jc:; o iUi 
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20 



25 
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Gly Ala Tvr A..n Arg li. Ala lie Pro He Axg Pcu He Ly. A.n Phe 
Asn Phe lie Phe Ser Thr Glu Va^ Gly Met Ala Trp Mer S.r Arg HI. 



Glu Gin He Tyr A..n S.r Ser Gin Thr Trp L'p Lys Gin Arc, Lys 



^.r Arc Ser Giy Z.eu Phe Gly Leu Gly Met Hi.^ Leu Gin Unk tits 



165 — 

10 "-^^ ""y' A*p I,e>i Thr sir Cys 

1 85 1 on 

net Phe Gly Ly. Arg He A.^n Asp Tyr Gin Gin Lys Asp a;, Thr Phe 

200 205 

He Ala Leu He A.«p Asn Ser He Gly He Gly Leu Asn Leu 

15 215 220 
(2} IHtOW JATIOIJ FOR SEQ ID NO: 4 93 



55 



75 



(1) SF.OUENCE aiARACTERISTICS- 

(A) LENGTH: 391 amino acid^ 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(Hi) HYPOTIJETICAL: YES 



(vi) ORIGINAL SOURCE: 

(A) ORGAWISII: Po rphyromonas: gingival ir 
30 (i>:) FEATURE: 

(A> MAtlE/KEY: misc featur- 

(B) LOCATION 1. . ,391 

35 ^5=^ > fJEQUEMCE DESCRIPTION: SEQ ID NO: 493 

Met He He Lys Lys Met Leu Ly^ A.^n Ly.^ Leu Ala Pro Leu Ala He 
Leu Phe Leu Phe Ala Pro Lys Ala Het Jys Ala Gin Giu Gin [eu A^n 

40 Val Val Hi. Thr Ser Val Pro Ser Leu A.n He Ser Pro Asp Ala Arg 

Ala Ala Gly Met Gly A.p He G^y Vai Ala Thr Thr Pro Asp Ala Tyr 
5^ 60 
45 IV ^>-= H.t A^p Thr Ly. Ala 

Oly lie ser Ph, S,r Tyr Thr Pro Trp Leu Ser Ly. Leu Val A«„ L-p 
85 90 55 



He Ala Leu Met Gin Thr Gly Phe Tyr Ly, Leu Gly Thr I^p Glu 

50 Gin Ala lie Ser Ala Ser Leu Tyr Phe Thr L.u sjy Lye Leu 

Glu Thr Phe A.p Glu L=u Gly cfu Ser Met Gly Glu ^^l HI, Pro A.n 
Glu Phe Ala val Asp Leu Gly Tyr Ser Arg Gin L*u S« Glu Asn Phe 
ser Het Ala Val Ala Leu Arg Tyr He Arg "r A.p Gin Ser Thr H.'s 
A»„ Thr Gly Glu Asn Gin Ala Gly A.^n Ala Ph. Ala Ala Asp He Ala 

60 Gly Tyr Leu Gin Lys Tyr Val Leu ^eu Gly Asn Ala Glu Ser Leu Trp 

200 

S.r Leu Gly Phe Asn Vai Lys Asn He Gly Thr Uy. He Ser Tyr A.p 

65 til 2^30 

He Asp Asp Tyr Asn S-r He Asn Phe Asn 
250 

iln 

ASP G,n Ala Gly Tyr Glu Ala Ala L^: Lys Ly. TyF Gln gIu Thr Ser 

280 285 



Gly Leu Leu Tyr Pro II. Asp Asp Tyr Asn s.r lie Asn Phe Asn !eu 

''SO 

Glu Leu Ser Lys Leu Leu Val Pro Thr Jro Pre He >,et Asp 
70 ASD Gin Al. ni,, -P,,, ..... «. .. _ 270 



ser He Ser Gly He Phe Ser Ser Phe Gly A.p Ala Pro Gly Gly Leu 
Lys Glu Glu Phe Arg Clu lie Thr Trp Gly Leu Gl^y Ala Glu Tyr Ser 



PCT/AU98/01023 



31i> 320 



SUBSTITUTE SHEET (Rule 26) (RCVAU) 



o ~ii o JL. ii= o rfe, „, G ^ S G O 



^^'^'^PSTO I>Cr/AU98«1023 

445 / 490 

Tyr A^p Asp Ly^- Phe Phe Vsl Arg A] □ Gly Tyr Ser Tyr Leu Hi3 Pro 

330 33c 
Thr Lyr Gly A.^n L.>i Gin Tyr Phe Thr Ala Gly Aia Gly Phe Ly^ Het 

5 Asn Phe Arg He Asp Ala Ser Tyr Leu r.cu Scr Thr lie Gin s-r 

360 

A^rn Pro Lou Asp Gin Thr Leu Arq Phe Thr Leu Ala Phe Asp Met Aso 

Gly Leu Arg Asn Leu Plie Hie 
10 385 390 

(2) IlirORMATIOIl FOR SEQ ID HO: 4 94 

^_ (i) i'EQULMCE CHARACTERISTICS: 

[A| LEtlGTH: 446 amino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: llnnnr 



20 



25 



30 



{ii) molecule: TYPL: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGiriAL SOURCE: 

(A) ORGAMTSM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) KAJ IE/KEY: misc feature 
(BJ LOCATION 1 . . .44 6 

(xi) SEQUEIICE DESCRIPTIOiJ: SEQ ID MO: 494 



HeL Lys Thr Thr Val Gin Gin He He Leu Cys Leu Ala Leu Met Met 

ser Gly Val Leu Gly Gly A^n Ala Gin Ser Phe Trp Glu GIu lie Ala 

Pro Pro Phe he Ser Asn Glu Pro ^^n Val Lys Tyr He He Pro Asn 

Met Gly H« Asp Ser Lys Gly Thr He Tyr Vol Thr Val Thr Lys Arg 

40 He Gin Gin Gly Ala Asn Tyr Thr Ser Glu Gin Leu Gly Met Tyr Tyr 

Arg Pro Leu Gly Asp Asn Glu Gin Trp Trp lys Mis Asp Pro Tyr Phe 

Arp A^p Lys He Val Ala Asp He Gin Thr Asp Ala Tyr Gly Arg Val 

105 110 
Jyr Val Cys Thr Thr Ser Ser Arg Asp Gin Glu Tyr Gin Leu Tyr He 

Asn Glu Gin Asn Glu Trp Arg Cys He Phe Lys Thr Ser Val Ser Thr 

50 Tyr Glu His Gly Mot Ala vf 1 Phe Arg Ser Ser T^r Gly Val Thr Tyr 

ISS i^n 
11= Gly Thr Arg Mi, HI, II. Ph, AXa Ser Gly Val A«, A,p Phe gIu 

170 

Phe Thr lie Tyr Glu A^p Ser Thr Pro ,.,et Ser Cy, Arg Phe Ala 

Glu Ala Thr Asn Ser Gly Thr He Tyr Leu Ala Leu Met His Glu Thr 

200 20s 

Thr Met Ser Thr Thr lie Leu Thr Tyr Gin Asn Gly Glu Phe Val A,p 

fin 215 220 

III ^" ^" ^" "^"^ '^•^ 

Asn Ly« Glu Gly Asp IJ = He Wa Leu Val "r Ser Tyr Thr Gly Eh"' 

^-^S ,cc 
Met S,r Gly Thr Leu Ala He Arg Ly, Ala Arp Glu Gly T.y. Trp Gin 

Leu Val Gly Gly A=>p lie Gin A^n a" 11, v.l Gin A,n lie Cye Me: 

280 285 
net A.,p A,p A--n Lys He Ala Cys Glu Val Phe Gly Thr Pre C1-- 
70 300 

val Asp Gly Arg Thr Arg Val Cys Val Ser Asp Ali. Ser Val Fhe A^p 
?10 31S 



Phe Glu Trp Tyr Glu fl=p Glu He Tyr Gly Gly Leu He Phe Asp Thr 



75 



325 33Q 
Phe Phe Tyr Ser Pro Trp Asp Lys Leu Leu Tyr Ala Lys Phe Gly Gl- 
340 3^5 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 




™4t tvi- -iF-ni fllfi) 

gr; OTn Oi lii iUii 53 E O 1U1 o 
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lift 


Met 


Lnii 


Arg 


Ser 


Lys 


Glu 




Phe 


lie 


Thr 


ser 


Fhe 


He 


Ser 


Pro 






3S5 










3S0 










365 








Thr 


Val 


Val 


Gin 


Gly Val 


Ai-p 


Val 


Tyr 


Thr 


Leu 


Ala 


Gly Lys 


He 


Arg 




37 0 










37 5 










380 










lie 


Glu 


Snr 


Glij 


Thr 


Pro 


Val 


S*r 


Glu 


val 


Leu 


Leu 


Phe 


Asp 


Leu 


Ala 


385 










390 










395 










400 


Gly Arg 


Met 


Val 


Leu 


Arg Gin 


Thr 


1\b 


Asp hsn 


Lys 


Tie 


Tyr 


Ser 


Asp 










405 










410 










415 




He 


A.-P 


Thr 


Aun 


Cly 


Leu 


Lya 


Arq 


Ser 


Gly 


He 


Tyr 


Val 


Val 


Ser 


Val 








420 










425 










430 






Arg 


Leu 


S*r 


Ser 


Gly 


Gin 


Val 


Phe 


Ser 


His 


Lys- 


Val 


Gin 


Val 







(21 lUFORllATION FOR SEQ ID tIO:4 95 

li) SEQUEMCE CHARACTERISTICS: 

(A) LEMGTU: 30B amino acidr 
(3) TYPE: amino acid 
iD) TOPOLOGY: linear 

(it) IIOLECULE TYPt: protein 

(ill J HiTPOTHETICAL; YES 

( v'i ) ORICIHAL SOURCE: 

(A) ORGAHTSM: Pocphyromonas qin-givalis 

(i.-) FEATURt: 

(AJ r4Al'tE/KEY: mi sc_f eature 

(B) LOCTVTXOH 1 , . .30fl 



{xij SEQUENCE DESCRIPTIOI^: SEQ ID NO: '195 





He 


He 


Arg 


Cys 


Leu 


lie 


Arg 


Arg 


Pro 


Arg 


Thr 


Val 


Leu 


Phe 


Gly 


1 








5 










10 










15 




Leu 


He 


Phe 


Val 


Val 


Gly 


Leu 


Phe 


Ser 


pad 


MsC 


Ala 


Gin 


Glu 


Lys 


tys 








20 










25 










30 






Asp 


Sex 


Leu 


Ser 


Thr 


Va] 


Gin 


Pro 


Val 


Pro 


Asn 


Ser 


Ssr 


Met 


Val 


Glu 






35 










40 










45 








Gin 


Thr 


Pro 


Leu 


Leu 


Sec 


He 


Asp 


His 


Pro 


Val 


Lou 


Pro 


Ala 


Ser 


Phe 




50 










55 










60 










G.1 n 


Asn 


Thr 


Arg 


Thr 


Leu 


Lys 


Arg 


Phe 


Arg 


Asp 


Lys 


His 


Leu 


Ser 


Asp 


65 










70 










75 










80 


Ala 


Leu 


Leu 


Asn 


Gly 


Leu 


Lys 


Pro 


His 


Arg 


Ssr 


Ser 


Leu 


Gin 


Leu 


Asn 










85 










90 










95 




Glu 


Glu 


Leu 


Asn 


Phe 


Ala 


Ala 


Glu 


Arg Arg 


Asp 


Phe 


Val 


Ser 


Pro 


Leu 








100 










105 










110 






Leu 


Gin 


Thr 


Arq 


His 


Ala 


Ala 


Gly 


Val 


Leu 


Ser 


Trp 


Arg 


Pro 


Thr 


Asp 






115 










120 










125 








Arg 


iret 


III J? 


Phe 


Tyr 


Thr 


Ser 


Gly 


Asn 


He 


Gly Leu 


Gly His 


Asp 


Leu 




130 










135 










140 










L6U 


Thr 


Gly 


Val 


Tlrg 


Lys 


Asp 


Phc 


Gly 


Trp 


Asn 


Ala 


Gly 


Ala 


Asp 


Phe 


145 










150 










155 










160 


Leu 


Leu 


Ser 


Gin 


Asn 


Leu 


Thr 


Ala 


His 


Val 


Glr. 


Gly 


Gly Trp 


Gin 


Gin 










165 










170 








175 




Asn 


Phe 


Gly 


Phe 


He 


Pro 


Met 


Thr 


Ala 


Val 


Asn 


Gly 


Gin 


Leu 


Arg 


Trp 








180 










185 










190 






Cln 


Ala 


Thr 


Glu 


Arg 


Leu 


Ser 


Phe 


Thr 


Thr 


Gly 


He 


Asp Tyr Arg 


Gin 






195 










200 










205 








Val 


Gin 


Trp 


Asn 


Ala 


Phe 


Asp 


Asn 


Arg 


Thr 


Phe 


Ser 


Leu 


Lys 


Gly 


S-r 




210 










215 










220 










Ala 


Arg 


Tyr 


Glu 


Val 


Met 


Asp 


Asn 


Val 


Phe 


Vai 


Asn 


Gly 


Phs Gly 


Ser 


225 










230 










235 










240 


Tyr 


Peo 


Leu 


Tyr 


Ser 


Ser 


Thr 


Arg 


Ser 


Gly 


Leu 


Asn 


Met 


Ala 


Val 


Pro 










245 










250 










255 




Met 


His 


Gly 


Phe 


Gly 


Pro 


Gin 


Tyr 


Gly 


Gly 


Set- 


Leu 


Glu 


Leu 


Lys 


Val 








2 60 










26S 










270 






Sec 


Glu 


Arg 


Phe 


Gly 


Phe 


Ala 


Val 


Gly Met 


Glu 


Arg 


Glu Tyr 


Asn 


lie 






275 










7B0 










285 








Trp 


Thr 


Arg 


Arg 


Trp 


Glu 


Thr 


His 


Tyr 


Phe 


Ala 


Tyr 


Pro 


Val 


Ph? 


Tyr 




290 










295 










300 










Gly 




Lys 


Lys 



























435 



445 



305 



(2) lUtXJRI-lATIOM FOR SEQ ID MO: 496 
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ii\ sequemcf: characteristics: 

(A) LErJGTH: 315 amino acids 

(B) TYPE: amino acid 
[Di TOPOLCMJ-/: linear 

(il) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

fA) ORGANISri: Porphyromonas gingiva lis 
{ix) FEATURE: 

iA) NAllE/raY; mi sc featur- 

(B) LOCATIOfi 1. . .315 

(xl) SEC>UEHCE DESCRIPTIOM: SEQ ID rJO:4?6 

net Lys Thr Asn Arg Arg Tyr Ala Phe Val Leu Pro Lsu Leu Leu Leu 

Thr Gly T.eu Leu Ala Trp Gly Gin A.p Ser Ser Hi. Gly Ser 11. Thr 

25 30 



Ala Phe Ala Thr A.p S.r Ser Ser Arg Glu L.u Pro Thr Glu Gin Ser 

Ala Tyr Arg Tie Hi. S.r Ala iy. „et Val Gly Gly G^y Gly Ser lie 

Thr Arg A.p Thr Tyr Leu Ser Pro Leu Arg Tyr Gly Gly Trp Thr L=u 

A.n Leu Leu Gly Glu Ly, Thr Phe Pro Leu lys Ala Ser A.p Ser Arg 

Trp Met He Arg Thr Gly His Glu Leu ^p Phe Ala Leu Mot Asp Asn 

Pro Ala Asn A.n Ala His Phe Tyr Ser l,eu Leu Tyr Asn G^y Ser Ala 
^-^^ 120 



AAV 125 

Ala Ala Leu Tyr Arg Leu Gly Ala Lys His L=u Arg Ala Ala Trp Met 



A.P A=n L«,. Arq Leu Al. I'^l Gly pco Gly Leu gJu lie Gly Leu Gly 



Gly He Tyr Ser Thr ^cg A-^n Gly A^n Asn Pro Ala Thr Leu Lys llu 

Tyr Thr Asn Ala He Ala Gin Ala Ser III Gly Tyr Tyr Val Pro Ser 

<31u Thr Ph. Pro L.u Tyr Phe Arg ^^u Leu Ser Gin He ^1° Leu Phe 

200 205 
Gly lie Ala Tyr Gly Asn Gly Phe Gly Glu Ser Tyr Tyr Glu Asn Phe 



Leu Leu Asn Asn Gly lie Gly Ser Leu His Phe Thr Tyr Pro Gl, 



230 



Lys Phe Thr Arg Phe Thr Thr Leu He Thr Asp He Pro He A^g 

Asn Phe Cys Thr Leu Arg Val Gly Tyr "g Tyr Ser His Leu g" Ser 
Ser Leu Asn Ala Leu Asp Thr Arg h" His Ser His Thr Ma Phe He 



275 



Gly Phe val Thr Glu Phe Tyr Arg Phe Arg Gly Arg Lys Ala He. Asn 

^ 300 
Thr Gly Arg Arg Thr Ser Leu Tyr Tyr His Asp 
^05 310 31^ 

(2) irrrORJIATIOH for SEQ id NO: 497 

(II SEQUENCE CHARACTERISTICS' 

(A) LENGTH: 285 amino acids 

(B) TYPE: amino acid 
(U) TOPOLOGY: linear 

(il) IKDf.ECULE TYPE: protein 

Uii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAJUSH: Porphyromonas glnglvalis 
(ixl FEATURE: 



PCT/AU98/01023 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



^Tsit. i)j=i. tp™ fi^ii ^ "^^i ""^^ ^p^' 'P*''' jf"-ii 
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(AJ NAI-IE/KEY: misc^f ea ture 
(B) LOCATIOti 1 . . .285 



(Xlj SEQUEHCF. DESCRIPTIOIl: SEQ ID HO: 4 97 



Met VaX 


al 


Thr 


Leu 


Leu 


Vdl 


He 


Va 1 


Gly 


He 


Val Val 


Val Val 


Arg 


X 
















10 






15 




Tyr Ssr 




Arg 


val 




val 


His 


Lys 


Thr 


Gly 


Thr Val 


Val Ssr 


Ala 






20 










25 








30 




Ala lie 


Phe 


Gly 


rhe 


He 


Leu 


Lev) 


Civ 


Lys 


Thr 


Val Pro 


Cys Asp 


Thr 




35 










40 








45 








Phe 


Phe 


er 


er 




Ser 


Asp 


Glu 


Pro 


Glu Ser 


Arg Val 


Al a 


50 










55 










60 




Tl- c 1 1 1 




Al a 


niB 


Leu 


Cys 


Glu 


He 


Cly 


Phe 


Gin He 


Mis Ala 


Ser 


65 








7 0 










75 






80 


Ser lie 


Ills 


Val 


Ala 


Val 


Arg 


Thr 


Asp 


Phe 


Gly Gin Ala 


Gly He 


His 








85 










90 






95 




Cys Pro 


Met 


Ala 


Thr 


Asp 


Ala 


Ser 


Ala 


Thr 


Glu 


Phe A^p 


Arg Ser 


Ala 






100 










105 








HO 




Glu Cy^ 


Ala 


Glu 


Arg 


Thr 


Ser 


Ala 


Gin 


He 


Asp 


Thr Ala 


He Arg 


Scr 




115 










120 








125 






Gin Ssr 


Gin 


He 


He 


Arg 


Thr 


His 


He 


Asp 


Thr 


Cys Pro 


Lys Ser 


Ser 


130 










135 
















r:ly Thr 


He 


Gly 


Glv 


Ser 


Thr 


His 


Thr 


Ser 


Leu 


His Leu 


Lys Val 


Pho 


145 








ISO 










155 




160 


JKsp Gly 


Arg 


Gly 


Glu 


Val 


oly 


His 


He 


His 


Pro 


Lyp Asp 


Gly Leu 


Arg 








165 










170 






175 




Phe Gly 


Val 


Vsl 


Glu 


Gly 


Tyr 


Ser 


He 


Gly 


Ser Tyr Val 


Asp Ala 


He 






180 










185 








190 




my He 


Gly 


Ala 


Thr 


His 


Ala 


Lys 


Ala 


Gl y 


He 


Ser Asp 


Thr Arg 


Thr 




195 










200 








205 




Gly He 


Ala 


Gly 


Gly 


Tyr 


ins 


Gly Gly 


Ser 


Gin 


His Gin 


Gin He 


Gly 


210 










215 










220 




Asp Val 


Thr 


Thr 


Il« 


He 


Gly 


Leu 


Gly 


Lys 


Phe 


Gly Leu 


Ala Asrt 


Val 


225 








230 










235 






240 


Gly Val 


Ser 


Asp 


Arg 


Gly 


Phe 


Arg Arg 


Gly 


Ala 


Ser Ser 


Tyr Asp 


Leu 








245 










2S0 






255 




Tyr Gly 


Leu 


Glu 


Leu 


His 


He 


Ala 


Lys 


Thr 


He 


Tyr Leu 


Ala Val 


Gly 






260 










265 








270 




Asp Gly 


Cys 


Leu 


Cys 


Arg 


Gin 


Ala 


Glu 


Arg 


Gin Acp Gly 







(2) INFORIIATIQM FOR SEQ ID NO: 4 98 

(i) SEQUEIJCE CHARACTERISTICS: 

(AJ IXHGTH: 599 anvino acids 
(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(il) HOLECULE TYPF.: protein 

(iiil HiPOTHETICAL: YES 

(vi> ORIGINAL SOURCE: 

(A) ORGA)^ISM: Porphyromonast glngivalis 

(ix) FEATURE: 

(A) tlAltE/KT.Y: iTii5c_f eature 
<B) LQCATIOH 1. . .599 



(Xi) SEQUEJJCE DESCRIPTION: SEQ ID NO: 4 99 



Met 


Glu 


Asn 


Leu 


Lys 


Asn 


Ho 


Gin 


Pro 


Arg 


Glu 


Asp 


Phe 


Asn 


Trp 


Glu 


1 








5 










10 










15 




Glu 


Phe 


Glu 


Ala 


Gly 


Gly 


Val 


His 


Ala 


Ala 


Val 


Ser 


Arg 


Gin 


Glu 


Gin 








20 










25 








30 






Glu 


Ala 


Ala 


Tyr 


Affp 


Lys 


Thr 


Leu 


Asn 


Thr 


He 


Lys 


Glu 


Lys 


Glu 


Val 






35 










40 










45 








Val 


Met 


Gly 


Arg 


Val 


Thr 


Ala 


He 


Asn 


Lys 


Arg 


Glu 


Val 


Val 


He 


Asn 




50 










55 








60" 










Val 


Gly 


Tyr 


Lys 


Ser 


Glu 


Gly 


Vai 


Val 


Pro 


Ala 


Thr 


Glu 


Phe 


Arg 


Tyr 


65 










70 










75 








80 


Asn 


Pro 


Glu 


Leu 


Lys 


Val 


Gly 




Glu 


Val 


Glu 


Val 


Tyr 


He 


Glu 


Asn 










85 










90 










95 




Gin 


Glu 


Agp 


Lys 


Lys 


Gly 


Gin 


Leu 


Val 


Leu 


Scr 


HI.? 


Arg 


Lys 


Gly 


Arq 



275 



280 



285 
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100 










105 










110 




Ala 


Ala 


Arg 


Ser 


Trp 


Glu 


Arg 


Val 


Asn 


Glu 


Ala 


Leu 


Glu 


Lys 


Asp Glu 






115 










120 










125 






lie 


Val 


Lys 


Gly 


Tyc 


Val 


Lys 


Cys 


Arg 


ThL 


Lys 


Gly Gly Met 


He Val 




130 










135 










140 








Asp 


Val 


?Ufi 


Gly 


lie 


Glu 


Ala 


Phe 


Leu 


Pro 


Gly 


Ser 


Gin 


He 


Asp Val 


145 










150 










153 








160 


Aro 


Pro 


lie 


Arg 


Arp 


i yr 


Asp 


Ala 


Phe 


Val 


Glu 


Lys 


Thr 


Wet 


Glu Phe 








1 65 










170 










175 


1 *.-c 


ll9 


Val 


L*'S 


lie 


Asn 


Gin 


Clu 


Tyr 


Ly* 


Asn 


Val 


Val 


Val 


Ser His 






180 










195 










I 90 




i^i 3 


Val 


Leu 


lie 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 


Lye 


Lys 


Glu 


Ho He 




195 










200 










205 








L* 'E 


Leu 


Gl u 


Lvs 


11 V 


Gin 


Val 


Le 
eu 


Gl u 


Gly 


lie 


Val 


Lys 


Asn He 




210 










215 










220 








Thr 


Ser 


'I'vr 
'■i 




Val 


Phe 


lie 


ASD 


Leu 


Glv 


Gly Val 


Asp 


Giy Leu He 












230 










235 








240 


His 


I le 


Thr 


•P 


L'»u 


Ser 


Trp 


Glv 


Ara 


Val 


Ala 


His 


Pro 


Giu 


Glu He 








2-15 










250 










255 


Yal 


... 


Lsvj 


Asp 


Gin 


Lys 


lie 


Asn 


Val 


Val 


lie 


Leu 


Asp 


Pho Asp Glu 








2 60 










265 










270 




Asp 






Arg 


1 1 o 


Ala 


L*-u 


Glv 




Lys 


Gin 


Leu 


Met 


Pro 


His Pro 




275 










280 










285 






rrp 


Asp 


Ala 


L-^u 


Asp 


Ser 


Glu 




Lvs 


Vai 


Gly Asp 


Lys 


Val 


hys Gl y 




290 










295 










300 








Lys 


Val 


Val 


Val 


Met 


Ala 




1 y r 


Glv 


Ala 


Phe 


Val 


Glu 


He 


Ala Gin 


305 










310 










315 








320 


ni \T 
VjJ y 


Val 


Glv.) 


Gly 


L«»u 


lie 


His 


Val 


Sec 


Glu 


IteC 


Ser 


Trp Thr 


Gin His 








325 










330 










335 


Leu 




Ser 


Ala 


Gin 


Asp 


Phe 


T.GU 


His 


Val 


Gly 


Asp 


Glu 


Val 


Glu Ala 








34 0 










345 










350 




Val 


lie 


I «»u 


Thr 


L-u 


ASD 


Ara 


Giu 


Giu 


Ara 


Lys 


Het 


Ser 


Leu 


Gly Leu 






355 










360 










365 






Lys 


Gin 


Leu 


Lvs 


Pro 


AsD 


Pro 


Trp 


Ala 


Asp 


lie 


Glu 


Thr 


Arg 


Phe Pro 


370 










375 










360 








Val 


Gly 


Ser 


Arg 


His 


His 


Ala 


rg 


ai 


Axg 


Asn 


Phe 


Thr 


Asn 


Phe Gly 


v^t 










390 










395 








4 00 


Val 


c 


Vnl 




lie 


Glu 


Gl u 


*jj.y 


Val 


Hsp 


Gly 


Leu 


Il9 


His 


He Ser 










4 05 










410 










415 


Asp 


Leu 




Trp 


Thi 


ys 


T 

jji 


lie 


Lys 


His 


Pro 


Ser 


Glu 


Phe 


Thr Glu 








4 20 










4 25 










430 




Val 


Gl y 


Ala 


Asp 


lie 


Glu 


Val 


Gin 


Val 


lie 


Glu 


He Asp Lys Glu Asn 






435 










440 










445 






Arg 


Arg 


Leu 


Ser 


L^su 


Gly 


(lis 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 


Pro 


Trp Asp 




450 










4S5 










460 








Val 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Giu 


Gly Thr Val 


465 










470 










475 








460 


114 


Ulu 


Val 


Met 


Asp 


Lys 


Gly 


Ala 


Vol 


Val 


Ser 


Leu 


Pro Tyr Gly Val 










403 










490 










495 


Glu 


Gly 




Ala 


Thr 


Pro 


Lys 


His 


Met 


Val 


Lys 


Giu 


Asp Gly Ser Gin 








500 










505 










510 




Ala 


Val 


Leu 


Glu 


Glu 


Lys 


Leu 


Pro 


Phe 


Lys 


Val 


He 


Glu 


Phe 


Asn Lys 






SIS 










520 










525 






Af p 


Ala 


Lys 


Arg 


lie 


lie 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 


Glu 


Asp Glu 




530 










535 










540 








Gin 


Lys 


Met 


Ala 


Gin 


Arg 


Glu 


Ala 


Asn 


Ala 


Glu 


Arg 


Lys 


Ala 


Glu Ala 












550 










555 








560 


L/s 


Ala 


Ala 


Gin 


Lys 


Glu 


Ala 


Ala 


Ala 


Glu 


Ala 


Ala 


Asn 


Pro 


Ala Gin 










565 










570 










575 


Ala 


Vai 


Glu 


Lye 


Ala 


The 


Leu 


Gly 


Asp 


Leu 


Gly 


Glu 


Leu 


Ala 


Ala Leu 








590 










585 










590 




Lys 


Glu 


Lys 


Le:u 


Ser 


Glu 


Asn 



















595 



(2] IMFORIiATIOn FOR SEQ ID MO; 4 99 

(11 SEQUENCE CHARACTERISTICS: 

[A) LRtlGTH: 550 amino acids 

(B) TYPE: amino acid 
tU) TOPOLOGY: linear 

(xi) IIOI.ECULE TYPE: protein 

(iiil HYPOTHETICAL: YES 



SUBSTTTUTE SHEET (Rule 26) (RCVAU) 



10 



45 
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WO 99/29870 

450/490 

(vl) ORIGINAL SOURCE: 

(A) ORGANIST-!: Porphyromonas glngivsH? 

(ix) FEATURE: 

(A) MAI-IE/KEY: raise feature 

(B) LOCATIOI^ 1. . .550 

(xi) SEQUEWCE DESCRIPTIOH: SEQ ID JJO:J99 

Met Gly Arg VM Thr Ala lie Asn Lys Arg Glu Val Val He A^n Val 

Gly Tyr Lys Ser GIu Gly Val Val Pro IL Thr Glu Phe Arg Tyr A^n 

Pro Glu Leu Ly. Val Gly Asp Glu Val Glu Val Tyr He Glu A.n Gin 

Glu Asp Ly. Lys Gly Gin Leu Val Leu Se. Hi. Arg Lys Gly Arg Ala 

Ala Arg Ser Trp Glu Arg Val Asn Glu Ala L^u tlu Lys Asp Glu lie 

20 val Ly. Gly Tyr Val Lys Cys Arg Thr Lys lly Gly Met He Val Asp 

90 Qi^ 
V.l Ph. Gly ne Clu Ala Ph, L,u Pro ^ly ser Gin He A.p Val Arc, 

105 110 

25 ^" Tit '^'^ "^^^ """^ 

Ile Val Lys II, A^n Gin Slu "-r Ly, A^n Val val vlt s„ HI. Ly3 

Val Leu He Glu Ala Glu ^eu Glu Gin Gin Ly. Jys Glu He He Gly 
or* 150 155 1 

30 Lys Leu Glu Ly« Gly Gin Val I^u Glu Gly He Val Ly, A.n He Thr 

ser Tvr Gly Val Ph. He Leu Gly «y Val A.p Gly Leu He His 

35 '*^" '^9 "1 Ala Hi, Pro Glu Gl'u He Val 

200 205 
Gin Leu ASP Gin Ly. He A«n Val Val He Leu A^p Phe Asp Glu Asp 

Z15 220 
Arg Lys Arg He Ala Leu Gly Leu Lys Gin Leu Met Pro Hi, Pro Trp 

40 ASP Ala Leu Asp Ser Glu Leu Lys Val Gly Asp Ly. Val Lys Gly Lys 



Val Val Val Met '^l Asp Tyr Gly Ala Ihe Val Glu He Ala Gin Gly 

26S ■?Tn 
Val Glu Gly Leu Ho His Val Ser Glu Het Ser Trp Thr Gin Mi, L-u 
_ 2B0 285 

Arg S,r Ala Gin Asp Phe Leu His Val Gly As-p Glu Val Glu Ala Val 

305 '^'-5 I-y-' "-^ ser Leu Gly Leu Lys 

50 Gin Leu Lys Pro Asp Pro Trp Ala Asp He g" Thr Arg Ph. Pro Val 
Gly ser Arg His His Ala Arg Val Arg "n Phe Thr Asn Phe Gly Val 

55 ^''^ '''' 35^ -^^y 11« His III Ser Asp 

360 3SS 

Leu Ser Trp Thr Lys Lys He Lys His Pro Ser Glu Phe Thr Glu Val 

Gly ... t!^ 380 



Ala Asp He Glu Val Gin Val He Glu He Asp Lys Glu Asn Arg 
60 Arg Leu Ser Leu Gly His Lys Gin Leu Glu Glu Asn Pro Trp Asp Vai° 
Phe Glu Thr Val Phe Thr Val Gly Ser He His Glu Gly Thr Val He 
Glu val Met ASP Lys Gly Ala Val Val' Ser Leu Pro Tyr tly Val Glu 
Gly Phe Ala Thr Pro Lys His ,'et Val Lys G,u Asp oJy Ser Gin Al. 



65 



460 
Glu 

70 aI: Lys Arg He He vli° Ser His Ser Arg Val Phe Glu Asp Glu r.Jn 



yal Leu G,u Glu Lys Leu Pro Phe Lys Vai He Glu Phe Asn Lys Asp 



485 ^95 



Lys net Ala Gin Arg Glu Ala Asn Ala Glu Arg Lys Ala Glu *pll Lyr 

Ma Ala Gin Lys Glu Ala Ala Ala Glu Ala Ala Asn Pro ^'a° Gin Ala 

520 



PCT/AU98/01023 
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Val Glu Lys Ala Thr Leu Gly Asp Leu Gly Glu Leu Ala Ala Leu Lyt" 

530 535 540 

Glu Lys Leu Ser Glu Asn 



{2] IHFORIIATIOU FOR SEQ ID MO: 500 

(ij SEQUEItCE CfiARACTERISTICS: 

(A) LENGTH: 4 58 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(iU MOLECULE TVPE; proLein 

aiil HVPOTHETICAI.: YES 

(vi) OiUGlHAL SOURCE: 

(A! ORCAIiISM: Po cphyromonas gingivalis 

(1.^) FEATURE: 

(A) ^iAHE/^:EV: misc_feature 
( D } LOCATIOt^ 1 ... 4 5 □ 

(Xi) SEQUENCE CESCRIPTIOII: SEQ ID 1)0:500 



Met 


lie 


Val 


Asp 


Val 


Phe 


Gly 


He 


Glu 


Ala 


Phe 


Leu 


Pro 


Gly Str 


Gin 


1 








5 










10 










15 




He 


Asp 


Val 


Arg 
20 


Pro 


Ho 


Arg 


Asp 


Tyr 
25 


Asp 


Ala 


Phe 


Val 


Glu 
30 


Lys 


Thr 


ir-f. 


Glu 


Phe 
35 


Ly= 


He 


Val 


Lys 


He 
4 0 


Asn 


Gin 


Glu 


Tyr 


Lys 
4 5 


Asn 


Val 


Val 


Val 


Ser 


His 


Lys 


Val 


Leu 


He 


Glu 


Ala 


Glu 


Leu 


Glu 


Gin 


Gin 


Lys 


Lys 




50 








55 










60 








Glu 


He 


He 


Gly 


Lys 


Leu 


Glu 


Lys 


Gly 


Gin 


Val 


Leu 


Glu 


Gly He 


Vol 


65 










70 










75 










80 


Lye 


Asn 


He 


Thr 


ser 
85 


Tyr 


Gly 


Val 


Phe 


He 
90 


Asp 


Leu 


Gly 


Gly 


Val 
95 


Asp 


Gly 


Leu 


He 


His 
100 


He 


Thr 


Asp 


Leu 


Ser 
105 


Trp 


Gly 


Arg 


Val 


Ala 
110 


His 


Pro 


Glu 


Glu 


He 
115 


Val 


Gin 


Leu 


A^p 


Gin 

120 


Lys 


He 


Asn 


Val 


Val 
125 


He 


Leu 


Asp 


Pl>e 


Asp 


Glu 


Asp Arg 


Lys 


Axg 


He 


Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


r-ict 




130 










135 










140 










Pro 


His 


Fro 


Trp 


A^p 


Ala 


Leu 


Asp 


Ser 


Glu 


Leu 


Ly5 


Val 


Gly 


Asp 


Lys 


145 










150 










155 










160 


Val 


Lys 


Gly 


Lys 


Val 
165 


Val 


Val 


^feL 


Ala 


Asp 
170 


Tyr 


Gly 


Ala 


Phe 


Val 
175 


Glu 


lie 


Ala 


Gin 


Gly Val 


Glu 


Gly 


Leu 


He 


Hl5 


Val 


Ser 


Glu 


Met 


Ser 


Trp 








180 










185 










190 






Thr 


Gin 


His 


Leu 


Arg 


Ser 


Ala 


Gin 


PiSp 


Phe 


Leu 


His 


Val 


Gly Asp 


Glu 






195 










200 










205 








Val 


Glu 
210 


Ala 


Val 


Tift 


Leu 


Thr 
215 


Leu 


Asp 


Arg 


Glu 


Glu 

220 


Arq 


Lys 


Met 


Ser 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Lys 


Pro 


Asp 


Pro 


Trp 


Ala 


Asp 


He 


Glu 


Thr 


225 










230 










235 












Arg 


Phe 


Pro 


Val 


Gly 
245 


Ser 


Arg 


His 


His 


Ala 

250 


Arg 


Val 


Arg 


Asn 


Phe 
255 


Thr 


Asn 


Phe 


Gly 


Val 


Pile 


Vol 


Glu 


lie 


Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 








260 










265 








270 






His 


He 


Ser 


Ai?p Leu 


Ser 


Trp 


Thr 


Lys 


Lys 


He 


Lys 


His 


Pro 


Ser 


Glu 






275 










280 










265 








Phe 


The 


Glu 


Val 


Gly Ala 


Aep 


He 


Glu 


Val 


Gin 


V3l 


He 


Glu 


He 


Asp 




290 










295 










300 










Lys 


Glu 


Affn 


Arg 


Arq 


Leu 


Ser 


Leu 


Gly 


His 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 


305 










310 










315 










320 


Pro 


Tcp Asp 


Vbl 


Phe 


Glu 


Thr 


Val 


Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 










32t 








Lys 


330 










335 




Gly Thr 


V«l 


He 


Glu 


Val 


Het 


Asp 


Gly 


Ala 


Val 


Val 


Sec 


Leu 


Pro 








310 










34 S 










350 






Tyr Gly Val 


Glu Gly 


Phe 


Ala 


Thr 


Pro 


Lys 


His 


llet 


Val 


Lys 


Glu 


Asp 






355 










360 










365 






Gly 


Ser 
370 


Gin 


Ala 


Vai 


Leu 


Glu 
37 5 


Glu 


Lys 


Leu 


Fro 


Phe 
3B0 


Lys 


Val 


He 


Glu 


Phe 


Asn 


Lys 


A?p 


Alo 


Lys 


Acq 


He 


lis 


Val 


S^r 


His 




Arg 


Val 


Phe 


385 










3?0 










395 










4 00 



545 



550 
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A^p 


Glu 


Gin 


Lyc 
4 05 


Met 


Aia 


Gin 


Arg 


Glu 
410 


Ala 


Asn 


Ala 


Glu 


Arg 
415 


Lys 


Ala 


Glu 


Ala 


Lys 
420 


Ala 


Aia 


Gin 


Lys 


Glu 
42b 


Ala 


Ala 


Ala 


Glu 


Ala 
430 


Ala 


Asn 


Pro 


Ala 


Gin 

435 


Ala 


Val 


Glu 


Lys 


Ala 
44Q 


Thr 


Leu 


Glv 




Leu 
445 


Gly 


Glu 


Leu 


Ala 


Aia 
450 


Leu 


Lys 


Glu 


Lys 


Leu 

455 


Sec 


Glu 


Asn 















(2) IIIFORJiATION FOR S£Q JO UOrSOl 

(i) SEQUEHCt: CHARACTERISTICS: 

(A) l-EUGTH: 4 26 amino acid^ 
(DJ TYPE: amino acid 
{DJ TOPOLOGY; linear 

Hi) lIOLF.ajJ.E TfFE: protein 

(iii) HYPOTHETICAL: YES 



(vi) ORIGIMAL SOURCE: 

(AJ ORGAmSlh Forphyromonas glnglvalls 



Ux) FEATURE: 

(A) MAHE/KEY: mi5C_fe 

(B) LOCATION 1. . .426 



SEQUBNCE DESCRIPTION: 



Met 


Glu 


Phe 


Lys 


He 


Val 


Lys 


He 


1 








5 








Val 


Ser 


His 


Lys 


Val 


Leu 


He 


Glu 








20 










Glu 


He 


He 


Gly 


Lys 


Leu 


Glu 


Lys 






35 










40 


Lys 


Asn 


He 


Thr 


Ser 


Tyr 


Giy 


Val 




5Q 










55 




Gly 


Leu 


Ho 


His 


He 


Thr 


Asp 


Leu 


65 










70 






Giu 


Glu 


ile 


Val 


Gin 


Leu 


Aap 


Gin 










85 








Phe 


hep 


Glu 


Asp 


Arg 


Lys 


Arg 


He 








100 










Pro 


His 


Pro 


Trp 


Asp 


Ala 


Leu 


Asp 






H5 










120 


Val 


Lye 


Gly 


Lys 


Val 


Val 


Val 


HeC 




130 










135 




He 


Ala 


Gin 


Gly 


Val 


Glu 


Gly 


Leu 


145 










ISO 






Thr 


Gin 


His 


Leu 


Arg 


Ser 


Ala 


Gin 










165 








Val 


Glu 


Ala 


Val 


He 


Leu 


Thr 


Leu 








180 










Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Lys 


Pro 






195 










200 


Arg 


Phe 


Pro 


Val 


Gly 


Ser 


Arg 


His 




210 










215 




A.*n 


Phe 


Giy 


Val 


Phe 


Val 


Glu 


He 


225 










230 






His 


He 


Sex 


Asp 


Leu 


Ser 


Trp 


Thr 










24b 








Phe 


Thr 


Glu 


Val 


Gly 


Aia 


Asp 


He 








260 










Ly.-T 


Giu 


Asn 


Arg 


Arg 


Leu 


Ser 


Leu 






215 










280 


Pro 


Trp 


Asp 


Val 


Phe 


Glu 


Thr 


Vai 




290 










295 




Gly 


Thr 


Val 


He 


Glu 


Val 


Met 


Asp 


305 










310 






Tyr 


Gly 


Val 


Giu 


Gly 


Phe 


Ala 


Thr 










325 








Gly 


Ser 


Gin 


Ala 


Vai 


Leu 


Glu 


Glu 








340 










Phe 


Am 


Lyff 


As^p 


Ala 


Lys 


Arg 


lis 






355 










360 



ature 



SEO ID 1/0:501 



Asn 


Gin 


Glu 


Tyr 


Lys 


Asn 


Val 


Val 




10 










15 




Ala 


Glu 


Leu 


Giu 


Gin 


Gin 


Lys 


Lys 


2b 










30 






Gly 


Gin 


Val 


Leu 


Glu 


Gly 


He 


Vai 










45 








Phe 


He 


Asp 


Leu 


Gly Gly 


Val 


Asp 








60 










Ser 


Trp 


Gly 


Arg 


Val 


Ala 


His 


Pro 






75 










ao 


Lyr 


He 


Asn 


Val 


Val 


He 


Lciu 


Asp 




90 














Ala 


Leu 


Gly 


Leu 


Lys 


Gin 


Leu 


Met 


1C5 










110 






Ser 


Glu 


Leu 


Lys 


Vai 


Gly 


Asp 


Lys 










125 








Ala 


Asp 


Tyr 


Gly 


Ala 


Phe 


Vai 


Glu 








140 










He 


Hts 


Val 


Ser 


Glu 


Met 


Ser 


Trp 






155 










160 


Asp 


Phe 


Leu 


His 


Val 


Giy 


Asp 


Glu 




no 










175 




Aep 


Arg 


Glu 


Glu 


Arg 


Lys 


Met 


Ser 


185 










150 






Asp 


Pro 


Trp 


Aia 


Asp 


He 


Giu 


Thr 










205 








His 


Ala 


Arg 


Val 


Arg 


Asn 


Phe 


Thr 








220 










Glu 


Glu 


Gly 


Val 


Asp 


Gly 


Leu 


He 






235 










240 


Ly.? 


Lys 


He 


Lys 


His 


Pro 


Ser 


Giu 




250 










255 




Giu 


Val 


Gin 


Val 


lie 


Giu 


He 


Asp 


265 










270 






Gly 


His 


Lys 


Gin 


Leu 


Glu 


Glu 


Asn 










285 








Phe 


Thr 


Val 


Gly 


Ser 


He 


His 


Glu 








300 










Lys 


Gly 


Ala 


Val 


Val 


Ser 


Leu 


Pro 






315 










320 


Pre 


Lys 


HK*? 


Mef 


Val 


Lys 


Glu 


Asp 




330 










325 




Lys 


Leu 


Pro 


Phe 


Lys 


Val 


He 


Glu 


34 5 










350 






lis 


Val 


Ser 


His 


Ser 


Arg 


Val 


Phe 










365 
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Glu 


3-70 


Glu 


Gin 


Lys 


H-t 


Ala 
375 


Gin 


Acg 


Giu 


Ali* 


Aifii 
300 


Ala 


Glu 


Arg 


Lye 


Ala 


Glu 


Ala 


Lys 


Ala 


Ala 


Gin 


Lys 


Glu 


AI a 


Ala 


Al a 


Glu 


Ala 


Ala 


Asn 


305 










390 










395 










400 


Pro 


Ala 


Gin 


AI a 


Val 
405 


Glu 


LysT 


Ala 


Thr 


Leu 
410 


Gly 


Asp 


Lau 


Gly 


Glu 
415 




Ala 


Ala 


Leu 


Lys 
420 


Glu 


Lyi- 


Leu 


Ser 


Glu 
12^ 


Am 















12) lrJEX>RIIATTOtl FOR SF.Q ID riO:502 

(i| b'KQUtllCE CHARACTERISTICS: 

(A) LEMGTH: 24 0 amino acids 

(B) T'i'Pr.: amino acid 
ID) TOPOLOGY: lin-sar 

(ii) MOLECULE TYPE: protsin 

(iii) HYPOTHETICAL: YES 

(Vl) ORTGTNAL SOURCE: 

(A) ORGAIJTSM: Porphyroinonas glngivalis 

(ix] FEATURE: 

{A) HAIIE/KEY: misc_f eaiurs 

(B) LOCATIOtJ 1 . , .24 0 

(xi) SEQUEMCE DESCRIPTIOH: SEQ ID NO: 502 



Hot 


Lys 


Lys 


Ala 


He 


Leu 


Ser 


Gly 


Ala 


Ala 


Leu 


Leu 


Leu 


Gly Leu Cys 


1 








5 










10 








lb 


Ala 


Asn 


Ala 


Gin 


Asn 


Val 


Gin 


Leu 


His 


Tyr 


Asp 


Phe 


Gly 


His Ser He 








20 










25 










30 


Tyr 


A.<?p 


Glu 


Leu 


Asp 


Gly 


Arg 


Pro 


Lys 


Leu 


Thr 


Thr 


Thr 


Val Giu Asn 




35 










4C 










45 




Phe 


Thr 


Pro 


Aep 


Lys 


Trp 


Gly 


Ser 


Thr 


Phe 


Phe 


Phe 


He 


Asp Met Asp 




50 










55 










60 






Tyr 


Thr 


Gly 


Lys 


Gly 


lie 


Gin 


Ser 


Ala 


Tyr 


Trp 


Giu 


He 


Ser Arg Glu 


65 










70 










75 






80 


Leu 


Lys 


Phe 


Trp 


Gin 


Ala 


Pro 


Val 


Ser 


lie 


His 


Leu 


Glu 


Tyr Asn Gly 










85 










90 








95 


Gly 


Lgu 


Ser 


Thr 


Ser 


Phe 


Thr 


Phe 


Gly 


His 


Asp 


Ala 


Leu 


He Gly Ala 








100 










105 










110 


Thr 


Tyr 


Thr 


Tyr 


Asn 


Asn 


Pro 


Ser 


Phe 


Thr 


Arg 


Gly 


Phe 


Thr He Thr 






115 










120 










125 




Pro 


Mst 


Tyr 


Lys 


His 


Leu 


Gly 


Ala 


His 


Asp 


Phe 


His 


Thr T^^r Qln He 




130 










135 










140 






Thr 


Gly 


Thr 


Trp 


Tyr 


Met 


His 


Phc 


Leu Asp 


Gi- 


Leu 


Leu 


Thr Phe Asn 


14S 










150 










les 






160 


Gly 


Phe 


].eu 


Asp 


Leu 


Trp 


Gly 


Phe 


Pro 


Gin 


Glu 


Asn 


Pro 


He Gly Gly 










165 










170 








175 


Pro 


Val 


Leu 


Lye 


61 u 


Gly 


Acp 


Lys 


Phe 


Val 


Phe 


Leu 


Ser 


Glu Pro Gin 








180 










185 










190 


Phe 


Trp 


lie 


Asn 


Lifiu 


Asn 


Arg 


He 


Lys Gly 


lie 


Acp 


Lys 


Arp Phe Asn 






195 










200 










205 




Leu 


Ser 


lie 


Gly 


Thr 


Glu 


Met 


Glu 


He 


Ser 


Arg 


Asn 


Phe 


Ala Arg Het 




210 










215 










220 






Asp 


U/s 


Phe 


Ser 


Cys 


He 


Pro 


Thr 


Leu 


Ala 


Val 


Lys 


Trp Thr Phe Asn 



230 235 240 



(2( IHFORI-IATIOIJ FOR SEQ ID 110:503 

fi) SEQUENCE CHAKACTERISTICS: 

(A) LKNGTH: 4 34 amino acids 

(B) TYPE: amino acid 
[D) TOPOLOGY: linear 

til) MOLECULE TYPE; protein 

(iii) HYPOTHETICAL: YES 

(Vll ORIGINAL SOURCE; 

(A) ORGAIJISH: Porphyromonas gingival! s 

(ixl FEATURE: 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



Jill!, .-^i „5 -A?==* -rtJ' ri"*" ilS. ^T.S if^l'. 

O ^ Q «^ ic::; g u o c:: ca. u u 



WO 99/29870 PCT/AU98/01 023 

454/ 490 

(A) IIAI'IB/KEY: ml ccf oat lire 
(B> LOCATICW i ... 434 

txi} SEQUEliCE DESCRI FTIOl I : SEQ ID I JO: 503 



Met 


Tyr 


Lys 


Asp 


Tyr 


Lys 


Gly 


Leu 


Tyr 


Ala 


Ser 


Leu 


Arg 


Trp 


Tyr Ala 


1 








5 










10 










15 




Leu 


He 


He 


Gly 


Leu 


Leu 


Phe 


Ala 


Ala 


Asp 


Gly 


He 


Gin 


Ala 


Gin 


Asn 








20 










25 










30 






Asn 


Asn 


Phs 


Thr 


Glu 


Ser 


Pro 


Tyc 


Thr 


Arg 


Phe 


Gly 


Leu 


Gly Arg 


L=\i 






35 










40 










4b 








Gly 


Glu 


Arg 


Thr 


Thr 


He 


Ser 


Gly 


His 


Ser 


net 


Gly 


Gly 


Leu 


Giy 


Val 




50 










55 










60 










Gly 


Leu 


Acq 


Gin 


Gly 


Thr 


Tyr 


Val 


Asn 


Alii 


VaI 


Asn 


Pro 


Ala 


Ser 


Tyr 


65 










70 










7 5 










80 


Syr 


Ala 


Val 


Aep 


Ser 


Het 


Thr 


Phe 


He 


Phe 


Asp 


Phe 


Gly 


Ala 


Ser 


Thr 








85 










90 










95 




Gly 


He 


The 


Trp 


Tyr 


Ala 


Giu 


Asn 


Gly 


Lys 


Lys 


Asp 


Asn 


Arg 


Lys 


Met 








100 










105 










HO 






Gly 


Asn 


He 


Glu 


Tyr 


Phe 


Ala 


Met. 


Leu 


Phe 


Pro 


He 


Ser 


Lys 


Ser 


He 




115 










120 










125 








ALa 


ll-st 


Sec 


Ala 


Gly 


Vil 


Lt£U 


Pro 


Tyr 


S^r 


Ala 


Ser 


Gl y Tyr Gin 


Phe 




130 










135 










140 










Gly 


Sec 


Val 


A-p 


Gin 


Val 


Glu 


Gly 


Giy 


Ser 


Val 


Gin 


Tyr 


Thr 


Ary 


Lys 


1-15 










ISO 










155 










160 


Tyr 


L«U 


Gly 


Thr 


Gly 


Asn 


Leu 


A^n 


Arp 


Leu 


Tyr 


Val 


Gly 


He 


Gly 


Ala 










165 










17 0 










175 




Thr 


Pro 


Fhs 


r.ys 


Asn 


Phe 


Ser 


He 


Gly 


Ala 


Asn 


Ala 


Ser 


Ser 


Leu 


PJie 








100 










185 










190 






Gly 


Arg 


Phe 


Thr 


His 


Ser 


Arg 


Gin 


Val 


He 


Phe 


Ser 


Thr 


Glu 


Ala 


Pro 




195 








200 










205 








Tyr 


Am 


Pro 


Val 


His 


Leu 


Ser 


Thr 


Leu 


Tyr 


Leu 


Lys 


Ala 


Ala 


Lys 


Phe 




210 










215 










220 










Asp 


Phe 


Gly 


Met 


Gin 


Tyr 


His 


Leu 


Leu 


Leu 


Lys 


Ssr 


Asp Arg 


Ser 


Leu 


225 










230 










235 










240 


Val 


He 


Gly 


Ala 


Val 


Tyr 


Ser 


Pro 


Arq 


Val 


Lys 


Met 


His 


Ser 


Glu 


Leu 








245 








250 










255 




Thr 


Gin 


He 


Lys 


Asn 


Gin 


Val 


Gin 


Asn 


Gly 


Val 


Val 


Val 


Glu 


Ser 


Giu 








260 










265 










270 






Thr 


Gin 


Glu 


Tyr 


Tie 


Lys 


Gly 


Met 


Asp 


Tyr 


Tyr 


Thr 


Leu 


Pro 


His 


Thr 






275 










280 










285 








Leu 


Gly 


He 


Gly 


Phe 


Ser 


Tyr 


Glu 


Ly3 


Lys 


A.3P 


Lys 


Leu 


Leu 


Leu 


Gly 




290 










2 9b 










300 










Ala 


Arp 


Val 


Gl n 


Tyr 


Ser 


Lys 


Trp 


Lys 


Gly 


Glu 


Lys 


Phe 


Tyr 


Lys 


Ser 


305 










310 










315 










320 


Asp 


Cys 


Lys 


Phe 


Gin 


Asp 


Arg 


He 


Arg 


Val 


Ser 


J.eu 


Gly 


Gly Glu 


He 










325 










330 










335 




lie 


Pro 


Asp 


H- 


A^n 


Ala 


Val 


Gly 


Het 


Trp 


Pro 


Lys 


Val 


Arg 


Tyr Arg 








34 0 










34 5 










350 






Phe 


Gly 


Leu 


His 


Gly 


Glu 


Arn 


Ser 


Tyr 


Leii 


Lys 


Va) 


Pro 


Thr 


Lys 


Gly 






355 










360 










365 








Gly 


Val 


Tyr 


Gin 


Gly 


Tyr 


His 


He 


Val 


Gly 


Ala 


Val 


Phe 


Gly He 


Gly 




3*7 0 










37 S 










300 










He 


Pro 


Leu 


Asn 


Asp 


Arg 


Arg 


Ser 


Phe 


Val 


Asn 


Val 


Ser 


Leu 


Glu 




385 










390 










395 










4 00 


Asp 


Arg 


Leu 


He 


Pro 


Lys 


Glu 


Gly 


Met 


He 


Lys 


Giu 


Asn 


Ala 


Leu 


Lys 










405 










410 










415 




Leu 


Thr 


Phe 


Gly 


Leu 


Thr 


t?he 


Asn 


Glu 


Ser 


Trp 


Phe 


Lys 


Lys 


Leu 


Lys 








42C 










425 










430 






Leu 

































10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



{2) infORllftTIOtJ FOR SZQ ID MO: 504 

65 (li SEQUEtlCE CHARACTERISTICS; 

(A) LEHGTH: 926 amino acids 
(BJ TYPE: aiaino acid 
{a) TOPOL0G*i: linear 

70 (in MOLECULE TYPE: protein 

Uiit HYPOTHETICAL: YES 

{vi( ORIGTIIAL SOURCE: 
75 (A) ORGAIIISM: Porphyromonas gingivaiis 



SUBSTITUTE SHEET (Rale 26) (RO/AU) 



u '"siJ o . 



wo 99/29870 

455/490 



(ixl FEATURE: 

(A) NAME/KEY: mlscfeature 

(B) LOCATION 1 . . . 926 

(yA) SEQUENCE DESCRI PTIOM : SEQ ID HO: 504 

MeL Aiy Ser He Tyr Gin Leu Leu Lou Scr 11= Leu Leu Ala S-r Leu 

^ 10 lb 

Gly Phe Val Gl y Leu Glu Ala Oin Gin Ala Gly Val Ala Gl^- Arq Val 

Leu Asp Glu Glu Gly Asn Pro Het lie Gin Ala Asn Val Gin Leu Val 

35 40 45 

cm Ser Thr Gly Gin Val Ala Val Ala Ala Gly Ala Thr A^n Glu L-s 

-0 5S 60 

Gly Leu Phe See Leu Lys Thr Ser Gin Glu Gly Asp Tyr He Leu Arg 

75 00 
Val ser Tyr Val Gly Tyr Thr Thr His A^p Glu Lys He Ser Leu Arg 

85 90 05 

Aan Gly Gin Thr He Thr Leu Lys Asp He Ser Msr Asn Glu Asp Ala 

10C» 105 120 

Arg Leu Leu Gin Ser Val Thr Val Gin Ala Lys Ala Ala Glu Val Val 

115 120 125 

Val Arg Asn Asp Thr Leu Gin Phe Asn Ala Gly Ssr Tyr Thr Val Ala 

130 135 
Gin Gly Ala Ser He Glu Glu Leu He Lys Lys Leu Pro Gly Ala Glu 
150 

He Gly Ser Asp Gly Lys He Thr 11- Asn Gly L73 Asp H- S-r L"- 

165 170 175 •* 

He Leu Val Asp Gly Lys Glu Phe Phe Ser Lys Asp Pro Gin Val Ala 

180 185 190 

He Lye Asn Leu Pro Ala Asp Met Val Asn T.ys Val Gin Val L-u Asn 

195 200 205 

Lys Leu Ser Glu Leu Ser Arg Met Ser Gly Phe Asp Asp Gly Glu Glu 

210 215 220 
Glu Thr Val He Asn Leu Thr Val Lys Pro Glu Lys Lys Lys Gly L-u 
230 235 240 

t'he Gly Thr Leu Gin Ala Gly Tyr Gly Thr Asp Gin Arg Tyr Met Ala 
24 5 



250 255 



Gly Gly Asn Val Asn Arg Phe Asp Gly Asn Lys Gin Trp Thr Leu II- 

260 265 270 

Gly Ser Ala Asn Asn Thr Asn Asn Het Gly Phe Ser Glu Met Asp Ser 

275 280 2G5 

Glu Het Gly Scr H^r Thr Phe Phe Ser Pro Gin Gly Gly Gly Arg Arg 

290 295 300 

Gly Phe Gly Asn Ser Gly Gly Val Thr Ser Ser Ser MeL Leu Gly Glv 

310 315 
Asn Phe Snr Val Gin Phe Ser Ser Ala Leu Asn Thr Gly Gly Asp Ala 

325 330 335 

Arg Tyr Gly Tyr Asn Asp Lys Ala He Glu Thr Thr Lys Arg Val Glu 

345 355 
Asn He Leu Ala Glu Gly Asn Thr Tyr Met Asp Glu Asn He Leu Glu 

355 360 365 

Arg Ser PJie Ser His Asn Gly Gin Ala Arg Phe Arg Met Gin Trp Lys 

370 375 380 

Pro Ser Glu Arg Thr Glu Val Val Phe Glu Pro Asp Leu Ser He Ser 
390 305 -Q^ 

Lys He Asp Gly Phe Phe Asn Asp Thr Tyr Glu Thr Lys Asp Ala Thr 

-^OS 410 4j^5 

Gly He Ser He Asn Lys Gly Ser He His Gin Thr Thr Gin Gly Asn 

420 425 430 

Asn Phe Arg Leu Asn Gly Glu Leu Asp Il« Ser His Lys i.eu Asn Asp 

Glu Gly Arq Thr He Ser Ala Ser Val Ser Gly Gly Leu Thr Asp Glu 
455 

Asp Gly Asp Gly He Tyr Gin Ala Val Leu Gin Ser Val Glu Thr Asn 
4 wo 470 475 

Gin Lys Gin Phe Asn Asp Asn Ser Asn Leu Gin Tyr Arg Leu Aro tlu 

^^5 4 90 ^ 05 

Ser Tyr Val Glu Pro Leu Gly Lys Asn Tyr Phe Ala Gin Ala Tl- T.*n 

505 510 
Asn Arg Arg Phe Ser Arg Arg Asn Ser Asp Arg Glu Val Tyr Arg Leu 

515 520 525 

Gly Asp Asp Gly Gin Tyr Ser He Leu Asp Ser Gin Tyr Gly Leu Ser 
^53.' 535 540 



SUBSnrUTE sheet (Rule 26) (RO/AU) 



u^-'h ti«^. a-"" «^ ^ ^F-i, ■«=iii, .pj^, ^puj, -rf:^!, 

iL„il "SJS ™i5 C13 ,«iL, SL. la :ilJi 'O ii:::; O ILJ U 



WO 99/29870 



456/490 

Tyr Scr A^-n Glu Phe Thr Gin Tyr Arg Tin r:iy Lr.u A.^n Leu Lys Lvs 

555 e^n 

He Ala l.yv. Thr Trp A.-p Tyr Thr Vai Gly Phe Asn VaX A^p Pro Asn 

56o ?S70 575 

Arg The Val Ser Tyr Arg Ser Val Ala Gly Val Glu Gin A^p l.ys L^u 
SflO 

Ala Phe A.^n Arg Val A..n Leu Ser Pro Met Leu Arg He A^n Tyr Lys 

^'^ 600 505 

Pro Ser Arg Thr Thr Asn Leu Arg Val Asp Tyr Arg Gly Arg Thr Thr 

Gin rro Ser He A^n Gin He Ala Pro Val Gin lip lie Thr Asn Pro 

630 535 
Leu Phe Vai Thr Glu Gly Arn Pro Gly Leu Lys Pro Ser Tyr Ser A^n 

A.n Val net Ala Met Phe Ser Asp Phe A^p Ala Ly. Ser Gin Arg Ala 

665 6-.0 
Ph^ hsr, lie Val Phe Phe Gly Arn Tyr Thr Phe Anp A.<,p lie Val Pro 



eeo 585 



A-«n Thr His Tyr A-^p Pro Ser Thr Gly He Arg Thr Thr Arg Tyr Glu 

^- - 695 7QQ 

Asn Ala Ser Gly Thr Trp Gin Ala A^n Leu Hi.^ Gly Thr Leu Ser Leu 

Pro Leu Lys Asn Arg Ala Phe Ser Phe Arg lle'u Ser Leu Phe A^rn Zrg 

^25 73(1 ^ ^ 

Leu Ala Glu Gly Gin Sor Fhe He Asn A^p A^p Lys A^n Lys Ala Leu 

745 750 



S-er Fhe Arg Thr Arg Glu Arg Leu Thr Leu Thr Tyr Arg A^n A^n Trp 

760 765 
H= A^p Thr Ser He Gly Gly Asn He Gly Phe Tyr Met Ala Asn Asn 
, 780 



Ser Leu Ser Gly Gin Lys Asp Ser Arg Thr Tyr Asp Phe Gly Gly Asn 

Tyr Gin Val Ala Leu Thr Leu Pro Tyr Gly Phe Arg He Asp Ser Asp 

805 gxo flic 

v.l Glu Tyr Asn Thr Asn Ser Gly Tyr Ser Gly Gly Phe Ser Leu Asp 



Glu Trp Leu Trp Asn Ala Ser Leu Ser Tyr Ser Phe Leu Arg Asp Lys 

Air. Gly Thr Leu Arg Val Asn Gly Tyr Asp He Leu G^y Gin Arg S»r 

,? 060 
Ser He Ser Arg Ser Ala Ser Ala He Asn He Glu Glu Ser Met Ser 

Asn Thr He Gly Arg ^yr Val Met Val Asp Phe He Tyr Arg Phe Acn 



Al. Phe Ser Gly Gly Gly Ser Arg Ser Hp His Gin Arg Gly Asn Met 

905 

Asn ^^-g Fro Gly Pro Pro Phe Gly Gly Gly Arg Arg Pro Scr 



915 



920 925 



(2) IIIFORJlATIOt^ FOR SEQ TD IJO:505 

<i) 5ZQVE14CE CHARACTERISTICS: 

(A) LENGTH: 4 00 amino acids 

(B) TYPR: amino acid 
(D) TOPOLOGY: linear 

(11) HOLECULt; TYPE: protein 

HYPOTHETICAL: YES 

{v1 ) OKIGIWAI. SOURCE: 

(A) ORGAIfISM: Porphyronvonas gingi%-alis 

iiy.) FEATURE: 

(A) MAI1E/KEV: misc_£eature 

(B) LOCATION 1...400 

SEQUENCE DESCRIPTION: SEQ ID 1IO;505 

H^t Val Mot Lys Leu He Lys Arg Ser Leu Leu Leu^ Leu Gly Ala Val 

^ ^ 15 
Leu L-u ue Thr Lau Pro Ala Tyr Ser Gin Asn Asp Asp He Phe Glu 

Asp lie Kr Thr Ser Ary Lys IL II, Arg Lys Gin Asn Gin Val 

40 j5 



PCT/AU98/01023 



SUBSTITUTE SHEET (RiiJe 26) (RO/AU) 



-!Uii :3 .X J^:: q o m u o i£; S O G 



PCT/AU98/01023 

457/490 



hys A^p Trp Gin Asn Gin Glu Aap Gly Tyr Gly A^p A^p rhr Glu Tvi: 

50 5S gQ 

Thr Val Ala Ser Asp Arg Asp He Asp Ala Tyr Asn Arg Arg Asp Gly 

75 80 
Gin Ser Tyr A.-p Gly hys Lys L^u Ser Lys Asp Lyr Lys Arg Asp Ser 

85 90 95 

Thr Arg Ser Ser Val Pro Gly Arg Tyr Ser Arg Arg Leu Ala Arg Ph- 

105 iiQ 
Tyr Ly^ Pro A^n Thr He Val lie Ser Gly Ala A^p A^n Val Tyr Val 

Thr A^p Asp Gly Glu Tyr Phe Val Tyr Gly Aep Glu Tyr Tyr Asp Arp 
135 2.10 



Ala Ser Ser Val Asn II- 
145 150 



Tyr He Asn Ser Pro Trp Cys Asp Pro Phe 
155 160 



Pro Tyr Thr Ser Trp Tyr Pro Sor Pho Sar Gly Trp Tyr A^n Tyr Thr 

1Gb 170 
Trp Asn Tyr Pro Trp Phe Tyr Tyr Gly Ser His He Gly Tro Gly Gl- 

180 IBS 15*0 

Tyr Tyr Pro Gly Tyr A.-n Trp Tyr Trp Ser Tyr Tyr Tyr Asp Pro Ph- 

195 200 205 

Tyr Asn Pro Tyr Gly He Gly Het Gly Trp Gly Tyr Pre Tyr Gly Trp 

210 21S 22f^ 

Gly Ser Tyr Tyr Gly Trp Gly Gly Tyr Pro Gly Val He His Ills Tyr 
2^*^ 235 240 

His His Tyr Pro Ly^ Uys Thr Tyr Ser Asn Gly Gin His Ser Gly Ala 

2-15 250 255 

Tyr Tyr Ser Tyr Gly Arg ?ro Asn Arg He r.ys CJy Gl--- Thr Ser Gl- 

260 265 270 

Ala Lys Leu Gly Thr Gly Arg Tyr Asp Arg He Gin Asn Ser Ser S-r 

200 285 

^Sn ^^"^ ^-"^ ^^"^ Lys Pro Asn Asn Asn Leu 

295 300 
Gin Asn Val Lys Ser Gly Arg Thr Gly Arg Ala Asn Arg Asp Arg Asn 

He Glu Thr Val Thr Pro Asn Asn Gly Gin Lys Gin Asn Arg Pro vfl 
325 

Phe Gin Gin Asn Gin Ser Gly Asn Asp Arg Pro Thr Gly Arg Asn He 

^'^^ 345 
Arg Ser Glu Arg Gin Gly Glu Asn A£«n Asp Arg Thr Phe Ser Thr Pro 
^ ^ 355 360 3€5 

'^^'^ ^"^ '^^^'^ Ser Arg Ser Ser 

370 375 

ser v,ly Ser lict Scr Gly Gly Gly Gly Arq Ser Gly Arg Gly Arg Asn 
35<^ 395 ^ 400 

(2) IHFORltATTOH FOR SEQ ID r JO: 506 

ii) SEOLfEMCE CHARACTERISTICS: 

(A) LEUGl'H: 398 ainino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(ii) HOLECULE TYPE: protein 

(Hi) HYPOTHETICAL: YLS 

(vi) ORIGINAL SOURCE: 

(A) ORGAJUSM: Porphyromonae gingival is 

(Ix) FEATURF.: 

lA) NAI-IE/KEV: nilsc_f eatur- 

(B) LOCATION 1...39e 

(xi) SEQUEfJCE DESCRTPTIOH: SEQ ID 110:506 

Met Lys Leu He Lys Arg Ser Leu Leu Leu Leu Gly Ala Val Leu L-u 

5 10 lb ' 

He Thr Leu Pro Ala Tyr Ser Gin A.'.n Asp Asp He Phe Glu Asp Asp 

25 30 
He Tyr Thr Ser Arg Lys Glu He Arg Lys Gin A^n Gin Val Lys Asp 
3- 40 45 Jr H 

Trp Gin Asn Gin Glu Asp Gly Tyr Gly Asp Asp Thr Glu Tyr Thr Val 

^-^ 55 50 

Ala Ser Asp Arg Asp T r e Asp Ala Tyr Asn Arg Arg Asp Gly Gin Ser 

"^5 go 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



ti"Hi -ff"'!! 'ir^t ,ai -i=^t fi^ii ,-tif . .,73 -^inn tr~i» 

li ji o «^ ^ a toi iu iL.^ a sc: -o u jui 



wo 99/29870 PCT/AU98/01023 

458/490 

Tyr A*p Giy Lys Lys Leu Ser Lys Acp Lys Lys Arg Asp Sec Thr Arg 

65 on 05 

Ser Ser Val Pro Gly Arg Tyr Ser Arg Arg Leu Ala Ara Phe Tyr J.ys 

100 ICS ' 110 

Vco Asn Thr II- Val He Ser Gly Ala Asp Asn Val Tyr Val Thr Asp 

115 120 125 

A^p Gly Glu Tyr Phe Val Tyr Gly Asp Glu Tyr Tyr Asp Asp Ala Ser 

130 135 140 

Ser Val Asn He Tyr He Asn Ser Pro Trp Cys A^p Pro Phe Pro Tyr 
J-^S ISO 155 160 

Thr Ser Trp Tyr Pro Ser Phe Ser Gl y Trp Tyr Am Tyr Thr Trp Asn 

165 170 175 

Tyr Pro Trp Phe Tyr Tyr Gly Ser His He Gly Trp Gly Gly Tyr Tyr 

180 105 190 

Pro Giy Tyr Asn Trp Tyr Trp Ser Tyr Tyr Tyr As-p Pro Phe Tyr Asn 

1?5 200 205 

Pro Tyr Gly He Gly Met Gly Trp Gly T-yr Pro Tyr Gly Trp Gly Scr 

210 215 220 

Tyr Tyr Gly Trp Gly Gly Tyr Pro Gly Val He His His Tyr His His 
230 23S 240 

Tyr Pro Lys Lys Thr Tyr Ser Asti 01 y Gin Hi^- Ser Gly Ala Tyr Tyr 

245 2S0 25b 

S-r Tyr Gly Arg Pro Asn Arg He Lys Gly Gly Thr S-r Gly Ala Ly? 

260 265 270 

Leu Gly Thr Gly Arg Tyr Asp Arg He Gin Asn San Ser Ser Gin Lyc 

275 280 2e5 

Asn Lys Phe Gly Leu Gin Ser Asn Lys Pro Asn Asn Asn Leu Gin Asn 

290 295 300 

Val Lys Ser Gly Arg Thr Gly Arg Ala Asn Arg Asp Arg Asn He Glu 
305 310 

Thr Val Thr Pro Asn Asn Gly Gin Lys Gin Asn Arg Pro Val Phe Gin 

325 330 335 

Gin Asn Gin Ser Gly Asn Asp Arg Pro Thr Gly Arg Asn He Arg Ser 

340 34b 350 

Glu Arg Gin Gly Glu Asn Asn Asp Arg Thr Phe Ser Thr Pro Ser Arg 

355 360 365 

Ser Asn Ser Asn Gly Gly Phe Ser Thr Pro Ser Arg Ser Ser Ser Gl^- 

370 375 3B0 

Ser Met Ser Gly Gly Gly Gly Arg Ser Gly Arg Gly Arg Asn 
385 390 395 

(2) IIJFORI-IATION FOR SEQ ID NO: 507 

(i) SECJEMCE CHARACTERISTTCS: 

(A) LEtJGTH: 581 amino acids 
{B) 'l-YPE: amino acid 
(D) TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

( i i i I Hi* POTHET I CAL : YES 

(Vl) ORIGIIfAL SOURCE: 

{A) ORGAJ4ISM: PorpJiyroraonas ginglvalts 

(Ix) FEATURE: 

{A> NAI'lE/KEY: mlsc_feature 
{B» LOCATIOIJ 1 . . ,581 

(xl ) SEQUEMCE DESCRIPTIOH: SEQ ID (10:507 



tlet 


He 


Arg 


Lys 


Leu 


lie 


Leu 


Leu 


Lc-u 


Ala 


Leu 


Mc-t 


Pro 


Val Ala 


Ser 


1 








5 










10 








15 




Val 


Ala 


Phe 


Ala 


Val 


Pro 


Thr 


Asp 


Ser 


Thr 


Glu 


Ser 


Lys 


Asp Asn 


Arg 


He 






20 










25 










30 


Leu 


Thr 


Ser 


Het 


Gin 


Ser 


Ser 


Ser 


Leu 


Asn 


Arg 


Asp Asp Ala 


Pro 


Asp 




35 










40 
















Lys 


Trp 


Gin 


Pro 


Met 


His 


Ala 


Asn 


Phe 


Ser 


He 


Gin 


Ser Asp 


Het 




50 










55 
















Leu 


Leu 


Ser 


Thr 


Ala 


Gin 


Lys 


Ser 


Lys 


Asn 


Thr 


Trp 


Phe 


Gly Asn 


Ser 


65 


Tie 








70 










75 




80 


Tyr 


Het 


Gly 


He 


He 


Lys 


Asn 


Asn 


Tyr 


Leu 


Giu 


Phe 


Gly Ala 


Arg 


Phe 


Glu 






65 










90 








95 


Asp 


Leu 
100 


Tyr 


Lys 


Pro 


Leu 


Pro 

105 


Gly 


His 


Glu 


Pro 


Glu Met 
110 


Gly 
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Arg Gly Val Pro His Met Tyr Val Ly^ Gly Ser Tyr His Trp Ala Glu 

115 120 125 

Leu Thr Met Gly Asp Phe Tyr Asp Gin Phe Gly Ser Gly Met Val Phe 

130 135 140 

Arg Thr Tyr Glu Glu Arg Asn Leu Gly Tlo Asp Asn Ala Val Arg Gly 
14S 150 15S 160 

Gly Arg lis Val Leu Thr Pro Phe Asp Gly Val Arg Val Lys Giy lie 

165 170 175 

Ala Gly Gin Gin Arg JKsn Tyr Pho Anp Arg Thr Gly T.ys Vnl Phn Pisn 

IBO leS 190 

Ser Gly Arg Gly Tyr Leu Leu Giy Ser Asp Leu Glu Leu Asn Val Glu 

195 200 205 

Arg Trp Ser Ser Ala Met Arg Asp A^in Asp Tyr His Leu A)a Tlo Gly 

210 215 220 

Gly Ser Phe Vai Ser Lys His Glu Ala Asp Glu Asp lie Phe Val Gly 
225 230 23t 24Q 

Val Gly Glu Asp Arg Lys Arq Leu Asn Leu Pro Leu Asn Val Pro lie 

245 250 255 

Met Gly Leu Arg Thr Asn PIjs Gin Lys Gly Gly Leu Ala Leu Tyr Ala 

260 26b 27 0 

Glu Tyr Gly Tyr Ly? Tyr Asn Asp Pro Ser Ala Asp Asn Aso Tyr lie 

275 280 205 

Tyr His Asp Gly Gin Ala Ala Leu Leu Ser Ala Ser Tyr Ser Lys Lys 

290 2S5 300 

Gly Met Ser He Leu Leu Gin Ala Lys Arg Cys Glu Asn Phe Ala Phe 
305 310 ■ 315 320 

Arg Ser Lys Arg Ser Ala Gin Leu Thr Pro Leu Het He Aen Tyr Het 

325 330 335 

Pro Ala Phe Thr Gin Ala His Thr Tyr Thr Leu Ala Ala He Tyr Pro 

3^0 345 350 

Tyr Ala Thr Gin Pro Gin Gly Glu Trp Ala Phe Gin Gly Glu Leu Arg 

355 360 365 

Tyr Asn Phft Ala Arg Arg Thr Ala Leu Gly Gly Arg Tyr Gly Thr Gly 

370 375 380 

Leu Arg He Asn Vai Ser His Val Arg Gly Leu Asp Lys Lys Met Leu 
385 390 395 400 

Lys Glu Asn Pro Asp Glu Leu Tie Gly Thr A.-'p Gly Tyr Thr Val Ser 

405 410 415 

Phe Phe Gly Het Gly Asp Leu Tyr Tyr Ser Asp He Asp Val Glu He 

420 425 430 

Thr Lys Lys Val Ser Pro Gly Phe Asn Phe Thr Leu Thr Tyr Leu Asn 

435 440 445 

Gin He Tyr Asn Asn Lys Val Leu His Gly Ala Ala Gly Glu Lys Pro 

450 455 460 

Glu Lys He Tyr Ala Asn He Phe Vai Tyr Asp Gly Lys T:,'r Lys Leu 
46S 470 475 480 

S-^r Asn Lyc Vol AIo Leu Arg Thr Glu L^u Gin Tyr Leu Mis Thr Lys 

485 4 90 4 95 

Gin Asp Gin Gly Asp Trp He Tyr Gly list Ala Glu LtiU Set He Lftu 

500 505 510 

Pro Ser Leu Het Lsu Ser Leu Ser Glu Gin Tyr Asn He Gly Glu Thr 

515 520 525 

Lys Lys His Tyr Val M-t Gly Ser Val Thr Tyr Thr His Gly Ala His 

530 535 S4 0 

Arg Val Ala Phe Ser Ala Gly I.ys Thr Arg Ala Gly Met Asn Cys Ser 
545 550 555 560 

Gly Gly Val Cys Arg Val Val Pro Glu Thr Gin Gly Phe Tyr Leu Ser 

5S5 570 575 

T^r Ser Thr Asn Leu 
580 

(;:) INFOWIATIOII FOR SEQ ID HO: 500 

ii) SEQUEIICE CHARACTERISTICS: 

{A) LEIIGTH: 239 amino acids 
(B) TiTPE: amino acid 
(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(Hi) HYPOTHETTCAF,: YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAllISH: Porphyromonas gingival Is 
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( 1 X ^ FEATURE : 

(A) HAt'lE/KEY: mlsc feature 

(B) L0CATI014 1 . . .239 

(Xl) SEQUENCE DESCRIPTION: SEQ ID MO:508 

Met Arg Ser Leu Pli« Leu Ser Ala Leu Arg Ser Ser Scr Leu His Gly 

^5 10 15 

Ssr Glu Arg Arg Ser Arg He Ser Ser Ser Val Val Met Ser He Arg 

20 25 30 

Gin Lys He Arg Leu Phe His Leu Ser Val Cvr Ala Gin Thr His Asp 

35 40 45 

His Leu He Clu He His Leu Val Cys He Glu Phe Gly Ala He Asp 

50 55 60 

Thr Asp Glu Phe Ary Leu Ser Ser His Ala Tyr Thr Thr Ser Pro Thr 
«^ 70 75 80 

His Thr Gly Ala H- His Ills Asn Cys lie Glu Arg Ser Tyr Gly Arg 

85 Oto 05 

Tyr Leu Val Thr Phe Gly Gin Glu Arg Asn Glu Leu His His His S^r 

iOO 105 HO 

Arg Pro Asp Arg Asn Ala Glu Val Tyr Arg Phe Pro Phe Asp Asn Ala 

lis 120 125 

Fhe His Ser He Arg Tyr Glu Ala Phe Arg Pro He Arg Pro He Val 

IJtJ 135 j^^^ 

cyr His Asp Asp His Phe He Ala He Gly Ser His Leu Phe Phe Lvs 
' 150 155 160 

Asp Asn Gin He Phe Ser Ser Gly Ser Gin Tyr Asp Asn Tyr Thr Val 

165 170 175 

Aln Cys Phe Val Glu Ser Leu His Asp Arg Glu Gin Arg Ser His Thr 

180 185 190 

His Thr Ala Ser Gly Thr Acn His Cys Al« Asp Lou Phe Asp Met Arq 

195 200 205 

Thr I.eu S«r Gin Arg Thr Tyr His He Arg Asp lie Val Ala Asp Phe 

210 215 220 

Glu Phe Gly Gin Phe Leu Gly Arg Phe Ala Mis Gly Leu Asn His 
225 230 235 

(2) III FORMAT lOCJ FOR SEQ ID MO: 509 

ii) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 211 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: Unear 

(ii) MOLECULE TYPE: protein 

(ill) HYFOTHETTCAL: YES 

(Vi) ORIGINAL SOURCE: 

(A) ORGAJUSH: Po rphyromonas gingival is 

(ix) FEATURE: 

(A) HAI'lE/KEY: misc_f eaLure 

(B) LOCATION 1 . . .211 

(xH SEQUENCE DESCRIPTION: SEQ ID NO: 509 

net Scr He Arg Gin Lys He Arg Leu Phe His Leu Ser Val Cys Ala 

^5 10 15 

Gin Thr His Asp His Leu He Glu He His Leu Val Cys He Glu Ph- 

20 25 30 

Gly Ala He Asp Thr Asp Glu Phe Arg Leu Ser Ser His Ala Tyr Thr 

3S 40 45 

Thr Ser Pro Thr His Thr Gly Ala He Mis His Asn Cys He Glu Arg 

55 GO 
Ser Tyr Gly Arg Tyr Leu Val Thr Phe Gly Gin Glu Arg Asn Glu Leu 

^0 75 80 

His His His Ser Arg Pro Asp Arg Asn Ala Glu Val Tyr Arg Phe Pro 

85 90 95 

Ph* Asp Asn Ala Phe His Ser He Arg Tyr Glu AVff Phs Arg Pro Il» 

100 105 110 

Arg Pro He VaI Cys His Asp Asp His Phe He Ala He Gly Ser His 

115 120 125 

Leu Phe Phe Lys Asp Asn Gin He Pho Ser Ser Gly Ser Gin Tyr Asp 
1-30 135 j^^n 
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Tyr 


Thr 


Val 


Ala 




Ph- 


Val 


Glu 


Ser 


Leu 


Hi9 


Asp Arg Glu 


Gin 


115 










ISO 










155 






160 


Arg 


Sec 


Hi5 


Thr 


His 


Thr 


Ala 


Ser 


Giy 


Thr 


Asn 


His 


Cys Ala Asp 


Leu 










165 










170 






175 




Phe 


As-p 


Met 


Arg 


Thr 


Leu 


Ser 


Gin 


Ar<5 


Thr 


Tyr 


His 


He Arg Asp 


He 








IflO 










185 








190 




Val 


Ala 


Asp 


Phe 


Glu 


Phe 


Gly 


Gin 


Phe 


Leu 


Gly 


Arg 


Phe Ala His 


Gly 



1?5 200 205 



Leu As-n His 
210 

(2) INFORIIATIOII FOR SEQ ID NO: 510 

(!) SEQUEfrCE CHARACTERTSTTCS: 

lAI LEIIGTH: ^ei amino acidr 
(31 T-^PE: amino acid 
{D> TOPOLOGY: linear 

(ii) tlOLF-CULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

{vil ORIGIJIAL SOURCE: 

(Al ORGAHlSItr Porphyromonas gingival is 

t i;: ) FEATURE : 

(A) HAIIE/KEY: mlsc feature 

(B) LOCATia; 1 . . .781 



{Xll SEQLIF.HCK DF.SCRT PTIOM : SEQ ID NO: 510 



Met 


Asp 


Arg 


Pro 


Lys 


Pro 


Ser 


Tyr 


He 


Val 


Arg 


He 


Ala 


Ala 


He 


Leu 


1 








5 










10 










15 






Leu 


Phe 


Val 


Gly 


Arg 


Pro 


Leu 


Phe 


Ala 


Gin 


Ser 


Tyr 


Val 


Asp 


Tyr 








20 










25 










30 






Val 


Asp 


Pro 


Leu 


He 


Gi y 


Thr 


Leu 


Ser 


Ser 


Phe 


Glu 


Leu 


Ser 


Ala 


Gly 






35 




















45 






Ajin 


Thr 


Tyr 


Pro 


Val 


He 


Gly 


Leu 


Pro 


Trp 


Gly 


Met 


Asn 


Ser 


Trp 


Thr 




50 










55 










60 










Pro 


Met 


Thr 


Gly 


Val 


Pro 


Gly 


Aifp 


Gly 


Trp 


Gin 


Tyr 


Thr Tyr 


Ser 


Ala 


65 










70 










75 










eo 


His 


Lyc 


He 


Arg 


Gly 


Phe 


Lys 


Gin 


Thr 


His 


Gin 


Pro 


Ser 


Pro 


Trp 


He 










85 










90 










95 




Asn 


Asp 


Tyr 


Gly 


Gin 


Phe 


Ser 


Leu 


Leu 


Pro 


Lou 


Thr 


Ala 


Pro 


Gin 


Lys 








100 










105 










110 




Pro 


Ser 


Ser 


Asn 


Asp 


Ser 


He 


Ala 


Leu 


Thr 


Lys 


Trp 


Cys 


Lys 


Gin 


Leu 






115 










120 










125 








Phe 


Ser 


Asp 


Glu 


Gin 


Thr 


Ser 


Trp 


Pho 


Ser 


His 


I.ys 


Al a 


Glu 


Thr 


Ala 




130 










i:;6 










lie 










Thr 


Pro 


Tyr 


Tyc 


Tyr 


Ser 


Val 


Tyr 


Leu 


Ala 


Asp 


Tyr 


Asp 


Thr 


Arg 


Val 


145 










ISO 










155 










160 


Glu 


Met 


Ala 


Pro 


Thr 


Glu 


Arg 


Ala 


Als 


He 


Phe 


Arg 


73s 


Arg 


Tyr 


Ser 










165 










170 






17S 




Gly 


Asn 


Thr 


Glu 


Ser 


Gly 


Ser 


Gly 


Arg 


Trp 


Leu 


Arg 


Leu 


Asp Ala 


Phe 








180 










185 










190 






Thr 


Gly 


Gly 


Ser 


Glu 


He 


Ser 


He 


Val 


Asp 


Pro 


His 


Thr 


Val 


Val 


Gly 






195 










200 










205 








He 


Ser 


Arg 


Lyj 


Asn 


Ser 


Gly 


Gly 


Val 


Pro 


Ala 


Asn 


Phe 


Ala 


Cys 


Tyr 




210 










215 










220 






Phe 


He 


Leu 


Gin 


Ser 


Asp 


Thr 


Pro 


M=t 


Ala 


Asp 


Val 




Leu 


Glu 


Thr 


225 










230 










235 










240 


A^p 


Thr 


Gly 


Ly- 


Ser 


Asp 


Glu 


Gly 


Thr 


Arg 


Ala 


Trp 


Ala 


Ala 


Cys 


Arg 










245 










250 










255 




Phe 


Asp 


Ser 


Gin 


Glu 


Vol 


Thr 


Val 


Arg 


Val 


Ala 


Ser 


Ser 


Phe 


He 


Ser 








260 










265 










270 






Val 


Glu 


Gin 


Ala 


Glu 


Arg 


Asn 


Leu 


Alu 


Glu 


Val 


Lys 


Giy 


GLn 


Ser 


Phe 






275 










280 








285 








J\F.p 


Arg 


He 


Arg 


Leu 


Ala 


Gly 


Arg 


Glu 


Ala 


Trp 


Asn 


Lys 


Vai 


Leu 


Gly 














205 










300 








Arg 


He 


His 


Val 


Glu 


Gly 


Gly 


Thr 


Lys 


T\sp 


Glu 


Arg 


Thr 


Thr 


Phe 


Tyr 


305 










310 










315 










320 


Ser 


Ala 


T.ftti 


Tyr 


Arg 


cys 


Leu 


Leu 


Phe 


Pro 


Arg 


Arg 


Phe 


Tyr 


Glu 


Glu 










325 










330 










335 




Asp 


Ala 


Ser 


Gly 


Asn 


Ph = 


Val 


His 


Tyr 


Ser 


Pro 


Tyr 


Asn 


Gly Glu 


Val 



340 345 350 
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Leu Pro Gly Tyr Leu Tyr Thr A/rp Thr Gl y Phe Trp A^p Thr Phe Arg 

355 360 365 

Ala Leu Phe Pro Leu Leu Asn Leu Leu Tyr Pro Asp Giu Asn lie Lys 

370 3-75 380 

Tl<2 Gin Glu Gly Leu Leu A/?n Val Tyr Arg Clu Ser Gly Phe Phe Pro 
385 390 395 400 

Giu Trp Aia Ser Pro Gly His Arg Asp Cys Met He Gly Asn Asn Ser 

405 410 415 

Ala Ser Val Leu Ala Arp Ala Tyr ieu Lys Cly Val Aro Val GUi Asp 

420 425 ' 430 

Thr Arg Thr Leu Met Asn Gly Leu Leu His Ala Thr Lys Ala Val His 

435 440 445 

Pro Ly.-; lie Ser Ser Thr Gly Arg Ly* Gly Trp Glu Trp Tyr Asn Ser 

450 455 460 

Leu Gly Tyr Val Pro Ala Asp Ala Gly He Arp Glu Ser Ala Ala Arg 
465 4T0 475 4G0 

Thr I.ou Glu Tyr Al « Tyr Asn Asp Trp Cys He Leu Arg Leu Gly Arg 

485 490 495 

Thr Leu Gly Trp Asp Arg Al- Ala Leu Asp Thr Leu Ala His Arg Ser 

500 505 510 

Met Asn Tyr Arg His I.eu Phe Asp Pro Glu Thr Ly;? Leu Het Arg Gly 

515 520 525 

Arg Asn Gin Asp Gly Ser Phe Arg Thr Pre* Phe Ser Pro Phe Lys Trp 

530 535 540 

Gly Asp V5I Phe ThT Glu Gly Asn Ala Trp His Tyr Thr Trp Ser Val 
545 550 555 560 

Phe His Asp Val Gin Gly Leu He Asp Leu llet Gly Gly Asp Arg Pro 

5SS 570 575 

Phe Val Ser Met I.eu Asp Ssr Val Phe Asn Thr Pro Pro Met Phe Asp 

580 585 590 

Glu Ser 'i'yr Tyr Gly Phe Val He His Glu He Arg Glu Met Gin He 

595 600 605 

A) a Asp Met Gly Asn Tyr Ala His Gly Asn Gin Pro He Gin His Met 

610 615 620 

He Tyr Leu Tyr Asn His Alo Gly His Pro Trp Lys Ala Gin Glu Arg 
625 630 635 640 

Leu Arg Glu Val Met Gly Arg Leu Tyr Arg Pro Thr Pro Asp Gly Tyr 

64 5 650 6SS 

Cys Gly Asp Glu Asp Asn Gly Gin Thr Ser Ala Trp Tyr Val Phe Ser 

660 665 670 

Ala Leu Gly Phe Tyr Pro Val Thr Pro Ala Thr Asp Gin Tyr Val Leu 

675 680 685 

Gly Ser Pro He Phe Ser Lys Val He Leu Ser Phe Pro Asp Gly His 

690 695 700 

Lys Thr Val Leu His Ala Pro Ala Asn Ser Ala Asp Thr Pro Tyr He 
''OS 710 715 -720 

Arg Ser He Ser Val Glu Gly Lys Glu Trp Ser Cys Asn Tyr Leu Thr 

725 730 735 

His Glu Gin Leu Arg Ser Ser Ala Ser He Gin Trp Met net Asp Thr 

74 0 74 5 750 

Lys Pro Asn Tyr Asn Arg Gly Met Lys Glu Ssr Asp Arg Pro Tyr Ser 

755 760 765 

Phe Ser Thr Glu Gin Gin Arg Arg Ala Asn His Ser Asn 
770 775 780 

(2) IHfOFa-IATION FOR SEQ ID HO: 511 

(i) SEQUEHCE CHARACTERISTICS: 

(A) LEMGTH: 271 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) riOLECULF, TYPE: protein 

(iii) HYPOTHETICAL: YES 

{VI ) ORIGINAL SOURCE: 

(A) ORGAt 1 1 SH: Porphyromonas gingival is 

iix) FEATURE: 

(A) HAIIE/KEY: rolsc_£ea tare 
(B| LOCATIOM 1. . .271 

t-^il SEQUENCE DESCRIPTION: SEQ ID NO: 511 
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Het Lys Ser Het Arg Ser Val i.eii I.ou T.eij I,gu Phe Pro I.cu Ser 

1 S 10 15 ~ 

Leu Tie Thr Ala Leu Gly Cys Ser Asn Asn Lys Ala Ala Glu Ser Lys 

20 25 30 

Ser Val Ser Phe Asp Ser Ala Tyr Leu Glu Arg Tyr lie Pro Leu Arg 

36 40 45 

Ala Asp lie Asp Thr ?ro Ser Leu His Val Met lie Ser Tyr Val Tyr 

SO 55 60 

Pro Ser Gly Asp Aap Hct Leu Thr Glu lie Phe A^n Gly Leu Leu Phe 
65 70 75 80 

Gly Asp Ser Leu Met Asp Ser Ser Ser Pro Glu Atn Ala Het Glu Gly 

85 90 95 

Tyr Ala Gin Met Leu G3 y Glu A^p Tyr Arg Ser Asn Asn Ala Glu Ala 

100 ICS lie 

Arn Leu Gin Gly Leu Pro Ser Asp Leu Leu Asp T^-r lie Tyr Lys Gin 

lis 120 125 

Glu Aen Thr He Ala Tyr Cys Asp Thr Gly Lou Tic Ser Thr Arg Tin 

130 135 1^0 

Asn Thr Tyr Thr Tyr Glu Gly Gly Ala His Thr Glu Asn Thr Val Arg 
145 150 155 160 

Phe Ala Asn He T.ou Arg Thr Thr Gly Lys Val Leu Glu Glu Arg Asp 

165 170 175 

He Phe Lys He Asp Tyr Ala Glu Arg Leu Ssr Ala Leu He He Gly 

180 105 190 

Gin Leu Val His Asp Phe Gly T.yr: Thr Thr Pro Ala Glti Leu Asp Ala 

195 200 205 

He Gly Phe Phe Asn Ala Glu Glu He GLn Pro Asn Gly Asn Phe Het 

210 215 220 

He Asp Asp Lys Gly Leu Thr Tyr Cys Phe Asn Glu Tyr Gin He Ala 
225 230 235 240 

Ala Tyr Ala Arg Gly Ala Val Tyr Val Arg Leu Gly Tyr Asp Val Leu 

245 250 255 

Ala Pro Leu Leu Arg Asp Asp Ser Pro Leu Lys Arq Tyr Leu Pro 
260 265 270 

(2) INFORi-lATIW POR SEQ ID HO: 512 

(i) SEQUENCE CHARACTERISTICS: 

(A) LEHGTH: 270 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(ill) HYPOTHF.TTCAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGAHISH: Porphyromonas gingival! s 

Ux) FEATURE: 

(A) NAIIE/KEY: misc_feature 

(B) LOCATION 1...270 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512 



Met 


Lys 


Ser 


Het 


Arg 


Ser 


Val 


Leu 


Leu 


Leu 


Leu 


Phe 


Pro 


Leu 


Ser 


Leu 










5 










10 










15 




He 


Thr 


Ala 


Leu 


Gly 


Cys 


Ser 


Asn 


Asn 


Lys 


Ala 


Ala 


Glu 


Ser 


Lys 


Ser 








20 










25 










30 




Val 


Ser 


Phe 


Asp 


Ser 


Ala 


Tyr 


Leu 


Glu 


Arg 


Tyr 


Tic 


Pro 


Leu 


Arg 


Ala 






35 










40 










45 






Asp 


He 


Asp 


Thr 


Pro 


Ser 


Leu 


His 


Val 


Het 


He 


Ser 


Tyr 


Val 


Tyr 


Pro 




i>0 










55 










60 






Uer 


Gly 


Asp Asp 


Het 


Leu 


Thr 


Glu 


He 


Phe 


Asn 


Gly 


Leu 


Leu 


Pho 


Gly 


65 










70 










75 








80 


Asp 


Ser 


Leu 


Met 


Asp 


Ser 


Ser 


Ser 


Pro 


Glu 


Asn 


Ala 


Met 


Glu 


Gly Tvr 










85 










90 










95 




Ala 


Gin 


Met 


Leu 


Gly Glu 


Asp 


Tyr 


Arg 


Ser 


Asn 


Asn 


Ala 


Glu 


Ala 


Asn 








100 










105 










110 






Leu 


Gin 


Gly 


Leu 


Pro 


Ser 


Asp 


Leu 


Leu 


Asp 


Tyr 


He 


Tyr 


Lyt- 


Gin 


Glu 


Asn 




115 










120 










125 






Thr 


He 


Ala 


Tyr 


Cys 


Asp 


Thr 


Gly 


Leu 


He 


Ser 


Thr 


Arg 


He 


Asn 




130 










135 










140 








Thr 


Tyr 


Thr 


Tyr 


Glu 


Gly 


Gly 


Ala 


His 


Th r 


Gl u 


Asn 


Thr 


Val 


Arg 


Phe 


145 










150 










155 








160 
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Ala A^n He L^u Arg Thr Thr Gly Lys Val Leu Glu Glu Arg Asp II 



170 



Phc I.y. no A.p Tyr Ala Glu Arg Leu Ser Ala Leu He He Gl'y Gin 



65 



70 



75 



loci 1 an 

Leu Val Hi, Asp Phe Gly Lye The Thr Pro Ala Glu Leu Asp Al= IK. 

200 ZO*- 
Gly Phe Phe Arn Ala Glu Glu lie Gin Pro A-^n Gly Ph, Met He 

A.p A.P Lys Gly Leu Thr Tyr Cys Phe Asr. Glu Tyr Gin He Ala Ala 

Tyr Ala Arg Gly Ala Val Tyr Val Arg Leu Gly Tyr A.p Val Leu M 

250 255 



Pro Leu Leu Arg Asp Arp Ser Pro Leu Lyi Arg Tyr Leu Pro 
15 270 
(2) IMfORJ^ATIOJI FOR SEQ ID NO:513 



(J) SF.QtJKriCE CHARACTERISTICS: 

(A) LENGTH: 267 amino acids 
(D) TYPE: amino acid 
(LM TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(:li) HYPOTHETICAL: YES 



[Vl) ORIGTNAT,. SOCJRCE: 

(A) ORGAMISM: Porphyromona<3 gingival is 
30 lix) FEATURE: 

(A) MAIIE/KEY: misc featur- 

(B) LOCATION 1. . .267 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:513 
Het Arg Ser Val Leu Leu Leu Leu Phe Pro Leu Ser Leu He Thr Al ^ 

^ 5 10 .rr ' 



Leu Gly Ci-s Ser Asn A^n Lys Ala Ala Glu Ser Lys Ser Val Ser Phe 
40 Asp ser Ala Tyr Leu Glu Arg Tyr lie Pro Leu Arg Ala A^p He A,p 



Thr Pro ser Leu His Val Met 11, Ser Tyr Val Tyr Pro Ser Gly Asp 
Asp Met Leu Thr Glu lie Phe Arn Gly Leu ,,=u IL Gly Asp Ser Leu 

Met A..p Ser Ser Ser Pro Glu Asn Ala Het Glu Gly Tyr Ala Gin 11^ 

03 90 oii 

Leu Gly Glu A^p Tyr Arg Ser A^n Asn Ala Glu Ala Asn Leu Gin Gly 
50 Leu Pro Ser Asp Leu Leu Asp Tyr "e Tyr Lys Gin Glu l^n Thr He 



Ala Tyr Cys A.^p Thr Gly Leu He' Ser Thr Ar, He Al' Thr Tyr Thr 



135 



55 lis """^ III ''''' rtl ^'^^ 

Glu 
Leu 

60 Asp Phe Gly Ly.^ Thr Thr Pro Ala III t.o., n., _ 



A.p Tyr Ala Glu Arg Leu Ser Ala Leu He He Gly Gin Leu vl\ 

185 

Sly I.ys Thr Thr Pro Ala Glu Leu Asp Ala lie Gly Phe Phe 



Asn Ala Glu Glu He Gin Pro Asn Gly Asn Phe Met Tie Asp Asp Lys 



205 

> Asn Gly Asn Phe Het 

Gly Leu Thr Tyr Cys Phe III Glu Tyr Gin He Ma Ala T,^r Ala Arg 
Gly Ala Val Tyr Val Arg L«u Gly Tyr A.p III Leu Ala Pro Leu ^eu 

Arg Asp Asp Ser Pro Leu Lys Arg Tyr Leu Pro 

260 265 

(2) INFORMATION FOR SEO ID NO: 51 4 

(1) SEQUEHCE CHARACTERISTICS: 

(A) LENGTH: 259 amino acids 
<e) TYPE: amino acid 
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(D) TOPOLOGY: linear 

(ii) IIOLECULE TYPE: protein 

Uli) HYPOTHETTCAL; YES 

(vi) ORIGINAL SOURCE: 

[A] ORGAMISM: Porphyroinonas gingival is 

{ix) FEATURE: 

(A) NAIIE/KEY: misc_f cature 

(B) LX)CATIOM 1...259 

(xi) SEQUENCE DESCRIPTION: SEQ ID HO: 514 

Met Lye Lys Thr Thr Leu The (ily Ser He Cys Ala Leu Leu Leu Pht 

15 10 15 

Leu Gly Leu Ser Ala Asn Ala Gin Scr Lys Leu Lyr He Lys S-r He 

20 25 30 

Glu Ala Ala Thr Thr Phe Ser Ser Ala Thr Ala Gly Asn Gly Phe Gly 

35 4 0 4S 

Gly Asn He Phs Gly Met Ai^p Met Ser He Arg Met Arg Val His His 

50 55 60 

Ser He Leu Pro Glu Gly Leu Aep Phe Ser Val Gly He Hts Glu Arg 
65 70 75 30 

Arg Ala His Trp Glu Glu Ala Gly Ser Pro Lys- Leu Met Tyr Thr Asn 

85 90 95 

Val Pro Ser He He Gly He Val Glu Lys Val He Val Pho Glu Asp 

100 105 110 

Ala Glu Asp Phs Phe Asp Lys Lys Ala Leu Gly Arg Phe Leu He Ser 

115 120 125 

Leu Gly He Ser Tyr Thx Lys His Leu Gly Ala Tyr Trp Gly Trp Thr 

130 135 140 

Asn Aep Ala His He Leu Phe Ser Pro He Pro Lys Ser Lys Val His 
145 150 155 160 

Tyr Asp Thr Tyr Thr Arg Ala Gly Ser Asp Leu Val Leu Gin Ser Glu 

165 no 175 

Asp Val Ala Thr Val Ser Asn Gly the Ser Pro Gly He Gly Leu Lys 

180 185 190 

Ser Ser He Trp Trp Lys Met Pro He Li's Ser Lys Tyr Asp Phe Arg 

155 200 205 

Leu Gly Phe Ser Leu Gly Tyr Glu Tyr Leu Asn Leu Leu Tyr Pro Tyr 

210 215 220 

Arg Asn Phe Lys Leu Asp Gly Asn Lys Pro Leu Ser Ala Leu Ser Pro 
225 230 235 240 

Arg Met Asn His Tic Gly His Val Gly Phe Asn Phe Thr Val Gly Leu 
245 250 255 

Trp Thr Asn 

(2) INFOW'IATION FOR SEQ ID NO: 515 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 1266 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

tiii) HYPOTHETICAL: YES 

(vl) ORIGINAL SOURCE: 

(A) ORGAMISH: Porphyromonas yingivalis 

<ix) FEATURE: 

(A) UAilE/KEY: misc_f ea tur e 
(P) LOCATIOH 1...1266 

(xi) SEQUENCE DESCRIPTTaJ: SEQ ID NO: 515 

Met Gly Lys Tyr Lys Arg Ala Lys Tyr Arg Tyr Trp Leu Phe Pro Phe 

^5 10 15 

Cys Set Asp Tyr Tyr Thr Phe Glu Gly Val Thr Phe Leu Cys Ala S-r 

20 25 30 

Asp Asp Met Thr Thr Lys Lys Pro Gin AIa He t.cu A.-^p L-u Glu Lys 
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Ala Tyr A.n lie Glu He Pro tip Leu Ser Ser Gin gL Gly He Sec 
55 



T|p ser Val Asn Arg Tyr Phe Lys Gin Acp ssr S^r Gly Ala Val Val 

Glu Leu cy.- Leu Arq Glu Cy. Gin He Glu Ser Met Thr Trp Leu 

Asp Phe Pro Ala Leu Lys Ly.= Lou A^p Leu Ser Tyr Asr, Gin lie Ser 

10 Lv. Leu Glu Gly Leu Glu Arg Leu ^hr Ser Leu Thr Ly. lIu Arg Leu 

120 125 
Arg Ser Arn Gin He Arg Lys Leu Glu Gly Leu A.p ser L=u Thr Ser 



130 



15 I's '*^'" ^" ISO ^" ^"^ ill ^^--^ i-^" 

Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu Arp Asn sfn 

He ser Ly, Leu Glu Gly Leu Glu Arg 11°, Thr Ser Leu Ala ^hr Leu 

20 Glu Leu Ser Gly As„ Gl„ He Arg Lyr Leu Glu GJ y I.eu ciu Arg Leu 

200 ->os 
Thr Ser Leu Ala Thr Leu Glu Leu Ser Gly Aan Gin He Arg Lys Leu 

25 zz" ^"^ f"^^ Iri'l Arg Leu Arg Ser 

A^n Uln He Ser Ly^ Leu Giu Gly Leu Glu Arg Leu Thr Ser Lou Ma 
Thr Leu Glu Leu Scr Gly A.=n Gin He Arg Lys Leu Glu Gly l" Glu 

30 Arg Leu Thr Ser Leu Ala Thr Leu sfu Leu Scr Giy Arn Cln He Ser 

280 285 
Ly.. Leu Glu Gly Leu Glu Arg Leu Ser Ser Leu Thr Lys Leu Arg Leu 

35 Tot '■^^ Thr Scr 

Leu Thr Lys Leu Ser Leu Ser Asp Asn Gin ill ser Lys Leu Glu lit- 
Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu A.^p Asn Gin 

40 He Arg Lys Leu Glu Gly Leu Glu III Leu Thr Ser Leu Thr Lys Leu 

360 

Arg Leu Arg Ser A,n Gin He Ser Lys Leu Glu Gly Leu A.p Ser Leu 



75 



45 3^1 ■"'^ "-^^ ^" '^•P Lys Leu 



390 



Glu Cly Leu Glu Arg i:;u Th. Ser Leu Ala Gl" Leu Tyr Leu Leu C 



Asn Gin He Arg i'yl Lou Glu Cly Leu Asp GLy Leu Ala Ser Leu Thr 
50 Arg Leu Ser L^u Arg Arg Asn Gin 111 Ser Lys Leu Glu Gly Leu Asp 



440 



Arg Leu L,-e Val Leu Arg Lys 11^ Asp Val ser Gly til Asp He Gin 



455 



55 Tel ]]l ^-1" Gin Thr Leu 

Glu I.ys Leu Arg He His Asp Asn Pro Phe vlf Ala Ser Ser Gly Leu 

4 QQ JUL 

He Leu Ser Pro Tyr Asp Asn His Leu Glu He Lys Ala Leu Leu 

60 Glu Lys Glu Lys Glu Lys Gin Lys Lyf Thr Ser Val Glu ^yr Hl= Pro 

520 52.; 
Phe Cys Lys Val Met Leu Le. Cly Asn His .er Ser Gly Ly. Thr Thr 

Tyr 
His 

56ii 570 



65 5^5 ""'^ ""^^ ""^^ '^^^'^ ^^'^ A^n Thr HI 



Val Leu Ser He His Arg 



555 



560 



r Arn A^n Pro Asn Ala He Fh- Tyr Asp 

Asp Tyr Ty 

580 



Phe Cly Gly Gin Asp Tyr Tyr His Gly He Tyr Gin Ala Phe Phe Thr 
70 Thr Gin ser Leu Tyr Leu Leu Phe Trp Asp Ala Lys Lys "p Arg Asn 



595 600 60' 

Phe .-al ser Val Asp Asp Lys Giu Tyr Gin Thr Leu A^n Phe Asn Arg 



Pro Tyr Trp Leu Gly Gin lie Ala Tyr Ala Cv= A^n Arg Cys Met Ser 
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Val Gly Cly A^n Pro Asp Gly Lys Asp Thr Pro Gin Thr Thr Asp Asp 

^•^^ €50 
Thr lie lie lie Gin Thr His Ala Asp Glu Thr Gly Ala Lys Gin Gin 
c 665 S70 

D Thr Leu Gly O/s Ala Ala Glu Asn Gly Val Lnu Glu Glu 11" Tvr Val 

690 685 
Ser L^u Glu Pro Lys Ala Asn Ser Ala Val His Ala Leu Asn Tyr Leu 
695 700 

10 '^^^ ^""^ ''''^ ^'^^ ''''' '^^^ ^^'^^ Scr Tl^ Gin 

'"-^ ^10 715 720 

II- Thr Glu Lys Asp Lys Gly Leu Tyr Glu Ala Leu Pro Thr II- Ala 

725 730 735 

Gly Asp Asn Lye His He Pro Tlo Ser Lf>u Glu A3,t Lou Ala Ala Gin 



■'''O 745 ' ' 750 

Leu Asn Lys Gly Arg Ala Glu Asn Asp Leu Tyr Thr lie Glu Tyr L-u 

760 765 

Gin Thr Glu Leu Asn Gin Leu Ser Leu Arq Gly Glu Val Leu T^'r Tyr 

20 7ft1 ^^'^ i*!" ^^"^ Ala Ala 

^'^ 7S5 ooo 

Pho V.nl Gin Met. Ho Hip Gly Glu He Leu Gin Lys Asp Asn 11- Asn 

805 810 BjLc^ 

Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys Lys Leu His Asn Leu 
oe 825 

Z5 S^r scr Gly Sc:r He Phe Glu Glu Asp Gly Gin Asn Gly Asn Met: 11- 

840 845 
Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr Glu Asp Lys Asp Cys 
855 860 



30 865 ^^"^ '^^^^ lin ^^"^ ^""^ ^^'^ Tyr 

875 RRO 
Lys Trp Leu Thr Leu Gly Phe Glu Arg Pro Asn Phe Val Leu Lys Phe 

890 895 
Glu Arg Phe He Pro Phe Gly Leu He Asn Gin He He Ala Tyr Tvr 
oc 905 

JD Gly Arg Glu Glu Gly Aia Leu Lys Arg Tyr Trp Arg Asp Gin Val II- 
91-- 920 925 

Phe Thr Ala Gly Arg Glu Met Asp Arg Gin Thr Leu Glu Gin Glu Glu 
?30 935 

40 tis "^^^ ofa ^^"^ '^^^ ^V-r Gin Ho Trp 

955 960 
He Lys Leu A.^p Phe Thr A?p Leu Ala He Ser Val Phe He Lys Glu 

965 970 ty-jc^ 

Gin Arg Lys Thr Ser Ala Lys Asp Met Gin Arg Lys Glu Ala Thr He 
AC 985 990 

4& Leu Ser A«p Met Leu Asp Met Tyr Trp Asn Asn He Pro Pro Arg Glu 
-95 1000 1005 

fiTn*^^^ ^^'^ ^''"^ ^9 Ser Thr He Arg Glu 

■lOlO 1015 1020 



50 1025 



Thr Asn Arg I.y..^ Ly.-, Arg Pro He Gin Asp Leu Ty^^^u Ser Cys Ala 

1030 1 ' 1040 

Gin Ala Asp Lys? Asp Leu Thr Glu Ser His Tyr lie His I.eu Gly Thr 

1045 1050 10S5 

Leu Asp Asp Glu Ser Lys Thr Thr Ala Arg He Ala Ala Tyr Pro Leu 
qc 1065 1070 

Lys Asn Gly Val Hfi A.-p Lys Glu Arg Val Arq Glu Val Ser Thr Arg 
1075 1080 1085 

To5n^^'^ '^^^ '•hr Ala Ly^ Gin 

i^^O 1095 1100 

60 Hd5^"'' ^HtO^'^ ^'^^ ^^"^ ""^^ '^'^'^ "^^^ 

Leu Gin Phe Phe Lys Pro Leu Glu Lys Asn Gly^Gln He Glu J}^ Tyr^^^^ 

^^^^ 1130 1135 

Tyr Asp Lys Leu Thr Ly. Phe Glu Thr Pro He His Pro Glu He Arg 
At: , ^l^^O 1145 iicr, 

65 Lys Arg He Val Glu Ala Asp C^^s Il« He Ala Leu He Ser Gin Arg 

1160 1165 
T,-r Leu Ala Thr Asp Tyr He Leu Asp His Glu Leu Pro Val Phe Arg 
1175 3 180 

1190 lisi^ 1200 



70 TxBs'''" ^""^ ^llsl'^ L^-^ ^^hsf' ""^^ ^""^ 

Glu Asp Asp Glu Phe Leu Arg Glu Lys Tyr Phe^Ma Gin Lys Ala Gin 
1205 1210 1215 

He He Asn Leu Gly Lys Glu Gly Lys Thr He Lys Ala Tyr Asp S-r 
7c; , ^^20 1225 1230 

/i5 He Thr Ala Ser Ala His Arg Asp Glu Asn Trp Vol Ala Vul Val Aro 
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' ^-^^ 1240 1245 

Glu Fhe L,.: Glu Lys He Leu Arg He Thr Lys Gin Glu Val Asn Thr 

if"" 1255 1260 

Asp Glu 

1265 

iZ] IMFX>KIIA'n0H FOR SEQ ID HO: 516 

(i| SEQUEHCE CHARACTERISTICS: 

(A) LEHGTH: 1232 amino aulds 

(B) TYt'E: amino acid 
(D) TOPOLOGY: linear 

(il) HOI.ECULE TYPE: protein 

(ill ) HYPOTHETICAL: YtS 

(vi) ORIGIHAL SOURCE: 

(A) ORGAIIISI-I: Porphyromona s gingival! ^ 

{ 1 >: ) FEATURE : 

{A| HAHE/KEY: mi s cofeature 

(B) LOCATIOri 1...1232 

(>:i) SEQtlEJlCE DESCRIPTIOl/: SEQ ID nO:Slc 
Met Tnr Thr Lys Lys Pro Gin Ala He Lou y-.cp Leu Glu Lyc Ala Tvr 

i ^ 1^ 15 

Asn He Glu He Pro Asp Leu Ser Ser Gin Glu Gly He Ser Trp S-r 

X, . 25 3n 

Val Asn Arg Tyr Phe Lys Gin Asp S=r Scr Gly Ala Val Val Glu Leu 

40 45 
Cy^ Leu Arg Glu Cys Gin He Glu Ser Met Thr Trp Leu He A^p Phe 

55 60 
Pro Ala Leu Lys Lys Leu Aep Leu Ser Tyr Asn Gin He Ser Lys Leu 

■'O 75 80 

Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys Leu Arg Leu Arg Ser 

B5 90 Q5 

Asn Gin He Arg Lys Leu Glu Gly Leu Asp Ser Leu Thr Ser Leu Thr 

iOO 103 110 

Lys Leu Ser Leu Ser Asp Asn Gin He Ser Lys Leu Glu Gly Leu Glu 

120 125 
Arg L^u Thr Ser Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin He Ser 

135 140 
Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu Glu Leu 
150 j^gQ 
Ser Gly Asn Gin He Arg Lyc Leu Glu Gly Leu Glu Arg Leu Thr Ser 

Leu Ala Thr Leu Glu Leu Ser Gly Asn gIh He Arg Lys Leu gIu Gly 

180 195 

Leu Glu Arg Leu Thr Ser Leu Thr T.ys Leu Arg T.eu Arg Ser Asn Gin 

^^=> 20O 205 

He Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Thr Leu 

Glu Leu Ser Gly Asn Gin He Arg Lys Leu Glu Gly Leu Glu Arg Leu 

Thr ser Leu Ala Thr Leu Glu Leu Ser Gly Asn Gin He Ser Lys llu 

250 255 
Glu Gly Leu Glu Arg Leu Ser Ser Leu Thr Lys Leu Arg Leu Ara Ser 

260 265 270 

Asn Gin He Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr 

280 285 

^''^ ^-"^ ^'-^ 1^/^ Leu Glu Gly Leu Glu 

295 300 
Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin He Arg 

31S 320 
Lyr Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Thr Lys Leu Arg L-u 
a 325 3:^0 

Arg ber Asn Gin He Ser Lys Leu Glu Gly Leu Asp Ser Leu Thr S-r 

■'''^ 345 
Leu TJir Lys Leu Ser Leu Scr Asp Asn Gin He Ser Lys Leu Glu Glv 
355 * 
Leu Glu Arg Leu Thr Ser Leu Ala Glu Leu Tyr Leu Leu Asp Asn Gin 
Ti I . 3fi0 

He Arg Lys Leu Glu Gly Leu A^p Gly Lou Ala Ser Lou Thr Arg I.eu 
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20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 
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385 










390 










395 










400 


Ser 


Leu 


Arg 


Arg 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Asp 


Arg 


Leu 










405 










410 










415 




LyE 


Vol 


Leu 


Arg 


Lys 


Leu 


A^p 


Val 


Ser 


Gly 


Asn 


Asp 


He 


Gin 


Ser 


He 






420 










425 










430 






Asp 


Asp 


He 


Lya 


Leu 


Leu 


Ala 


Pro 


He 


Leu 


Glu 


Gin 


Thr 


Leu 


Glu 


Lys 


435 










440 










445 








Leu 


Arg 


He 


Hi£ 


Asp 


Asn 


Pro 


Phe 


Val 


Ala 


Ser 


Ser 


Gly 


Leu 


He 


Leu 




450 










4 55 










460 










Ser 


ero 


ryr 


Asp 


Asn 


His 


Leu 


Pro 


Glu 


He 


Lys 


Ala 


Leu 


Leu 


Giu 


Lys 












470 










475 










48C 


Glu 


Lys 


Giu 


r.ys 


Gin 


Lys 


Ly5 


Thr 


Ser 


Val 


Glu 


Tyr 


His 


Pro 


Phe 


Cys 








485 










4 90 










4 95 






Vai 


Met 


Leu 


Leu 


Gly 


Asn 


His 


Ser 


Ser 


Gly 


Lys 


Thr 


Thr 


Phe 


Leu 






500 










505 










510 






Ser 


Gin 


Tyr 


Asp 


Thr 


Asn 


Tyr 


Thr 


Tyr 


Cln 


Lys 


Asn 


Thr 


His 


Val 


Leu 






515 










520 










525 








Ser 


lie 


His 


Arg 


Ser 


Asn 


Asn 


Pro 


Asn 


Ala 


lis 


Phe 


Tyr 


Asp 


Phe 


Gly 




S30 










535 










5-10 










Gly 


Gin 


Asp 


Tyr 


Tyr 


His 


Gly 


He 


Tyc 


Gin 


Ala 


Phe 


Phe 


Thr 


Thr 


Gin 


54S 








550 










555 










560 


S^c 


L^u 


Tyr 


Leu 


Lsu 


Pits 


Trp 


Asp 


Ala 


Lys 


T.ys 


Asp 


Arg 


Asn 


Fhe 


Val 








565 










570 










575 




Ser 


Val 


hsp 


Asp 


Lys 


Glu 


T>/r 


Gin 


Thr 


Leu 


Asn 


Phe 


Asn 


Arg 


Pro 


Tyr 








580 










585 










590 






Trp 


Leu 


•31 y 


Gin 


He 


Ala 


Tyr 


Ala 


Cys 


Asn 


Arg 


Cys 


Ilet 


S*r 


Val 


Gly 




S95 










600 










605 








Gly 


Asn 


Pre 


Asp 


Gly 


Lys 


Asp 


Thr 


Pro 


Gin 


Thr 


Thr 


Asp 


Asp 


Thr 


He 


610 










615 










620 










Ile 


lie 


Gin 


Thr 


His 


Al- 


Asp 


Glu 


Thr 


Gly 


7a a 


Lys 


Gin 


Gin 


Thr 


Leii 


G25 










630 










635 










640 


Gly 


Cys 


Ala 


Ala 


Giu 


Ann 


Gly 


Val 


Leu 


Glu 


Glu 


He 


Tyr 


Val 


Ser 


Leu 






645 










650 










655 




Giu 


Pro 


Lys 


Ala 


Asn 


Ser 


Ala 


Val 


His 


Ala 


Leu 


Asn 


Tyr 


Leu 


Asn 


Glu 






660 










665 










€70 






Arg 


Val 


Arg 


Giu 


Val 


Val 


Ala 


Ser 


Arg 


Ser 


Lys 


Ser 


He 


Gin 


He 


Thr 




67 5 










680 










685 








Giu 


Lys 


Asp 


Lys 


Gly 


Leu 


Tyr 


Glu 


Ala 


Leu 


Pro 


Thr 


He 


Ala 


Gly 


Asp 




€90 










695 










700 










Asn 


Ly.«« 


His 


He 


Pro 


He 


Ser 


Leu 


Glu 


Ala 


Leu 


Ala 


Ala 


Gin 


Leu 


Asn 


7 05 








710 










715 










720 


Lys 


Gly 


Arg 


Ala 


Giu 


Asn 


Asp 


Leu 


Tyr 


Thr 


He 


Glu 


Tyr 


Leu 


Gin 


Thr 








725 










730 










735 




Giu 


Leu 


Asn 


Gin 


Leu 


Ser 


Leu 


Arg 


Gly 


Glu 


Val 


Leu 


Tyr 


Tyr 


Arg 


Glu 








740 










745 










750 






hsxi 


Giu 


Lys 


Leu 


Asn 


Atti 


Tyr 


Val 


Trp 


Leu 


Asp 


Pro 


Ala 


Ala 


Phe 


Val 






755 










760 










765 








Gin 


Met 


lie 


His 


Gly 


Glu 


He 


Leu 


Gin 


Lys 


Asp 


Asn 


He 


Asn 


Arg 


Gly 




770 










775 










780 










Thr 


Val 


Pro 


Lys 


Arp 


He 


Phe 


Glu 


Cys 


Lys 


Leu 


His 


Asn 


Leu 


ser 


Ser 


780 










790 










7 95 










800 


Gly 


Ser 


He 


Phe 


Giu 


Glu 


Asp 


Gly 


Gin 


Asn Gly 


Ai-n 


Met 


He 


Leu 


Gin 








805 










810 










815 




Leu 


Leu 


Leu 


Giu 


Glu 


Leu 


He 


Va] 


Tyr 


Glu Asp 


Lys 


Asp 


Cys 


Tyr 


Val 








820 










825 










830 






lie 


Pro 


Gly 


Tyr 


Leu 


Pro 


Leu 


His 


Ser 


Asp Asp 


Glu 


Ala 


Tyr 


Lys 


Trp 






935 










840 










845 








Lsu 


The 




Gly 


Phe 


Glu 


Arg 


Pro 


Asn 


Phe 


Val 


Leu 


Lys 


Phe 


Glu 


Arg 




B50 










855 










860 










Ph* 


Il-a 


ero 


Phe 


Gly 


Leu 


He 


Asn 


Gin 


He 


He 


Ala 


Tyr 


Tyr 


Gly 


Arg 


665 










870 










875 








880 


Giu 


Giu 


Gly 


AJ« 


L<»M 


Lys 


Arg 


Tyr 


Trp 


Arg Asp 


Gin 


Val 


He 


Phe 


Thr 










885 










R90 










895 




Ala 


Gly 


Arg 


Giu 


Met 


Asp 


Arg 


Gin 


Thr 


Leu 


Glu 


Gin 


Glu 


Glu 


G.l (J 


Ly-- 








900 










90b 










910 






Giu 


Gly 


Leu 


Pro 


Lys 


Thr 


A^n 


Ala 


Glu 


Asp 


Tyr 


Gin 


lie 


Trp 


He 


Lys 






915 










920 










925 








Lsu 


Asp 


Phe 


Thr 


Aip 


Leu 


Ala 


He 


Ser 


Val 


Phe 


He 


Lys 


Glu 


Gin 


Arg 




930 










935 










910 










Lye 


Thr 


Scr 


Ala 


Lys 


Asp 


Met 


Gin 


Arg 


Lys 


Glu 


Ala" 


Thr 


He 


Leu 


Ser 












950 










955 










900 


Asp 


Met 


Leu 


Afc-p 


Met 


Tyr 


Trp 


Asn 


Asn 


He 


Pro 


Pro Arg 


Glu 


Gin 


He 








965 










970 










9-^5 




Gly 


Asp 


Lys 


Asp 


Thr 


Giu 


Gin 


Thr 


Arg 


Ser 


Thr 


Tie 


Arg 


Glu 


Thr 


Asn 








900 










985 










990 
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Arg Lys Lys Arg Pro lie Gin Asp Lau Tyr Leu Sei Cys Ala Gin Ala 

?95 lOOO 1005 

A-'Tp I.ys Ar»p l.eu Thr Glu Sex His Tyr lie His Leu Gly Thr Leu Asp 

1010 1015 1020 

Asp Glu Sec Lys Thr Thr Ala Arg lie Ala Ala Tyr ?ro Leu Lys Asn 
102S 1030 1035 1040 

Gly Va.1 Tie A?p T.ys Glu Arg Val Arg Glu Val Ser Thr Arg Pro Tyr 

1045 1050 1055 

Lys His Lfeu Ser Val Asn Lys Asn Leu Ala Thr Ala Lys Gin lie Phe 

1060 1065 1070 

lie Ser Tyr Ser Lys Glu Asp Gin Thr Glu Leu Glu Thr Cys Levi Gin 

1015 1080 1085 

Pha Phs Lys Pro Leu Glu Lys Asn Gly Gin He Glu lie Tyr Tyr Acp 

1090 1095 1100 

Lys Leu Thr Lys Phe Glu Thr Pro lie His Pro Glu lie Arg Lys Arg 
1105 1110 1115 1120 

He Val Glu Ala Asp Cys He Tlo Aln Leu Tic Ser Gin Arg Tyr T.cu 

1125 1130 1135 

Ala Thr Asp Tyr He Leu Asp His Glu Leu Fro Val Phe Arg Glu Tyr 

1140 1145 1150 

Asn Lys The He Val pro He Leu IIg Lys Pro Cys Thr Phe Glu Asp 

1155 1160 1165 

Asp Glu Phe Leu Arg Glu Lys Tyr Phe Ala Gin Lys Ala Gin He He 

1170 1175 1180 

Asn Leu Gly Lys Glu Gly Lys Thr 11;^ Ly^ Mo Tyr Asp Ser Ilc^ Thr 
1185 1190 1195 1200 

Ala Ser Ala His Arg Asp Glu Asn Trp Val Ala Val Val Arg Glu Phe 

1205 1210 1215 

Lys Glu Lyc Ho Leu Arg Tie Thr Lys Gin Gin Val A.-rn Thr Astp Glu 
1220 1225 1230 

(2) THFORIIATION FOR SEQ ID NO: 517 

(i) SEQUEMCE CHARACTERISTICS: 

(A) LENGTH: 1175 amino acids 
(R) TYPE; amino acid 
(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: protein 

(iii) Hx'POTHETlCAL: YES 

tvi) ORIGINAL SOURCE: 

(A) ORGAfflSH; Pcrphyromonas gingivalis 

(ix) FEATURE: 

(A) NAJfE/KEY: miec_f eaturo 

(B) LOCATION 1. . .1175 

Ui) SEQUENCE DESCRIPTTON: SEQ ID HO: 517 



net 


Thr 


Trp 


Leu 


He 


Asp 


Phe 


Pro 


Ala 


Leu 


Lys 


Lys 


L&u 


Asp 


Leu 


Ser 










5 










10 










15 




Tyr 


Af n 


Gin 


He 


Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 








20 




















30 






Thr 


Lys 


Leu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Arg 


Lys 


Ltju 


Glu 


Gly 


Lsu 






35 










40 










45 








Asp 


Ser 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 


Leu 


Ser 


Leu 


Ser 


Asp 


Asn 


Gin 


He 




50 










55 










6C 










Ser 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Glu 


Leu 


Tyr 


65 








70 










75 










30 


Leu 


Leu 


Asp 


Asn 


Gin 


He 


Ser 


Lye 


Leu 


Glu 


Gly 


Let I 


Glu 


Arg 


Leu 


Thr 










85 










90 










95 




Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


L«u 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 


Leu 


Glu 








100 










105 










110 






Gly 


Leu 


Gilt 


Arg 


Leu 


Thr 


Ser 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 






115 










120 










125 








Gin 


He 


Arg 


Lys 


Leu 


Glu 


Gly 


Leu 


Glu 


Arg 


Leu 


Thr 


Ser 


Leu 


Thr 


Lys 




130 










135 










140 










I.cu 


Arg 


Leu 


Arg 


Ser 


Asn 


Gin 


He 


Ser 


Lys 


Leu 


Giu 


Gly 


Leu 


Glu 


Arg 


145 










ISO 










155 










160 


Leu 


Thr 


Ser 


Leu 


Als 


Thr 


Leu 


Glu 


Leu 


Ser 


Gly 


Asn 


Gin 


He 


Arg 


Lys 










165 










170 










175 




Leu 


Giu 


Gly 


Leu 


Giu 


Arg 


Leu 


Thr 


Scr 


Leu 


Ala 


Thr 


Leu 


Glu 


Leu 


ser 








180 










185 










190 
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15 



20 



35 



45 



60 
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Gly Afn Gin Tie Ser Lys I.ru Glu Gl y Leu Glu Arg Leu Ser Ser L-u 

195 200 205 

Thr Lys Leu Arg Leu Arg Ser Asn Gin 11- Ser Lys Leu Glu Gly Leu 

210 215 220 

Glu Arg Leu Thr Ser Leu Thr hys I.eu Ssr T.-u Ser Asp Aa^ vlln II- 
230 235 240 

Ser Lys Leu Glu Gly Leu Glu Arg Leu Tnr Sex Leu Ala Glu Leu Tyr 



75 



I.eu Leu Aep A.-n Gin lie Arg Lys Leu Glu Gly Leu Glu Arg Leu Thr 

2^0 265 270 

Ser i.eu Thr Lys Leu Arg Leu Arg Ser A.n Gin lie Ser Lys Leu Glu 

2^5 280 205 

Gly I.CU Arp Ser Leu Thr Ser Leu Thr L,- Leu Ser Leu Ser Asp Asn 

^'O 295 300 

Gin lie Ser Lys Leu Glu Gly Leu Glu Arg Leu Thr Ser Leu Ala Glu 

310 315 
Leu Tyr Leu Leu A^p Asn Gin He Arq Ly- Leu Glu Gly Leu Asp Glv 

325 330 
Leu Aia Ser Leu Thr Arg Leu Ser Leu Arg Arg Airn Gin lie S-r L-- 

345 350 
Leu Glu Gly Leu Aep Arg Leu Lys Val Leu Arg Ly? Leu Asp Val S-r 
355 360 36«= 



Gly Asn Asp lie Gin Ser He A:,-p A^p 11^ Lys Leu Leu Ala Pro II- 

376 JQQ 

T.ou Glu Gin Thr Leu Glu Lys Leu Arq lie Jlis Asp Asn Pro Phe Val 



390 305 400 

Leu He Leu Ser Pro Tyr AffD Asn His Leu Pro 
4^55 4ir. 

Leu GJu Ly.-; Glu Lys Glu Lys Gin Ly.-? Lys 

''^O 425 



lie Lys Ala Leu Leu GJu hy,-, Glu Lys Glu Lys Gin Lys Lys Thr S-r 

''20 425 430 

Val Glu Tyr His Pro Phe Cys Lys Val Met Leu Leu Gly Asn His Ser 

Ser Gly Ly^r Thr Thr Phe Leu Ser Gin Tyr Asp Thr A^n Tyr Thr Tyr 

455 460 

Gin Lys Asn Thr His Val Leu Ser He His Arg Ser A^n Asn Pro Asn 
•ibs 470 t-ti^ 



Ala lie Phe Tyr Asp Phe Gly Gly Gin Asp ^yr Tyr His Gly II- ^yr 

490 495 
Gin Aia Phe Phe Thr Thr Gin Ser Leu Tyr Leu Leu Phe Trp Asp Ala 

Lys Lys Asp Arg Asn Phe Val Ser Val Asp Asp Lys Glu Tyr Gin Thr 

515 520 525 

Leu Asn Phe Asn Arg Pro Tyr Trp Leu Gly Gin He Ala Tyr Ala Cys 
^-'^ 535 



Asn Arg Cy.. I,et Ser Val Gly Gly Asn Pro A^p Gly Lys Asp Thr Pro 

550 555 
Gin Thr Thr Asp Asp Thr He He He Gin Thr His Ala Asp Glu Thr 

555 575 
Gly Ala Lys Gin Gin Thr Leu Gly Cys Ala Aia Glu A^n Gly Val Leu 

505 590 
Glu Glu He Tyr Val Ser Leu Glu Pro Lys Ala Asn Ser Ala Val His 

595 600 605 

Ala Leu Asn Tyr Leu Asn Glu Arg Val Arg Giu Val Val Ala Ser Arg 

55 ser Lys Ser He Gin He Thr Glu Lys Asp Lys aly Leu Tyr Glu Ala 

«30 635 640 



Leu Pro Thr He Ala Gly Asp Asn Lys His He Pro He Ser L-u Glu 
645 

Aia Leu Ala Ala Gin Leu A.^n Lys Gly Arg Ala Glu A.sn Asp L-u Tyr 
€60 6S5 — - ~ ^ 



Thr He Glu Tyr Leu Gin Thr Glu Leu Asn Gin Leu Ser lIu Arg Gly 

600 6ab 

Glu Val Leu Tyr Tyr Arg Glu Asn Glu Lys Leu Asn Asn Tyr Val Trp 

b5 L,u A^p fro Ala Al» Phe Val Gin Met He Kl. Gly Giu He Leu Qln 

710 71S 
Lys Asp Asn He Asn Arg Gly Thr Val Pro Lys Asp He Phe Glu Cys 

Asn Leu Ser Ser Gly Sej 

''^^ -MS 750 



— 'jv 735 

Lys Leu His Asn Leu Ser Ser Gly Ser He Phe Glu Giu Asp Gly Gin 

'MS 750 
Asn Gly Asn Met He Leu Gin Leu Leu Leu Glu Glu Leu He Val Tyr 

Glu Asp Lys Asp Cys Tyr Val He Pro Gly Tyr Leu Pro Leu His Ser 

Asp Asp Glu Ala Tyr Lys Trp Leu The Leu Gly Phe Giu Arg Pro Asn 
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785> 










7S0 










795 








800 






Leu 


Lys 


Phs 


Glu 


Arg 


Phe 


He 


Pro 


Phe 


Gly 


Leu life 


Asfi 


Gin 










805 










810 








815 




I1& 


lie 


Ala 


Tyr 


Tyr 


Gly 


Arg 


Glu 


Glu 


Gly 


Ala 


Leu 


Lys Arg 


Tyr 


Trp 








820 










825 








830 






Arg 


Asp 


Gin 


Val 


He 


Phe 


Thr 


Ala 


Gly 


Arg 


Giu 


ileL 


Asp Arg 


Gin 


Thr 






S35 










840 










845 






Leu 


Glu 


Gin 


Glu 


Glu 


Glu 


Lys 


Glu 


Gly 


Leu 


Pro 


Lys 


Thr Asn 


Ala 


Glu 




850 










855 










860 








Asp 


Tyr 


Gin 


ale 


Trp 


He 


tys 


Leu 


Asp 


Phe 


Thr 


Asp 


Leu Ala 


He 


Scr 


665 










870 










875 








880 


Val 


Phc 


11 0 


hys 


Glu 


Gin 


Arg 


Lys 


Thr 


Ser 


Ala 


Lys 


Asp Met 


Gin 


Arg 










885 










890 








895 




Lys 


Glu 


Ala 


Thr 


He 


Leu 


Ser 


Asp 


Met 


Leu 


Asp 


Met 


Tyr Trp Asn 


Asn 








900 










905 








910 






lie 


rro 


Pro 


Arg 


Glu 


Gin 


He 


Gly 


Asp 


Lys 


Asp 


Thr 


Giu Gin 


Thr 


Arg 






915 










920 










925 






Ser 


Thr 


lie 


Arg 


Glu 


Thr 


A£n 


Arg 


Lys 


L-^'s 


Arg 


Pro 


He Gin 


Asp 


Leu 














935 










940 








Tyr 


Leu 




Cys 


Ala 


Gin 


Ala 


Asp 


Lys 


Asp 


Leu 


Thr 


Giu Ser 


His 


Tyr 


945 










950 








955 








960 


11% 


His 


Leu 


Giy 


Thr 


Lisu 


Asp 


Asp 


Glu 


Scr 


Lyc 


Thr 


Thr Ala 


Arg 


He 










965 










97 0 








975 




Al a 


Ala 


Tyr 


Pro 


Leii 


Lys 


Asn 


Gly 


Val 


He 


Asp 


Lys 


Glu Arg 


Val 


Arg 


















9R5 








990 






Glu 


VaI 


Sec 


The 


Arg 


Pro 


Tyr 


Lys 


His 


L^u 


Ser 


Val 


Asu Lys 


Asn 


Leu 
















1000 








lOQb 






Ala 


Thr 


Ala 


LV£ 


Gin 


He 


Phe 


lie 


Ser 


Tyr 


Ser 


Lys 


Glu Asp 


Gin 


Thr 




1010 








1015 








1020 






Glu 


Leu 


Glu 


Thr 


Cys 


Leu 


Gin 


PUe 


Phe 


Lys 


Pco 


Leu 


Glu Lys 


Asn 


Gly 


1025 








1030 








1035 






] 


Gl n 


Tlo 


Glu 


He 


Tyr 


Tyr 


Asp Lys 


Leu 


Thr 


Lys 


Phe 


Glu Thr 


Pro 


He 










1045 








1050 






1055 


His 


Pro 


Glu 


He 


Arg 


Lys 


Arg 


He 


Vai 


Glu 


Aia 


Asp 


cys Tie 


He 


Ala 








1060 








1065 






1070 




Leu 


lie 


Ser 


Gin 


Arg 


Tyr 


Leu 


Ala 


Thr 


Asp 


Tyr 


He 


Leu Asp 


His 


Glu 






1075 








loeo 








1085 






Leu 


Pro 


Vdl 


Phe Arg 


Glu 


Tyr Asn 


Lys 


Thr 


He 


Val 


Pro He 


Leu 


He 




1090 








1095 








1100 






Lys 


Pro 


Cys 


Thr 


Phe 


Glu 


Asp Asp 


Glu 


Phe 


Leu 


Arg 


Glu Lys 


Tyr 


Phe 


1105 








1110 








1115 








Aid 


Gin 


Lye 


Ala 


Gin 


He 


He 


Asn 


Leu 


Gly 


Lys 


Glu 


Gly Lys 


Thr 


He 










1125 








H30 






1135 


Lys 


Ala 


Tyr 


Asp 


Ser 


He 


Thr 


Ala 


Ser 


Aid 


His 


Arg 


Asp Glu 


Asn 


Trp 








1140 








1145 






HbO 




Vil 


Ala 


Val 


Val 


Arg 


Glu 


Phe 


l.yfl 


Glu 


L'\*s 


lis 


Leu 


Arg lie 


Thr 


Lys 






llbl* 








1160 








1165 






Gin 


Glu 


Vai 


As n 


Thr 


Asp 


Glu 



















1040 



1120 



1170 1175 



IZ) INfc'OKilATION FOR SEQ ID HO: 51 3 

(11 SF.QUF.IiCE CHARACTERISTICS: 

(A) LENGTH: 22 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) llOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

ORIGIMAL SOURCE: 
(A) ORGAMISIi: Porphyromona.? gingivalis 

(i>:l E-EATURE: 

(A) NAUE/KEY: misc^feature 

(B) LOCATION 1 . . .229 

(xi) SEQOEIJCE DESCRIPTIOM: SEQ ID 110:518 

Met llec Lys Lys Ala Phe Val Phe Val Leu Leu Vai Cys Leu Phe Sec 

IS 10 IS 

Ser Phe Ser Ser Ser Aia '31n Thr Thr Thr Asn Ser Sex Arg Ser Tyr 

20 25 30 

Phe Thr Gly Arg He Glu Lys Vai Ser Leu Asn Lfeu Gly Val Pro Pro 
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35 40 
Val Ser Thr Glu Val Trp Gly M-c Thr His Asp Ala Asn Gly Leu Pro 

35 60 
Fhe Glu He Pro He Ser Phe Ser Arg Phe Am S-r Gin Gly Asp li- 
es 70 -- J f " 



75 80 



Ala Thr Thr Tyr Tyr He Ala Asn Ser Glu Ala Thr Leu Asn Glu Trp 

ft-'i 90 05 

Cys Asp Tyr Ala His Pro Gly Gly Tie Val Arg Val Glu GJ y Arq Fh- 

100 105 no 

Trp Lys Her Thr Tyr A.<^n He Pro Thr Tyr Asn Ala Val Cys Thr Arg 

115 120 125 

He Thr Phe Glu Asn Gin Glu He Glu Gly Thr He Val Leu He Pro 

130 135 140 

Lys Pro Lys Val Ser Leu Pro His Val Ssr Glu Ser Val Pro Cys II- 
1^*^ 1-° 155 160 

Arg Thr Glu Ala Gly Arg Glu Phe He Leu Cys Glu Glu Asp Asp Thr 

165 170 175 

Phe Val Ser His Asp Gly Asn Glu Val Thr He Gly Gly Lys Pro Phe 

180 185 190 

Leu Leu Asn Thr Asn Val Lyr Ho Val Gly Asp Val Sor Gin Lv° T"r 

135 200 205 

Ala Val Gly Val Gly Glu He Arg Phe Leu Gin He Cys Ala Gin Thr 

210 215 220 

Val Ser Gin Gin Lys 
225 

(a) IMFOWIATIOM for SEQ id II0:519 

ii) 3EQUE1ICE CHARACTERISTICS: 

(A) LEMGTIl: 228 amino acids 
{B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) ORIGII^AL SOURCE: 

(A) ORGAfJISM: Porphyromonas gingivalis 

fix) FEATURE: 

(A) HAI'iE/KEY: misc_feature 

(B) LOCATIOt^ 1 . . .228 

(Xi) SEQUENCE DESCRIPTIOI/: SEQ ID HO: 51 9 
net LV3 Lys Ala Phe Val Phe V-1 Leu Leu Val Cys Leu Phe Ser Ssr 



1 5 



10 15 



Phe Ser Ser Ser Ala Gin Thr Thr Thr Asn Ser Ser Arg Ser Tvc Ph- 

20 25 30 

Thr Gly Arg He Glu Lys Val ser L-u Asn Leu Gly Val Pro Pro Val 

35 40 45 

Ser Thr Glu Val Trp Gly HeL Thr His Asp Ala Asn Gly Leu Pro Phe 

55 60 
Glu Tie Pro He Ser Phe Ser Arg Phe Asn Ser Gin Gly Asp He Ala 

'^^ 75 eo 

Thr Tnr T-yr Tyr He Ala Asn Ser Glu Ala Thr Leu Asn Glu Trp Cvs 

85 90 95^^ 

Asp Tyr Ala His Pro Gly Gly He Val Arg Val Glu Gly Arg Ph- Trp 

1'5<> 105 HO 

Lye Met Thr Tyr Asn He Pro Thr Tyr Asn Ala Val Cys Thr Arg He 

115 120 125 

Thr Phe Glu Asn Gin Glu 11*^ Glu Gly Thr He Val Leu He Pro Lys 

13*^ i:i^ 140 

Pro Lys Val Ser Leu Pro His Val Ser Glu Ser Val Pro Cys He Arg 
^''^ IS'^ 155 160 

Thr Glu Ala Gly Arg Glu Fhe He Lou Cys Glu Glu Asp Asp Thr Ph- 

165 170 175 

Val S-r His Asp Gly Asn Glu Val Thr He Gly Gly Lys Pro Phe Leu 

180 185 - iQc 

Leu Asn Thr Asn Val Lyt He Val Gly Asp Val Ser Gin L-s T"r A]n 

195 200 205 

Val Gly Val Gly Glu He Arg Phe Leu Gin He Cys Ala Gin Thr Val 

^^-^ 215 220 

Ser Gin Gin Lys 
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225 



{2) IMFORIIATIOH FOR SEQ ID HO: 520 

(i) SEQUENCE CHARACTERISTICS: 

(A) I.EMCTTH: 540 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

(il) HOLECULE TYPE: protein 

(ixi) HYPOTHETICAL: YES 

(vi) ORrGIMAT. SOURCE: 

(A) ORGANISt^: Porphyromonas gingiva lis 

tJx) FFATtmE: 

tA) MAI IE/ KEY: raisc_f eature 
IB) LOCATiai 1 . . .540 

SEQUEUCk: DBSCRIPTIOH : SEQ ID HO:S20 

HeL Lys Thr Lys Val Leu Arg Lys Ph» Vai Val Ala Ala Phe Ala Val 

1 5 10 IS 

Ala Thr r.eu Cy.«? Pro Leu Ala Gin Ala Gin Thr ff*t Gly Gly At-p Asp 

20 25 30 

Val Lys Vai Val Gin Tyr Asn Gin G.lu Lyff Leu Val Gin Thr Arg Met 

35 40 45 

S-r Val Ala Asp Asn Gly Trp II a Tyr Val Met The Hi£ Ser Gly Tyr 

50 55 60 

Asp Thr Gly Asn Sor Asn Val Lys He Phe Arg Ser Lys Asp Gin Gly 
St "70 75 80 

Ala Thr Tyr Gin Lys Leu Arg Asp Trp Asp Pro S-sr Asp Asp Tyr Gin 

85 90 95 

Phe Gin Asp Phc Asp lie Val Val Thr Gly Lys Asn Glu Ser Asp lie 

100 105 110 

Lys lie Trp Ser Val Glu Leu Het Asn Lys Pro Gly Gly Tyr I.ys Ser 

115 120 125 

Arg Val Ala Val Phe 5f?r Arg Asp Ala Asn Ala Gin Asn Ala Lys Leu 

130 135 140 

Val Tyr Lys Glu Asp Phe Ser Asn Val Gin Leu Tyr Asp Val Asp lie 
145 150 155 160 

Ala Ser Ai?n Ty^ Arg Ser Pro Ser Ser Leu Asn Asn Gly Gly Asn Pro 

165 IfO 1"7S 

Phe Ala Leu Ala Phe Ala Tyr Thr Gly Phe A-rn Asn Thr His Lys lie 

180 165 190 

Ser Phe Val Asp Tyr Val Phe Sec Leu Asn Gly Gly Gin Asn Phe Asn 

195 200 205 

Lys Asn Leu Leu Phe Ser Gin Asp Gly Glu L-s Lys lie Asp Lys Val 

210 215 220 

Asp Leu Ser Leu Gly Ser Thr Ser Glu Ser Met Gly His Asn Ala Trp 
225 230 235 240 

Pro Leu Met Gly Val Val Phe Glu Het Asn Lys Gin Gly Gly Lys Ser 

245 250 255 

Asp lie Gly Phe Leu Ser Asn Phe Val Asp Asn Asp Pro Glu Phe Gin 

250 265 210 

Trp Ser Gly Pro lie Lys Val Ser Glu Ser Asp Met Ser Phe Ser Pro 

275 280 265 

Lys He Gin Met Leu Leu Asp Glu Asp Asn Asn Thr lie Asn Gly Glu 

290 295 300 

Ser C'ys His Asn Phe Ilet lis Thr Tyr Ser Asp Tyr Asp Ser Glu Tyr 
305 310 315 320 

Ser Asp Trp Asp He Arg Tyr Val Tyr Pro Lys Lys Ser Phe Lye Tyr 

325 330 335 

Glu Lys Gly Lys Thr Pro Thr Met Asp Asp Leu Val Glu Ala Phe Leu 

340 345 350 

Thr Ala Ser Tyr Gin Ser Glu Thr Asn Ser Gly Leu Gly Tyr Asp Ly? 

355 300 365 

Asn Ala Asn Hln Tyr Leu He Thr Tyr Ala Lys Lys Glu Glu Asn Gly 

37 0 37 5 390' 

Thr Asn Thr Leu Lys Tyr Arg Trp Ala A^n Tyr A?p Lys He His Asn 
365 330 305 400 

Lys Asp Leu Trp Ser Asp Thr Phe Thr Tyr Thr Ser Ser Ala Asn Ala 

405 410 415 

Leu Tyr Thr Pro Gin Val Asp He Asn Pro The Lys Gly Leu Val Cyc 
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•*20 425 
Trp S,r Trp Val Glu Tyr Leu Pro Gly Lyr Arg lie Val Trp Ser Asp 

445 

Thr Gin Trp Thr His Ala Am Gly VaJ Glu Arp Tie Val Hst Gin Glu 
455 

Gly Ser Met Lys Leu Tyr Pro Asn Pro Ala Gin Glu Tyr Ala Val 11- 
i . ^'^^ 475 

Ser Leu Pro Thr Ala Ala A^n Cys Lys Ala Val Val Tyr A.-p n-t Gin 
485 

Gly Arg Val Val Ala Giu Ala Ser Phe Ser Gly Asn Giu Tyr Arg Leu 

"* 505 52^0 

Asn Val Gin His Leu Ala r.ys Gly Thr Tyr 11*^ I.eu l.ys Va] Val S-r 

515 S20 525 

Asp Thr Glu Arg Phe Val Glu Lys Leu I^e Val Glu 
530 535 540 

(2) IMFORJIATIOK FOR SEQ ID NO: 521 

(1) SF.QaEMCE CHARACTERISTICS; 

(A) LEJIGTH: 771 amino acids 
(BJ TYPE: amino acid 
(H) TOPOLOGY: linear 

Ul) MOLECULE TYPE: protein 

(lii) HYPOTHETICAL: YES 

<vi) ORIGINAL SOURCE: 

(A) ORGAIIISII: Porphyromonar gingivall^ 

(ix) FEATURE: 

(A) IE/KEY: mi sr. featur- 

(B) LOCATION 1. . .771 

3EQUEMCE DESCRIPTIOM: SEQ ID MO: 521 

Met Leu Thr 11 o Arg Asn Phe Leu Leu Phe Cys Cys Leu Ser Leu lie 

Ala Phe Ala Ala Asp Ala Gin Ser Ser vil Ser Ser Gly Arg Arg Leu 

25 30 
Thr Glu Tyr Val Asn Pro Phe lie Gly Thr Ala A^n Tyr Gly Thr Thr 

Asn Pro Gly Ala Val Leu Pro Asn Gly Leu llet Ser vll Thr Pro Phe 

55 go 
Asn Val Ser Gly Ser Thr Glu Asn Arg Pho Acp Lys Asp Ser Arg Trp 

Trp ser Ala Pro Tyr Ser Ala Asp Asn Ser Tyr Cys He Gly Phe lev 



85 on 



lie Leu 
Tyr Gly 



Hie Val Asn Leu Ser Gly Val Gly Cys Pre Glu Leu Ser Glv 

105 

Leu Het Ala Thr Ser Gly Thr Phe Asp Pro Affp Tyr Cys C"s 

115 120 X25 "* 

Ser Ser Leu Ser Arg Glu Tyr Ala Arg Pro Gly clu Tyr Lys Ala Val 

^-3" 135 14Q 

Leu Asp Lys Tyr Gly lie A^p Ala Ala Val Thr Val Thr Glu Arg Thr 
150 15S - g_ 

Ala Leu Thr Glu Phe Ala Pho Pro Glu Gly Glu Gly His II- L-u L-u 

165 170 
Asn Leu 'Jly Gin Ala Leu Ser Asn Glu Ser Gly Ala Ser V.,1 Arg PI,- 

100 185 , 

Leu Asn Asp Ser Thr Val Val Gly Ser Arg Leu Met Gly Thr The Cys 

1°5 200 205 

Tyr Asn Pro Gin Ala Val Phe Arg Gin Tyr Pn^ Val Leu Glr. Val Ser 

Arg Arg Pro He Ser Ala Gly Tyr Trp Lys Lys Gin Pro Pro Wet Th 



230 



235 240 



Val Giu Ala Gin Trp P^p Ser Thr Ala Gly Ly3 Tyr Lys Gin Tyr A.-p 

^ 245 250 255 

Gly Tyr Lys Arg Glu Het Ser Gly Asp Asp He Gly Val Arg Phe S-r 

^60 265 27n 

Phe Asn Cyj- A^p Gin Gly Glu Lys He Tyr Val Arg Ser Al Val S-r 

280 28*^ 
Phe Val Ser Glu Ala Asn Ala Leu Tyr Asn Lou Glu Ala Glu Gin Glu 

290 295 300 

Glu Val Phe Lys Ser Val Gly Gly Asn Pro Ala Lys Ala Phe Ser Al. 
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305 










310 










315 










320 


lie 


Arg 


Ser 


Arg 


A\a 


He 


Glu 


Arg 


Trp 


Glu 


Glu 


Ala 


Leu 


Gly 


Thr 


Val 










325 










330 










335 




Glu 


Val 


Glu 


Gly 


Gly 


Thr 


Pro 


Asp 


Glu 


Lys 


Thr 


He 


Phe 


Tyr 


Thr 


Ala 








340 










345 










350 






Leu 


Tyr 


His 


Lsu 




Hp 


His 


Pro 


Asn 


He 


Leu 


Gin 


Asp 


Ala 


Asn 


Gly 




355 










360 










365 








Glu 


Tyr 


Pro 


Met 


Met 


Gly 


Ser 


Gly 


Lyt- 


Thr 


Gly 


Asn 


Thr 


Ala 


His 


Asp 




370 










375 










380 










Arg Tyr 


Thr 


Val 


Phe 


Ser 


Leu 


Trp 


Asp 


Thr 


Tyr 


Arg 


Asn 


Val 


His 


Pro 


385 










390 










3 95 










400 


Leu 


Leu 


Cys 


Leu 


Leu 


TiT 


Pro 


Glu 


Lys 


Gin 


Lsu 


Asp 


Met 


Val 


Arg 


Thr 










405 










410 










415 




Leu 


He 


Aep 


Met 


Tyr 


Arg 


Glu 


Ser 


Gly 


Trp 


Leu 


Pro 


Arg 


Trp 


Glu 


Leu 








420 










425 










430 






Tyr 


Gly 


Gin 


Glu 


Thr 


Leu 


Thr 


Met 


Glu 


Gly Aep 


Pro 


Scr 


Leu 


He 


Val 




435 










440 










445 








Ho 


Asn 


Asp 


Thr 


Trp 


Gin 


Arg 


Gly 


Leu 


Acg 


Ala 


Phe 


Asp 


Thr 


Ala 


Thr 




450 










455 










4 60 










Ala 


Tyr 


Glu 


Ala 


Met 


Lys 


Lye 


Asm 


Ala 


Ser 


Sor 


Ala 


Gly Ala 


Thr 


His 


4 65 








470 










475 










480 


Pro 


He 


Arg 


Pro 


Arp 


Asn 


A.»p 


Asp 


Tyr 


Leu 


Thr 


Leu 


GlY 


Phe 


Val 


Pro 










485 










4 90 










495 




Leu 


Arg 


Glu 


Gin 


Tyr 


A5p 


Asn 


Ser 


Val 


Ser 


Hi;; 


Ala 


Lsu 


Glu 


Tyr 


T J r 








500 










505 










510 






Leu 


Ala Acp 


Trp 


Asn 


T.eii 


Ser 


Arg 


Phe 


Ala 


His 


Ala 


Leu 


Gly 


His 


Lys 






515 










sr>D 










525 








Glu 


hsp 


Ala 


Ala 


Leu 


Phe 


Gly 


Lys 


Arg 


Ser 


Leu 


Gly Tyr 


Arg 


H3s 


Tyr 




530 










535 










54 0 










Tyr Acn 


Lys 


Gl u 


Tyr 


Glv 


Het 


Leu 


Cys 


Pro 


Leu 


Leu 


Pro 


Asp 


Gly 


Ser 


545 










550 










555 










560 


Phe 


Lea 


The 


Pro 


PUe 


Asp 


Pro 


T.ys 


Gin 


Gly 


Glu 


Asn 


Phe 


Glu 


Pro 


Asn 










565 










570 










575 




Pro 


iJly 


Ph* 


His 


Glu 


Gly 


Ser 


Ala 


Tyr 


Asn 


Tyr 


Ala 


Phe 


Phe 


Val 


Pro 






580 










585 










590 






Hi£ 


Afp 


He 


Gin 


Gly 


I.eu 


Ala 


Arg 


Leu 


Het 


Gly 


Gly 


Ala 


Lys 


Val 


Phe 






595 










600 










60S 








Ser 


Glu 


Arg 


Leu 


Gin 


Lys 


Val 


Phe 


Asp 


Glu 


Gly 


Tyr 


Tyr Asp 


Pro 


Thr 




610 










615 










620 










Asn 


Glu 


Pro 


Asp 


He 


Ala 


Tyr 


Pro 




Leu 


Phe 


Ser 


Tyr 


Phe 


Pro 


Lys 


625 










630 










635 










640 


Glu 


Ala 


Trp 


Arg 


Thr 


Gin 


Lys 


Leu 


Thr 


Arg 


Glu 


Leu 


He 


Asp 


Lys 


His 










645 










650 










655 




PU« 


Cys 


Asn 


Ala 


Pro 


Asn 


Gly 


Leu 


Pro 


Gly 


Asn 


Asp 


Asp 


Ala 


Gly Thr 








660 










665 










670 






Met 


Ssr 


Ala 


Trp 


Leu 


Val 


Tyr 


Ser 


Met 


Leu 


Gly 


Phe 


T^-r 


Pro 


Asp 


Cys 






675 










630 










605 








Pro 


Gly 


S^c 


Pro 


Thr 


Tyr 


Thr 


Lnii 


Thr 


Ser 


Pro 


Val 


Phe 


Pro 


Arg 


Val 




6S0 










655 










700 










Arg 


11- 


Arg 


Leu 


Asn 


Pro 


Gin 


Tyr 


Tyr 


Pro 


Gin 


Gly 


Glu 


Leu 


He 


He 


70S 










710 










715 










720 


Thr 


Th r 


Asn 


Thr 


Glu 


Asn 


Gin 


Pro 


Thr 


Asp 


Ser 


He 


Tyr 


He 


His 


Thr 










725 










730 










735 




Val 


Ser 


Leu 


Gly 


Asn 


Lys 


Thr 


Leu 


Pro 


His 


Gly Thr Ar-g 


His 


He 


Ser 








740 










745 










750 






His 


Ala 


Asp 


Leu 


Val 


Arg 


Cys 


Gly 


His 


Leu 


Arg 


Tyr 


Glu 


Leu 


Ser 


Asn 






755 










7 60 










7 65 








Acg 


Pro 


Arg 






























770 































(2) IMroWJATIOM FOR SEQ ID MO: 522 



(1) SEQUEJiCE CHAKACTERISTICS: 

tA) LENGTH: 776 amino acids 
65 IB) TYPE: amino acid 

ID) TOPOLOGY: linear 

(ill MOLECULE TYPE: protein 

70 tiiil HYPOTHETICAL: YES 

(Vl) ORIGIHAL SOURCL: 

(A J ORGAfnSM: Porphyromonas gingival is 

75 (i^;) FEATURE: 
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(A) NAIiE/KEY: ml sc_ feature 

(B) LOCATION 1. . .776 

SEQUFir/CE DESCRIt>•^IO^^: SEP ID 110:522 

Met Cys Lys lie Arg Phe Ser Leu 7.5»u Gin Ala Leu Val Val Cys L-u 
^ S 10 15 ' 

T.cu Phc Thr Ser The Ser Leu Gin Ala Gin Glu Glu Gly He Trp A^n 

20 25 30 

Thr Leu Leu Ala He His Lys Thr Glu Lys Ala Val Glu Thr Pro Lv 

35 40 45 

Ly3 Val Phe Ala Val Ala Asn Gly Val Leu Tyr Ser Val Gly Lys Glu 

50 55 60 

Ala Pro His Glu Ala Lys He Phe APp Arg Tie Ser Gly Leu Ser Asp 
tt ''^ ^5 80 

Thr Ser Val Sec Ser He Ala Tyr Ser Glu Gin Leu Zys Ser Leu Val 

B5 90 95 

He Tyr Tyr Ala Ser Gly A^n He Acp He Leu A^p Glu Ala Glv Aro 
nn 1-5 110 

Val Thr Acn Val Pro Ala Leu Lys Arp Asn He A^p Leu He Asp hyc 

115 120 125 

Thr Leu Asn Arg Leu Leu He Val Gly Arg Ala Tyr Leu Ala GI" 

130 135 140 

Gly Phe Gly Leu Ser Val Leu Asp Vai Ala Glu Ala Arg He Pro A] a 
l-*^ 150 

Thr Tyr Ala Lys Gly Thr T.y.-. Val Thr A^p Val Ala Ly.- Leu Asp A^n 

165 170 
Ar;p Arg Leu Leu Met Leu Lys Glu Gly Gin Leu Phe He Gly l.ys C.}u 
nn ISO 

Thr Asp Asn L«u Gin Asp Pro Ala Ala Trp Thr Ala Leu Ser Leu Asm 

195 200 205 

Leu Pro Met Gly Ser Val Thr Gly Leu Gly He Val Gly Glu Arp He 
210 215 220 

Qr; Xi'f ^^"^ ^"^P ^"^y "^^1 Tyr Val Ala Ala Asn Gin Ser 

230 235 240 

Phe Giu Pro Glu Leu Leu Leu Ser Ser Ser Ala Asp Ser Arg L-u T-r 

2^5 250 255 

Val Thr Asp Azg Gly Leu Phe He Cys Ala Glu Asn Arg He Tyr Ph« 
An 265 270 

-aU He Glu Lys Gly Arg Lys Thr Thr Gin Phe Pro He Ala Asp Val L-u 
2^5 280 285 

Gly Val Gly Ala Met Asn Glu Ser Asn Thr Ala Tyr He Ala L-u Gly 
290 295 300 

4C ^^^* Leu Leu Ala Glu Gly Ser Thr Ala Glu 

-^05 310 315 ^-^o 

Ala Met Pro Vs,l Ala Phe Asp Gly Pro Gly Asp Asn Asp Phe Tyr Glu 

325 330 3^5 

Met Arg Phe Ser His Gly Arg Leu Tyr Ala 7.1s S^r Gly Leu Trp Glv 
en •^'"'^ 34 5 3C.0 

50 Thr Asn Leu Met Gly His Ala Gly Met Va. Lys Leu Tyr A^p Gly Asn 

360 365 
Arg Trp Thr Asn Phe Asp Lys Lyt? Thr Val Gin Giu Gin Leu Glv Glr 
3^0 375 380 ^ ^ 

cc fij' '^^^ 11* A^P lie Ala Val S«r Afc-n Gly Asp 

390 395 

Pro Asp His Phe Phe Val Gly Thr Trp Gly Asn Gly Leu Phe Glu Fh- 

405 410 ^25 

Lys Asp Gly Lys Ala He Ala Arg Tyr Ser Gly Asn Glu Thr Ala H- 

''^O 425 430 



Ala Glu Cys Asn Pro Gly Asp Ala Arg Val Lys Ala He Ala Phe Asp 

^35 445 

Asn Lys Gly Asn Leu Trp Gly Thr Leu Gly Ala Val Gly T.ys Asn He 
450 455 

65 les '^^^ ^^'^ ^'"^ '^^'^ '^'""^ ^^'^ '^^''^ 

Arp VAl Ala Asn Leu Ala Ser Phe Gly Asn Met Tie Tie Leu Pro A^n 



4fl5 



4 90 495 



Gly Asp Lys Trp Val Asn He Leu His Arg Ser Gly Gly Ser Thr Arg 

7n ^^'^ 505 510 

/U Lys Gly Val Leu He Phe Asn Asp Arg Gly Thr Pro Glu Thr Thr S-r 
515 520 525 

Asp A^p Ser His Leu Tyr Val Glu Gin Phe Val Asn Arg Leu Gly AU 

75 ^^'^ '^^'^ ^^'^ '-'^^ vll Asp His Asn Gly 

^'^^ 550 555 
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Ser Val Trp Met Gly Ser Asp He Gi^- He Phe Gly Vai Tyr Asn Ala 

Ala GJy Val Leu Se^ Ser Thr Ser Thr Pro He Ala Val Arg Pro Val 

585 

Gly t-ly GU. Glu Pro Asn Leu Tyr Tyr Val Leu A^p Lys v;i Thr Val 

600 

Thr Asp He Vai Val Asp Lys Leu Asr. His Lys Trp Val Ala Thr Gin 

615 520 

Gly Thr Gly Leu Tyr Leu Leu i>er Glu Asp Cys Ser Lys He Leu Ala 

635 cAn 

Gin Phe Thr Val Giu Asn Ser Pro Leu Leu Ser Asn Asn He Leu Scr 

Leu Ala Leu A^n Asp Asp A.sn Gly Leu Leu Tyr He Gly Thr Alf Asp 

660 665 

Gly Leu Met Thr Phe Gin Thr Gly Thr Gly Ser Gly Ser Ala Ser Glu 

^'^ 680 

Leu Asp Gly Val Tyr Va] Tyr Pro Asn Pro Leu Arg Pro Glu Tyr Pro 

695 700 

Asp Gly Vai Thr He Ala Gly Leu Gin Ala Gly Cys Ser V«l Lys He 

Thr Asp Thr Thr Gly Arg Leu Leu Tyr Gin Thr Glu Ser Val Thr Thr 

qiu Val Lys Trp Asn Ala Arg Gly Ala Asp Gly A.-n Arg Val Ala Ser 

140 T.1C. 



Gly Val Tyr Ala Val Ala Val Tyr Asp Pro Val Ser Lys Lys Ser Lys 

760 7f;5 
Leu He Arg Phe Ala Val He Arg 

(2) iriFOKHATlOH FOR SEQ TD HO: 523 

(1) sequence: CHARACTERISTICS: 

(A) LEtlGTH: 1150 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) H-xPOTHETlCAL; YES 

(vi) ORIGIMAL SOURCE: 

(A) ORGAT-ilSId: Porphyroinonas gingiva lis 

t ix ) FEATURE : 

(A) NAT IE/KEY: misc_feature 
(3) T/x:ATiai 1...1158 

(Xi) SEQUEHCE DESCRIPTION: SEQ ID NO:523 

net Lys Arg lie Leu Pro He Val Ala Phe Leu Ser Leu Phe Leu Ala 

Leu Ala Leu Pro Ala Lys Ala Gin Arg aL Met Gly Lys Thr Ma Asp 

25 30 

Arg ser Leu Met Ala Ser Gly His Trp Val Lys He Arg Val Asp Ala 

4 0 4 5 

Ser Gly Val Tyr Arg Leu Thr Asp Glu Gin Leu Arg Ala Asn Gl- Ph- 

t 60 

Ser Asp Pro Ser Lys Vai Gly Val Phe Gly Tyr Gly Gly Gly Val Leu 

Pro Glu Asp Leu Ser Arg Tie Thr Thr Asp lip Leu Pro Pro Val Fro 

^ 90 95 

Val Leu Arq Gin Gly Asn Ala Leu Tyr Ph^ Tyc Ala Val Gly Pro Val 

lOS iij 

Thr Trp Phe Tyr Asn Pro Ala Lys Thr Thr Net Glu His Thr Val Asn 

120 125 

Thr Tyr Ser Thr His Gly ^'r Tyr Ph, Leu Ser A.p Ala AU Gly Ala 

Pro L«u Gin net. Sor Gin Tyr Thr Gly Gly Gly aH Ser Ala Glu Ala 



Leu Il« Arp Tyr Tyr A^p Glu Leu Met Leu h" Glu' oln Glu L«„ Tyr 



16S 170 



Ser Pro Lys cUi Ser Gly Arg Asp Leu Tyr Gly Glu Ser Phe Ser Ala 
Val A.n Thr Arq Thr Val Ly= Phe Pro L«u Arg Gly Asn The Arg Ser 



PCT/AU9aAH023 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



Ser Gly Glu Leu Gly Thr V;j1 phe Ser Tyr 11- Ala Lys Ala Arg Sec 

210 215 220 

Ala Gly Gly Gly Arg Glu iiet Ser Leu Ser Ala Asn Gly He Leu He 
225 2^0 23b 240 

Phe Ser Asp Pro Phe Ser Hr^t Thr Ssr Asn Glu Val Sec Asn Ser Tyr 

245 2S0 2SS 

Leu Ala Gly Ly^ Lya Arg Arg Leu Tyr His Ser Thr Pro Met Asn Ser 

260 265 270 

Leu Vai Asn Glu Leu Arg Leu Asp Ala fiisn Tyr i*-r Hat Thr Gly Asp 

2 280 2B5 

Ala Val Asn Leu Asp the He Glu Val Ala Thr Gin Aen Aso LfiU Arg 

290 295 300 

Tyr Asp Gly Ala Pro Met His He Arg Arg Phe Ser Asn Leu Pro Val 
305 310 315 320 

Leu Gly Gly Glu Ser Cys Arg Phe Val He Ser Glu Val Pro Glu Ser 

325 330 335 

Lteu Val Val Leu Gin Ala Asn Ser Ser Leu Thr Ala Ser Leu Val Pro 

340 345 350 

Val Ly-? Thr Val Gly Asp Lys Thr He Glu Phe Val Ala Pro Pro Lyc 

355 360 365 

Gly Gin Ai»p Arg Arg Thr He Asn Thr Phe Tyr Ala Val Asp Leu Ser 

3V0 375 380 

Gin Ala Ser Ala Pro Glu He Leu Gly Ala Val Pro Asn Gin Asn Leu 
385 390 395 400 

His Gly Glu Glu He Pro Asp Leu He He Val Ser Thr Gin Ala Leu 

405 410 415 

Leu Leu Glu Aia Asp Arg Leu Ala Thr Tyr Arg Arg Glu Lys Asn Gly 

420 425 430 

Leu Lys Val Leu Val Val Leu Gin Glu Gin Val Fhe Asn Glu Phe Ser 

435 440 445 

Gly Gly Thr Pro Asp Ala Thr Ala Tyr Arg Leu Phe Ala Lys Met Phe 

450 455 450 

Tyr Asp Arg Trp Lys Ala Asn Ala Pro Val Gly Glu Thr Phe Pro HeU 
465 470 475 480 

Gin Met Leu Leu Phe Gly Asp Gly Ala Hie Aep Asn Arg Lys Val Ser 

405 490 495 

Val Ala Trp Gin T.ys Pro Tyr Leu Gin Gin Thr Glu Phe Leu Leu Thr 

500 505 510 

Phe Gin Ala Val Asn Ser Thr Asn Val Asn Ser Tyr Val Thr Asp Aj^p 

515 520 525 

Tyr Phe Gly Leu Leu Aep Asp Gin Pro Ala Ser Val Asn He Gly Trp 

530 535 540 

Arg Asn Tyr Asn liet Ala Val Gly Arg Phe Fro Vsl Arg Thr Pro Ala 
545 550 S55 560 

Glu Ala Arg He Ala Val Asp Lys Thr He Arg Tyr Glu Glu Asp Arg 

565 570 b/b 

Glu Ser Gly Ala Trp Arg Ho Arg Ala Cys Phe Ala Ala Asp Asn Gly 

580 585 590 

Asp Lys His Ala Thr Glu Thr Ser Arg Leu lie Asp Thr Val Lys Arg 

595 600 60S 

Tyr Ala Pro Ala He Met Pro Val Arg Ala Phe Gin Asp Val Tyr Pre 

610 615 620 

His Val He Glu Asn Gly Leu His Ser Ho Pro Gly Ala Lys Lys Lys 
625 630 635 640 

Met Leu Glu Thr Leu Gin Ser Gly He He Leu Leu Asn Tyr Ala Gly 

645 650 655 

His Gly Gly Pro Ala Gly Trp Ser Asp Glu His Lou Leu Thr Leu Asn 

660 665 670 

Asp He Hiii Lys Phe Asn Tyr Lys His Met Pro 11- Trp He Thr Ala 

675 680 685 

Thr O/s As-p Phe Ala Asn Tyr Asp Ser Gin* Thr Thr Ser Ala Gly Glu 

690 695 700 

Glu Val Phe Leu His Glu Lys Ser Gly 'L'hr Pro lie Met Phe Ser Thr 
705 710 715 

Thr Arg Val Val Tyr Asn Thr Gin Asn Glu T.ys He Asn Gly Phe Met 

725 730 735 

Leu Arg Arg Hot Fhe Glu Lys Ala Lys Asp Gly Aro Tyr Ary T)ir Met 

740 745 ' 7b0 

Gly Glu He He Arg Ser Ala Lys Gin Gly llet L^u Ser Thr Val Phe 

•755 760 765 

Pro Asp Ser Tie Asn Gin Leu Ser Phe Phe Leu (let Gly Asp Pro Sor 

770 775 780 

Val Arg Met Asn Leu Pro Thr His Lys Val Gin Leu Thr Ala He Asn 
705 790 795 800 

Gly Gin Asp Pro Glu Gly Gin Tyr Gly Thr He Met Leu Lys Ser Leu 
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B05 810 015 

Glu Arg Val Ala Leu Lys Gly Lys Val Thr Asp Glu Lys Gly Thr Ph- 

820 825 830 

Asp Glu Thr Phe Ser Gly Ly.^ Val Phe Leu Thr Val Phe Asp Gly Arg 

835 840 845 

Lys Lys Met Thr Ala Leu Glu Glu Glu Gly Asn Arp Leu Ser l.^^u Val 

850 855 860 

Tyr Tyr A,Tp Tyr Pro Asn Val Met Tyr Ala Gly lie Ala Glu Val Lys 
870 875 BOO 

Asp oly Leu Phe Glu Thr Ser Phe He Val Pro Lys Asp Val A*?n Tyr 

885 890 895 

Ser Glu His Glu Gly Arg He Asn Leu Tyr Ala Tyr Asn Glu Ser Thr 

900 90S 
Lys Ala Glu Ala Met Gly Val Asp Phe Ser He Arg Val Gin Pro Glv 

915 920 925 

He Pro Asp Glu Val Thr Glu Asp Asn Thr Pro Pro Glu He He S-r 

930 935 
Cys Phe Leu Asn Asp Ser Thr Phe Arg Ser Gly Asp Gl»j Val Asn Pro 

950 955 
Thr Pro Leu Phe Met Ala Glu Val Phe Asp Leu Asn Gly He Asn 11 = 

965 970 975 

Thr Gly Scr Gly Val Gly His Asp He Thr Leu Cys He Asp Gly Arg 

?90 985 
Ala Asp Leu Thr Tyr Asn Leu Asn Ala Tyr Phe Thr Ser Ser Ala Thr 

1000 1005 

^""^ ^"if.^^"'* ^""^ ^^"^ ^^'^ "'^ Ala Leu Ala Giu 

1010 1015 10^0 

Gly Asp ni3 Thr Ala Arg Leu Thr Val Trp A^p He Phe Acn Asn Ala 
1025 1030 1035 1040 

Val Kis His Asp Phe Scr Phe Arg Val Val Asp Gly He Ala Pro Asp 

1045 1050 1055 

Val Ala Asp Val He Leu Phe Pro Asn Pro Val Arg Glu Ser Ala Thr 

1060 1065 1070 

Phe Arg He Phe His Asn Arg Pro Gly Ser A;?p Leu Asn Val Ala Val 

1075 1080 1085 

Glu He Tyr Asp Phe Thr Gly Arg Leu Val Asn Ser Leu Pro Val Lys 

10?0 1095 1100 

Thr Tyr Ser Ser Sor Tyr Gly Glu Pro He Glu He Lys Trp Asp L-u 
tz \. 1115 1120 

Thr Ser Lys Tyr Gly Val Lys He Gly Asn Gly Phe Tyr Leu Tyr Arg 

1125 1130 1135 

Cys Val Val Asn Sor Pro Gly Gly Gin Thr Ala Ser Met Ala Lys Lys 

1110 1145 iisS 

Met He Val Val Gly Gin 
1155 

(2) IMFORilATIOH FOR SEQ ID NO: 52 J 

<n SEQUEI^CE CHARACTERISTIC'S: 

(A) LENGTH: 961 amino acids 

(B) TYPK: oral no acid 
(Dl TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(vi) CRIGIHAL SOURCE: 

(AJ ORGANISM: Pcrphyromonas gingivaiis 

iix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION 1. . . 961 

(ill) SEgUENCE DESCRIPTION: SEQ ID MO: 524 

Met Lys' Lys Leu Phe Pro Leu L«ti Leu Leu He Leu Ser He Leu Val 

, 5 10 IS 

Gly cys Gly Lys Lys Glu Lys His Ser Val Thr Giu He Ala Arg Gin 

20 25 30 

Lyi. Lys Arg Tl- Thr Ala Leu Leu Tyr Glu Lys Giu Leu Pro Thr Asp 

35 40 45 

Ser Val Lys Gin Leu Tyr Glu Asn Ser Val Gin Asn Lys Asn Leu Val 

55 60 

Gly Gin ^:et Leu Phe Ala He Glu Vsl Gly Lys Arg Met Arg Asn Met 
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65 70 75 80 

Ser Gin Tyr Thr Asp AIq ll-t r,eu Tyr His Gin hys Gly Leu Asn Aia 

85 90 OS 

Ala Leu Asn Leu Arg Asp Thr He Val Ala Ala Gin Ala Trp Asn Hi*' 

105 110 
Leu Gly Thr Asp Ser Arg Arg Tie Gly Ala Leu Ala Glu Ala Ser Asp 

115 120 125 

Tyr His Tyr Lys Ala Leu S*r Leu He Glu Ser Phe Ser Gly Asn Gin 

135 140 
Asn Arg Pro Ala He Lys Ala Arg Ser Ala Ala Leu Asn Glv 11- Gl- 
ISO 155 ■ ig'o 

Asn He Asn Leu Glu Leu Gly Tyr His Asp Glu Ala Glu Lys Asn Ph- 

16S 170 175 

Leu Lys Ala Leu Gin Gly Glu Lys Glu Leu Asp Ser Pro Leu Gly Gin 

IflO 105 190 

Ala He Asn Tyr Ala Asn Leu Gly Arg He Tyr Arg Gin Arg Lys Glu 

195 • 200 205 

Tyr Asp Lys Ala Arg Thr Tyr Phe Leu Leu Ser Leu Glu Gin Asn Asn 

210 215 220 

Met Ala Glu Asn Leu Met Gly He Gly Leu Cys Ser He Asn Leu Glv 
22S 230 235 ?40 

Glu Val Asp Glu Glu Lys Gly Asp Tyr Gin Lys Ala Leu Gin Glu Tyr 

245 250 255 

Ala Thr Ala Tyr Lys Leu Met Glu Gin Leu Sir Aco Arg Trp His Trn 

S^"^ 265 ' 270 

Leu Asn Ser Cys He Pro Met Ala Arg He Asn Leu Lys Gin Glv Asn 

275 280 285 

Glu Arg Leu Tyr Gin His Phe He Ser Leu Ala Glu Gly Thr Ala Lv- 

290 295 300 

Glu He Asn Ser Thr Ser His Leu He Glu He Tyr Asn Leu Gin Tyr 
305 310 315 

Glu Asn Leu Glu Arg Lys Lys Glu Tyr Lys Gin Ala Lou Glu Ala Ph« 

325 330 
Cys Leu Ser Lys Thr Leu Ser Asp Ser Met Ser He Ala His Lys Val 

340 345 
Ser S-r He Gin Glu Thr Arg Phe Asn Tyr Glu Arg Asn Lys S*r Gin 

355 360 365 

Lys Glu Leu Glu Glu He Gin Gin Val Ser Lys Ala Lys Gin Glu Lys 

370 375 380 

Ser Lys Phe He Leu Leu Scr Thr Leu Fhe Ala L»u Ph» H- Ssr II- 
390 395 " " " 4OO 

Leu Leu He Ser Val Leu Thr Tyr Ala Tyr Arg Gin Gly Lys Lys His 

405 410 415 

Asn Lys Leu He Lys Glu Thr Asp Lys Leu Arg Ser Gly Phe Phe Thr 

420 425 430 

Gly He Thr His Glu Phe Arg Thr Pro He Thr Val He Gin Gly Leu 

435 440 445 

Asn Glu Lys tJet Ser Ser Ser Pro Asp Leu Gin Ala Ser Asp Ara Thr 

450 455 460 

Glu Leu His Lys He He Asp Arg Gin Ser Sor His Met Leu Asn Leu 
470 475 480 

Val Asn Gin Leu Leu Asp He Cys Lys He Arg Ser Gly Val Ser Thr 

485 490 495 

Pro Glu Trp Arg Asn Gly Asp He Val Ser Phe Val cin He L-u II- 

SOO 505 510 

Asp Ser Phe Ala Pro Tyr Ala Gin Ala Gin Asp He Thr Leu Glu Leu 

515 520 525 

Gin Pro Glu Ser Lys Pro He Vil Val Asp Phe Val Pro S-r Tvr L*u 

530 535 540 

Gin Lys Tie He Ser Asn Leu Leu Ser Asn Ala He Lys Tyr Ser Leu 
-^^^ 550 555 3^^,, 

Ala Gly Gly Arg Val Val H« Ser Leu Ala Lys Thr Lys Asn Glu Lys 

565 570 575 

Asa Leu He He Arg Val Ala Asp Asn Gly He Gly He Asp Lys Thr 

580 5B5 590 

Asp Gin Ala His He Phe Asp He Phe Tyr Arg Gly Gin Ser Ala Thr 

5?5 600 
Glu Lys HK^ Gly Ser Gly Val Gly Leu Ser Phe Thr Asn He Leu Val 

610 61S 620 

Glu Asn Leu Arg Gly Thr He Lys Val Glu Ser Gin Pro Gly Lys Gly 
t^-^ , 630 635 640 

S-r Ala Phe Thr He Ser He Pro Thr Gin Asn Gin Ser Ser Ser Ala 

6-'5 S50 655 

Glu He L^=u Pro Trp Leu Fro Ser Ser Asp Asp He Val Met Pro Val 
650 665 670 
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His He Ala Pro A^p Asp Ser Pro Thr Ser Pro Met V^l Ala Ala Leu 

680 gg5 
A.n Mis Arq Ph* Glu Asp Glu Arg Pro Thr He L'M Leu Val elu A3p 
c ^-^ 700 

5 Asn Lys- A«p He A^n Leu Leu Val Lys Leu Leu Leu Cy. A.p Ar, Tyr 

A»n Val Leu Ser Ala Ala As,, Gly Lys Glu Gl'y He Ala Leu Ala ^hr 

^25 7T(i 
Glu Hi. He Pro A-^p He He He Thr A,p H, H,t Met Pro He Met 

Asp Gly He Glu Met Thr He Arg Met I.y., Gin Ser Pre "u Leu Cy== 

760 

H.s He Pro He Val Ala Leu Thr AU Ly. Ser Thr Gl^ Gin A=p Arg 
15 Leu Glu Gly He Ly« Ser Gly Val Val Ser Tyr Leu Cy. Lys Pro Phe 

795 800 



40 



50 



60 



65 



Scr Pro Glu r.lu Leu Leu H.t Arg He Glu gIh Leu Leu Ly. A.p Arg 

805 .-t ^ 



20 820 ''''' "''^ ""^'^ *--P 



B2S 



Pro Glu Clu Glu Pro Gin Pro He A.-p A.-p Ser Ser Met Gin Phe Leu 

840 845 
Leu Ala Ala Lys A.p Ala Val Ser Gly Gly He Lys Gin Asn Pro Asp 
nc 860 

25 Phe see Ala Gin Asp Leu Ala Glu Lyc Met Cys Hct Scr Pro Ser Gin 

Leu Asn Arg Lys Leu Thr Ser Val v,i Gly ?ys Ser Thr He Gly ?yr 

890 no^. 

30 III '•^^ ^'-^ '^"« "P Glu 

^ ^ 905 qi n 

ser Lys Asn He Ser Asp He Ser He Glu Ala Gly Phe Ser Asp Pro 

920 925 

Ala Tyr Phe Ser Arg Thr Phe Lys Arg Tyr llet Asn Cy.- Ser Pro Ser 

oc 935 9«1 0 

J5 Gin Tyr Arg Gin Lys Leu Leu Ala Het Pro Gly Ser Asp Lys Glu Thr 



Val ^^"^ 960 



(2) rtlFORilATION fOR SEQ ID HO: 525 



(i) SEQUENCK CHARACTERISTICS: 

(A) LENGTH: 556 amino acids 

(B) TYPE: amino acid 
10) TOPOLOGY: linear 



(ill IfOLECULE TYPE: protein 

(iii) HYPOTHETICAL: YES 

(Vi) ORIGINAL SOURCE: 

(A» ORGATJISM: Porphyromonas gingivalis 



_ (Ix) FEATURE: 

(A) MAHE/KEY: misc featur- 
es) LOCATION 1. . .556 

(xi) SEQUENCE UESCRI PTlOt^ : SEQ ID NO: 525 

Met Ly^ Lys Leu Leu Gin Ala Ly« Ala Leu He Leu Ala Leu Gly Leu 

Phe Gin Len Pro Ala He Ala Gin Thr Gin M.t Gin Ma A..p ^rg Thr 

Asn Gly Gin ^he Ala Thr Clu Glu Net Gin Arg Ala Phe gL Glu Thr 

A.n Pro Pro Ala Gly Pro Val Arg Ala He Ala Glu ^yr Glu Arg Ser 

Ala Ala V.l Leu Val Arg Tyr Pro Phe Gly He Pro Het Glu Leu He 

70 Ly. Glu r..u Ala Ly. A.n Asp Lys Val Ho Thr He Val Ala Ser Glu 

ser Gin Lys A^n Thr Val He Thr Gin Tyr Thr Gin Ser Gly vfl A^n 

105 iin 

75 ^" u's '^^P Ly. Thr Arp Ser Tyr Trp Thr 

120 125 



SUBSnrUTE sheet (Rule 26) (RO/AU) 



-iP^ii tp^i ir^ tr=ii rt« ""'^b ti=-ji "'"ii ip^nt if^i -ir^h 

U Z3 O «!L U^Cii CHi Dj -iUi O CTJ -Q rui U 



15 



25 



55 



60 




WO 99/29870 PCT/AU98/01023 

483/490 

Arg Asp Tyr The Gly Trp Ph« Ala Met Tyr Asp Thr Asn Lys Val Glv 

130 135 140 

Leu Val Asp Phe He Tyr Asn Arg Pro Arq Pro Asn Asp Asp Glu Ph» 
_ 145 ISO 155 160 

5 Pro Zys Tyr Glu Ala Gin Tyr Leu Gly He Glu MeL Phe Gly Met Lys 
165 170 175 

I.eu Ly^ GJn Thr Gly Gly Asn Tyr Met Thr A^'p Gly Tyr Gly Ser Ala 
180 185 190 

^_ Val Gin Ser His He Ala Tyr Thr Glu Asn Ser Ser Leu Ser Gin Ala 
10 195 200 205 

Gin Val A5rn Gin Lys Met Lys Asp Tyr Leu Gly He Thr His His Asp 

210 215 220 

Val Val Gin Asp Pro Asn Gly Glu Tyr He Asn His Val Asp Cys Trp 
225 230 235 240 

Gly LV3 Tyr Leu Ala Pro Asn Lys He Leu He Arg Lys Val Pro Asp 

245 250 255 

Asn His Pro Gin His Gin Ala Leu Glu Asp Net Ala Ala Tyr Phe Ala 

260 265 210 

Ala Gin Thr Cys Ala Trp Gly Thr Lys Tyr Giu Val Tyr Aro Ala Leu 
20 275 280 285 

Ala Thr Asn Glu Gin Pro Tyr Thr Acn Ser Leu He Leu Asn Asn Arn 

290 295 300 

Val Phe Val Pro Val Asn Gly Pro Ala Ser Val Asp Asn Asp Ala Leu 
•■^05 310 315 320 

Asn Val Tyr Lys Thr Ala Met Pro Gly Tyr Glu He Tie Gly Val Ly-^ 

325 330 335 

Gly Ala Ser Gly Thr Pro Trp Leu Gly Thr Asp Ala Leu His Cys Arg 

340 346 35C 

Thr His Glu Val Ala Asp Lys Gly Tyr Leu Tyr He Lye His Tyr Pro 
30 355 360 365 

He Leu Gly Glu Gin Ala Gly Pro Asp Tyr Lys He Glu Ala Asp Val 

370 375 380 

Val Ser Cys Ala Asa Ala Thr He Ser Pro Val Gin Cys Tyr Tyr Arq 
305 390 395 4OO 

JO He Asn Gly Ser Gly Ser Phe Lys Ala Ala Asp Met Thr Met Giu Ser 
405 410 415 

Thr Gly His Tyr Thr Tyr Ser Phe Thr Gly Leu Asn Lys Asn Asp Lys 

420 425 430 

Val Glu Tyr Tyr He Ser Ala Ala Asp Asn Ser Gly Arg Lys Glu Thr 
40 435 440 445 

Tyr Pro Phe He Gly Glu Pro Asp Pro Phe Lys Phe Thr Cys Met Asn 

450 455 460 

Glu Thr Asn Thr Cys Thr Val Thr Gly Ala Ala Lys Ala Leu Arg Ala 
465 470 475 480 

45 Trp Phe Asn Ala Gly Arg Ser Glu Leu Ala Val Ser Val Ser Leu Asn 
485 490 495 

He Ala Gly Thr Tyr Arg He Lys Leu Tyr Asn Thr Ala Gly Glu Glu 

500 505 510 

Val Ala Ala Met Thr Lys Glu Leu Val Ala Gly Thr Ser Val Phe S-r 
5U 515 520 525 

Met Asp Val Tyr Ser Gin Ala Pro Gly Thr Tyr Val Leu Val Val Glu 

530 535 540 

Gly Asn Gly He Arg Glu Thr Met Lys He Leu Lys 
545 550 £55 



(2) II I FORMATION fc*OR SHQ ID NO: 52 6 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 28 amino acids 

(B) TTPE: amino acid 
(0) TOPOIXXiy: linear 



(iij MOLECULE TYPE: protein 

65 till) HYPOTHETICAL: YES 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Porphyromonas gingi val is 

70 <ix) FEATURE: 

(A) MAI tE/ KEY: mlsc_feature 

(B) LOCATION 1. . . 428 



75 



{y,i} SEQUENCE DESCRIPTION: SEQ ID MO: 526 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



^^^^^ y 
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Lys 








Lys 


Lys 


lie 


Leu 




i.le 


lie 


Ala 


Leu 




Thr 


1 


















1^0* 










1 5 




t tet 


Gly 


■ 

11 LS 


Ala 


Va 1 


tain 


Ala 


Gin 


Phe 


Va 1 


Pro 


Ala 


Pro 


Thr 


Thr 


Ql> 








2n 










25 










30 






11- 


Arg 


H"t 


S*i£ 


vai 


Thr 


Thr 


The 


Lys 


Al a 






Glu 


Lys 


II** 








'* 5 










40 


















Lsu 


Leu 


Val 


His 


Ser 


lie 


Glu 


Lys 


Lys 


Gly 


lie 


Trp 


He 


Asp 


Leu 


Ain 




50 










55 










60 










Gly 


A£p 


Ala 


Thr 


Tyr 


Gin 


Gla Gly 


Glu 


Glu 


He 


Thr 


Val 


Phe 


Asp 


Glu 


65 










7 0 










75 










80 


Ala 


Tyr 


His 


Glu 


Tyr 


The 


He 


Gly Thr 


Gin 


Thr 


Leu 


The 


lie 


Tyr 


Gly 










85 










90 










95 






Thr 


Thr 


Arg 


Lsu 


Gly 


Cys 


Arg 


Ser 


Thr 


Gly 


Al a 


Thr 


Ala 


Val 


Asp 








100 










105 










110 






Vai 


Th r 


Lys 


Asn 


Pro 


Asn 


Leu 


Thr 


Tyr 


Leu 


Ala 


Cys 


Pro 


Lys 


Asn 


Asn 






lis 










120 










125 








Leu 


Lye 


Ssr 


Leu 


Acp 


Leu 


Thr 


Gin 


Asn 


Pro 


Ly5 


Leu 


Leu 


Arg 


Val 


Trp 




130 










135 










140 










Cys 


Arp 


Ser 


Asn 


Glu 


lie 


Glu 


Ser 


Leu 


Asp 


Leu 


Ser 


Gly 


Asn 


Pro 


Ala 


145 










150 










155 










160 


Leu 


lie 


Il« 


Leu 


Gly 


Cys 


Asp 


Arg 


Asn 


Lys 


Leu 


Thr 


Glu 


Leu 


Lys 


Thr 










165 










170 










175 




Asp 


Asn 


Asn 


Pro 


Lys 


Leu 


Ala 


Ser 


Leu 


Trp 


cys 


Ser 


Asp 


Asn 


Asn 


Leu 








180 










185 










190 






Thr 


Glu 


Leu 


Gl U 


Loll 


Ser 


Ala 


Asn 


Pro 


Arq 


Leu 


Asn 


Asp 


Leu 


Trp 


C-jS 






1 95 










7.00 










205 








Phe 


Gly 


Asn 


Arg 


lie 


Thr 


Lys 


Leu 


Asp 


Leu 


Ser 


AlD 


Asn 


Pro 


Leu 


Leu 




210 










215 










220 










Val 


Thr 


Leu 


Trp 


Cyj? 


Ser 


Asp Asn 


Glu 


Leu 


Ser 


Thr 


Leu 


Asp 


Leu 


Ser 


22b 










230 




















240 


Lys 


Asn 


Ser 


Asp 


Val 


Ala 


Tyr 


Leu 


Trp 


Cys 


Ser 


Ser 


Asn 


Lyr 


Leu 


Thr 










24 5 










250 










255 




Ser 


Leu 


A£^n 


Leu 


Ser 


Gly 


\Al 


Lys 


Gly Leu 


Ser 


Val 


Leu 


Val 


Cys 


His 








260 










265 










270 






Ser 


Asn 


Gin 


lie 


Ala 


Gly 


Glu 


Glu 


Met 


Thr 


Lys 


Val 


Val 


Asn 


Ala 


Leu 






275 










280 










285 








Pro 


Thr 


Leu 


Ssr 


Pro 


Gly Ala 


Gly Ala 


Gin 


Ser 


Lys 


Phe 


Val 


Val 


Val 




290 










295 










300 










Asp 


Leu 


Lys 


Asp 


Thr 


Asp Glu 


Lys 


Am 


Il<= 


c-'c 


Thr 


Val 


Lys 


Asp 


Val 


305 










310 










315 










320 


Glu 


hys 


Ala 


Lys 


Ser 


Lys 


Asn 


Trp Arg Val 


Phe 


A^p 


Phe 


Asn 


Gly 


Asp 










325 










330 










335 




iJer 


Asp 


Asn 


net 


Leu 


Pro 


Tyr 


Glu 


Gly Ser 


Pro 


Thr 


Ser 


Asn 


Leu 


Ala 








34 0 










345 










350 






Val 


AFP 


Ala 


Pro 


Thr 


Val 


Arg 


lie 


Tyr 


Pro 


Asn 


Pro 


Val 


Gly 


Arg 


Tyr 






3S5 










360 










3S5 








Ala 


Leu 


Val 


Glu 


lie 


Pro 


Glu 


Ser 


Leu 


Leu Gly 


Gin 


Glu 


Ala 


Ala 


Leu 




310 










37 5 










380 










Tyr 


Asp 


I let 


Asn 


Gly 


Vai 


Lys 


Val 


Tyr 


Ser 


Fhe 


Ala 


Val 


Glu 


Ser 


Leu 


385 










390 










3?5 










400 


Arg 


Gin 


Asn 


lie 


Asp 


Leu 


Thr 


His 


LeM 


Pro 


Asp 


•31 y 


Thr 


Tyr 


Phe 


Phe 










405 










410 










415 




Arg 




Asp 


Asn 


Tyr Thr 


Thr 


Lys 


Leu 


lie 


Lys 


Gin 











420 425 



(21 INFOWtATlW FOR SEQ ID HO; 527 

(1) SEQUFUCE CHARACTERISTICS: 

(A) LEU/GTH: 310 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGy: linear 

(li) MOLECULE TYP3: protein 

{ill) HYPOTHETICAL: YES 

(vi) ORIGIIIAL SOURCR: 

(A) ORGAIIISI-I: Porphyromonas gingiva lis 

(IX) FEATURE: 

(A) MAI IE/KEY: misc feature 

(B) LOCATTOri 1. . .310 

(y.l) SEQUENCE DESCRI PTIO(I : SEQ ID NO: 527 



SUBSTITUTE SHEET (Rule 26) (RO/AU) 



D iCm ja^«i> m ti. J? o cj^ ^j^ 
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70 



75 
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Met 


Arg 




Thi 


He 


He 


Phe 


Cys 


Leu 


Leu 


Leu 


ALa 


Leu 


Phe 


Gly Cys 


1 








5 










10 










IS 


Ser 


Trp 


Ala 


Gin 


Glu 


Arq 


Val 


Asp 


Glu 


Lys 


Val 


Phe 


Ser 


Ala 


Gly Thr 








20 










25 










30 




Sftr 


He 


Phe 


Arg 


Gly 


He 


L'ju 


Glu 


Lyf 


val 


Lys 


Ala 


Pro 


Leu 


Met Tyr 






.15 










40 










45 






Gly Asp 


Arg 


Glu 


Val 


Trp 


Gly 


Met 


Ala 


Arg 


Ala 


Ser 


Glu 


Asp 


Phe Phe 




50 










55 










60 








Phe 


He 


Leu 


Pro 


Val 


Thr 


Arp 


Aap 


Leu 


Thr 


Pro 


Val 


Leu 


Phe 


Tyr Asn 


65 










70 










75 








eo 


Arg 


Lfiu 


Thr 


Asn 


Glu 


Pro 


Cys 


Phe 


Val 


Ser 


Asp 


Gin 


Gly 


He 


Thr Glu 










B5 










90 










95 


Tyr 


Phe 


Lys 


Phe 


Ala 


Gin 


Glu 


Cly 


Asp 


Tyr 


He 


Glu 


Val 


Glu 


Gly Ser 








100 










105 










110 




Ser 


Val 


Phe 


Met 


Ala 


Asn 


Leu 


Leu 


Tyr 


Tyr 


Arg 


Phe 


Phe 


Pro 


Thr Arg 






13 S 










120 










125 






Il-s 


ThL 


Ser 


Tyr 


Asn 


Al<i 


Pro 


He 


Glu 


Giy Val 


Val 


Ser 


Lys 


Thr Gly 




130 










135 










140 








Asn 


Pro 


Ala 


Phe 


Thr 


He 


Pro 


Met. 


Leu 


Pro 


Gly Val 


Ser 


A^p 


Cys He 


145 










150 










155 








160 


Glu 


He 


Ser 


A^n 


Aen 


Arg 


Lys 


Val 


Phe 


Leu 


Thr 


Asn 


Gin 


L-u 


Gly Val 










16b 










1.70 










175 


Val 


Aeii 


Ilo 


Thr 


Asp 


Gly 


Met 


Glu 


Pro 


Pro 


He 


Ho 


Ala 


Gly 


Val Ser 








180 










105 










190 




Ala 


Uer 


Tyr 


Gly 


Ser 


Ser 


Val 


Arg 


Val 


Tyr 


Gly 


His 


Val 


Ser 


Gin Arg 






195 










200 










205 






Trp Asp 


He 


Tl ft 


Gly 


His 


cys 


Tyr 


Leu 


Aep 


He 


Tyr 


Pre 


Thr 


Asn Cys 




210 










215 










220 








Tyr 


Pro 


Leu 


Ser 


Thr 


Lys 


Pro 


Val 


Ala 


Gly Asp 


Asp 


Glu 


Val 


Phe Val 


225 










230 










235 








24 0 


Lys 


Gin 


Gin 


Gly 


Arg 


Gin 


He 


Glu 


He 


Asp 


Ser 


Asn 


Ser 


Pro 


He Val 










245 










250 










255 


Gin 


Val 


Val 


Val 


Tyr 


Asp 


Leu 


Glu 


Gly 


Lys 


Ser 


Val 


Phe 


Arg 


Lys Arg 


















265 










2 to 




f4et 


Thr 


Glu 


Asn 


Ala 


Tyr 


Thr 


Leu 


Scr 


Phs 


Arg 


Ala 


Pro 


Met 


Leu Gly 






275 










280 










285 






Phe 


net 


Thr 


He 


Met 


He 


Glu 


Thr 


Gin 


Asn 


Ser 


He 


He 


Asn 


Lys Lys 




2 90 










295 










300 








L«;u 


Asn 


Val 


Thr 


CJln 


Leu 




















305 










310 




















{2) 


IHkOWIATIOH 




SEQ 


ID MO: 526 















10 



15 



20 



25 



30 



35 



40 



(1) SEQUEUCE CHARACTERISTICS: 
45 (A) LEIIGTH: 4 05 amino acids 

(B) XypE; amino acid 
(D) TOPOLOCy: linear 



HI) MOLECULE TIt'PE: prorein 

Uii) HYPOTHETICAL: YES 

tvi) ORIGIHAL SOURCE: 

tA) ORGANISI-I: Porphyromonas gingival is 

(Ix) FEATURE: 

(A> HAllE/KEY: misc feature 
(0) LOCATIOi^ 1. . .405 





(xi) 


SEQUENCE DESCRIPTICH: SEQ : 


ID NO: 520 










Met 


Lys 


Lys 


Thr 


Thr 


He 


Ile 


Ser 


Leu 


He 


Val 


Phe 


Gly Ala 


Ph- 


Phe 


1 








5 










10 










is 




Ala 


Ala 


Val 


Gly 


Gin 


Thr 


Lys 


Asp 


Asn 


Ser 


Ser 


Tyr 


Lys 


Pro 


Phe 


Ser 








20 










25 










30 






Lys 


Glu 


Asp 


He 


Ala 


Gly 


Cly 


Val 


Tyr 


Ser 


Leu 


Pro 


Thr 


Gin 


Asn 


Arg 






35 










40 










AS 








Ala 


Gin 


Lys 


Asp 


Asn 


Ala 


Glu 


Trp 


Leu 


Leu 


Thr 


Ala 


•i'hr 


Vai 


Ser 


Thr 




50 










55 










so._ 










Asn 


Gin 


Ssc 


Ma 


Asp 


The 


His 


Phe 


He 


Phe 


Asp 


Glu 


Asn 


Asn 


Arg 


Tyr 


6S 










70 










75 










80 


He 


Ala 


Arg 


Asp 


He 


Lys 


Ala 


Asn 


Gly 


Val 


Arg 


Lys 


Ser 


Thr 


Asp 


Ser 










85 










90 










95 




He 


Tyr 


Tyr 


Asp 


Ala 


Asn 


Gly 


Arq 


He 


Ser 


His 


Val 


Asp 


Leu 


Tyc 


He 








100 










105 










HO 







SUBSTITUTE SHEET (Rule 26) (RO/AU) 



tt«it ^ nii cti' ^s. ^ 'pi^'- tt^i ni 



60 



wo 99/29870 PCT/AU98/01023 
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Ser 


Phc 


Sor 


Gly 


GJy 


Gl u 


Pro 


Ala 


Leu 


Asp 


Thr 


Arg 


Phe 


Lys 


Tyr 


Thr 






115 










120 










125 








Tyr 


Arp 


Arp 


Glu 


Gly Lys 


Met 


Thr 


Val 


Arg 


Glu 


Val 


Phe 


Met 


r.cu 


Vfil 




.130 










135 










140 










Met 


Asp 


Pro 


Asn 


Thr 


Pro 


He 


Ser 


Arq 


Leu 


Glu 


Tyr 


His 


Tyr 


Asp 


Ala 


145 










150 










155 










160 


Gin Gly 


Arg 


Leu 


Thr 


His 


Trp 


He 


Ser 


Phe 


Ala 


Phe 


Gly Ala 


Glu 


Ser 










165 










170 










175 




Gln 


Lys 


Acn 


Thr 


Tvr 


His 


Tyr 


Asn 


Glu 


Lys 


Gly 


Leu 


Leu 


Vai 


Ser 


Glu 








180 










185 










190 






Vai 


Leu 


Ser 


Asn 


Ala 


Net 


Gly 


Thr 


Thr 


Tyr 


Ser 


Asp 


Thr 


Gly 


Lys 


Thr 






195 










200 










205 








Glu 


Tyr 


Ser 


Tyr 


ASD 


A«D 


Ala 


Asp 


Asn 


Hct 


Val 


L^S 


Ala 


Glu 


Tyr 


Phe 




210 










215 










220 










Val 


Val 


Gin 


Gin 


Gly 


Lys 


Ala 


Trp 


Gin 


VaI 


Leu 


Lys 


Arg 


Glu 


Glu 


Tyr 


225 










230 










235 










24Q 


Thr Tvr 


Glvi 


Asp 


Asn 


lie 


Cys 


He 


Gin 


Tyr 


Leu 


Ala 


He 


Asn 


Gly 


Thr 








2-15 










250 










255 




Afp 


Thr 


Lys 


Val 


Tyr 


Lys 


Arg 


Asp 


He 


Glu 


Ser 


Asp 


Lys 


Ser 


He 


Ser 








260 










265 










270 






Ala 


Asn 


Val 


Tle 


Asp 


He 


Pro 


Ser 


Met 


Pro 


Glu 


Gin 


Thr 


Trp 


Pro 


Asn 






275 










2B0 










285 








Met 


Tyr 


Gly 


Phe 


Asn 


Ala 


Lys 


Arg 


Leu 


Lys 


Glu 


Thr 


Tyr 


Ser 


Ser 


Tyr 




290 










2 95 










300 










Glu 


Gly 


Arp 


Va5 


Al h 


Thr 


Pro 


He 


Phe 


Asp 


Tyr 


lis 


Tyr 


Thr 


Tyr 


Lys 












310 










315 










320 


Ala 


Leu 


•i'hr 


Ser 


Met 


Ala 


Thr 


Pro 


Ser 


Thr 


Glu 


Ala 


Gin 


Val 


Ala 


Val 










325 










330 










335 




Tyr 


Leu 


Asn 


Pro 


Ser 


Thr 


Asp 


Arg 


Leu 


Val 


He 


Leu 


Ala 


Asn 


Gly 


He 








340 










345 










350 






Thr 


His 


Leu 


Ser 


Het 


Tyr 


Asp Leu 


G\n 


Gly 


Lys 


Leu 


He 


Arg 


Asp 


Cys 






355 










360 










365 








Ala 


T.eu 


Ser 


Gly 


Asp 


L-ys 


Val 


Glu 


Met 


Gly Val 


Gly 


Ser 


Leu 


Thr 


Lys 




370 










375 










380 










Giy Thr 


Tyr 


Leu 


Leu 


Lys 


Val 


Asn 


Thr 


Asp 


Gin 


Gly Ala 


Phe 


Val 


Arg 


385 










390 










395 










400 


Lys 


Val 


Val 


He 


Arg 
































405 


























INFOKHATIOH 


FOR 


SEQ 


ID UO:529 
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( \ 1 SEQUCUCE CHARACTERiSl'ICS : 

(A) LEHGTH: 2037 base pairs 
(Bl TVPE: nucleic acid 
45 IC) STRA/JDEUNESS; double 

(D) TOPOLOGY: circular 

(ii) HOLECL'LS TYPE: DHA (genomic) 

50 (it±) HYPOTHETrCAL: MO 

(ivi AMTI-SEHSE: MO 

(vi) ORIGINAL SOURCE: 
55 (A) ORGANISM: Porphyromonas gingival! s 

(ix) t*EATURE: 

(A) ^JAI■^E/^XY: misc_feature 

(B) LOCATIOK 1. . .2037 



(xi) SEQl'EMCE DESCBIPTiai: SF.Q ID MO: 529 



GTAATGGACT ACAAACTCAC TTCTCGAl"rC AAGCCCAC'30 GCGACCAGCC GGAAGCCATT 60 

CGCCAACTCG TACAGGGCAT CAACGAAGGG ATGCCG'3CTC AGACGCTGCT CGGCGTAACG 120 

65 GGl'TCGGGCA AAACTTTTAC GGTGGCTAAC GTCGTGCCGG CGGTCAATCG TCCGACCCTl' 180 

GTCCTGAGTC ACAACAAGAC CTTGGCAGCA CAGCTATACG GAGAGTTCAA AGCCTTCTTC 24 0 

CCCGAGAATG CCGTGGAGTA TTTCGl'CAGC TACTACGACT ACTATCAGCC CGAGGCCTAC 300 

CTCCCCGTCA CAGACACCTA TATCGAAAAG 6ACATGGCCA TCAACGCGGA GATCGAAAAA 360 

CTGCGATTGA GGGCCACGGC TTCGCTCCTG TCAGGGCGGA AAGATGTGCT TGTGGTCAGC 4 20 

70 TCCGTATCCT GTCTCTACGG TATGGCCAAT CCTGAAGCTT TTTCCGAAAA GGTGATCAGC 4 80 

CTGCACACGG GACAAAGGGC AGACAGGGAT CATTTTATCC GCCTGCTGGT AGAGAGCTAC 540 

TACACGAACA ATAAAGTAGA GTTCGAGAGC GGCAACTTCC GTGTCAAAGG CGACAGCGTG 600 

GACATATTCC CCGCCGTAGA AGGTTATGAC GGCGTGGCAT ACAGGGTGGA GTTTTGGGAT 660 

GGA'SAGGTCG A'-^rGGCTGAG TACCTTCGAT CCGCGAACGG GACGGGAATA CGGCCTGCTG 720 

75 TCGGAGCTGA AGATATATCC GGCCAATCTC TTCGTGACJA CTAAGGASCA GGTGGATCGG 7 80 
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GCAGTAGGGA AAATCGATGT GCATCTGGGC GCACAG5TCG AITTTCTQAA AGAAATCGGC 84 0 



(2) INFORIIATIOII FOR SHQ ID HO: 530 

(i) SEQUEUCE OIARACTERTSTICS: 

(A) LENGTH: 203-1 basG palre 
(D) TYPE: nucleic acid 

(C) STRAIIDEDHESS: double 

(D) TOPOLOGY: circular 

(11) MOLECULE TYPE; Di^A (genomic) 
(ill) HYPOTHETICAL: HO 
(iv) A14VI-SEHSE: NO 



900 



— nd. i, t. & 1 utn/-i /AO/WM V-ov»v* 

AAACCATATG AAGCCAAACG CTTGTATGAG CGGGTCACGT ATGACTTGGA AATGATCCCT -.jy 

GAGrrGGGTT ATTGTTCCGG TATAGAGAAC TATTCGCGCT ACTTCGACGG CCGTGACGCG 960 

GGCGAACGTC CTTTCTGl'CT GTTGGATTAT TTCCCG-SnGG ATTTCCTGTT GGTCATAGAC 1020 

GAAAGCCATG TAACGATACC GCAGATACGT GCCATGTACG GAGGCGATCG TTCGCGCAAG 1090 

GAGAATCTGG TCGAATACGG ATTCCGCCTG CCTGCCGCTC TCGACAATCG GCCGCTTCGC 114 0 

TTCGACGAGT TCGAAGCTCT CACCCCCCGG ACCCTTTATA TCAGTGCCAC GCCTGCCGAC 1200 

TATGAGCTGA ACAGAAGCGA AGGCGTGATC GTCGAGCAGC TGATCCGTCC GACCGGArTG 12 SO 

CTGGATCCCA TCATCGACGT CAAGCCGACG GCAAACCAAG TGGACGATCf GATGGAGGAG IJ'-'O 

ATAGCACGCT GCATCGAAAA GAAAGAGCGC GTACTGGTAA CGACCCTGAC CAAACGTATG 1380 

GCAGAGGAGC TTAGCGAATA CCTGCTACGC C^^CG-STATCA GCACCGGCTA CATACACAGC lA4n 

GATGTGGACA CGCTGGAGCG TGTGCGTATC ATGGAAGACC TGCGCAAGGG GGTCTACGAT 15^6 

GCACTCATCG GGGTGAATCT GCTCCGCGAA *5GATTGGA'JT TGCCGGAAGT 'ITCGCTTGrG 1560 

GCTATTCTGG ATGCGGATAA GGAAGGATTC CTGCGCTCGC ATCGTTCGCT CACGCAGACT 16^0 

15 GCAGGACGTG CCGCCCGGCA CATTCATGGG CGTGTCATCT TCTACGCGGA CAAGATCACC isio 

GACAGTATGC AGCTCACCAT GGACGAGACT GCACGCCGAC GCGCAAAGCA ACTGGCCTAC 17 4 0 

AACGAAGCGC AC&3CATCAC CCCCCAACAG ATAGTGAAGA ACAGTGCTGC CATTTGGGGA 1800 

GAAGGCGATG TGTCGGCCTT GCA/.TCCGAT ACAGAATCCG GTGCGTACAT AGAAGAGAGC I8 60 

AGCATGGTGG CTGCCGATCC TTTGGCCGAC TATCTGACCA AACCCAAGCT GGAAGCACTC 1^20 

ATTGCTTCGA CCAAGAAGCA AATGCTGGCA GCA'SCCAAAG AGCTGGACTT TCTGGAAGCG l^sr^ 
GCA03ACTTr GGGACGAAGC CGCACGATTG GAAAAGAAAC TGGAGCAACT CACAGCC 



60 
120 



(vl) ORIGINAL SOURCE: 

(A) ORGANISIl: Porphyromonas qinqivalls 

( j.X) FEATURE: 

(A) HAI IB/KEY: inisc_f eature 

(B) LOCATION 1...2034 

45 SEQUENCE DESCRIPTION: SEQ ID MO: 530 

ATGGACTACA AACTCACTTC TCGATTCAAG CCCACGGGCG ACCAGCCGGA AGCCATTCGC 
CAACTCGTAC AGGGCATCAA CGAAGGGATG CC&.tCTCAGA CICTGCTCGG CGTAA^GGT 

TO^'^CAAAA CTTTTACGGT GGCTAACGTG GTGGCGGCGG TCAATCGTCC GACCCTTGTC ISO 

OU CTGAGTCACA ACAAGACCTT GGCAGCACAG CTATACGGAG AGTTCAAAGC CTT'TTCCCC 24 0 

GAGAATGCGG TGGAGTATTT CGTCAGCTAC TACGACTACT ATCAGCCCGA GGCCTACCTC 300 

CCCGTCACAG ACACCTATAT CGAAAAGGAC ATGGCCATCA ACGCGGAGAT CGAAAAACTG 360 

CGATTGAGGG CCACGGCl-TC GCTCCTGTCA GGGCGGAAAG ATGTGCTTGT GGTCAGCTC- 4 20 

GTATCCTGTC TCTACGGTAT GGCCAATCCT GAAGCTTTTT CCGAAAAGGT GATCAGCCTG 4 80 

5b CACACGGGAC AAAGGGCAGA CAGGGATCAT TTTATCCGCC TGCTGGTAGA GAGCTACTAC 54 0 

ACGAACAATA AAGTAGAGTT CGAGAGCGGC AACTTCCGTG TCAAAGGCGA CAGCGTGGAC 600 

ATATTCCCCG CCGTAGAAGG TTATGACGGC GTGGCATACA GGGTGGAGTT TTGGGATGGA 6^0 

GGCTGAGTAC CTTCGATCCG CGAACGGGAC GGGAATACGG CCTGCTGTCG 720 

^fS^iS?^?'' TATATCCGGC CAA.TCTCTTC GTGACGACTA AGGAGCAGGT GGATCGGGCA 780 

^^^S^^ J^i^'^'^^^^^ TCTGGGCGCA CAGGTCGATT TTCTGAAAGA AATCGGCAAA 84 0 

^^l^V^^ CCAAACGCTT ^TTATGAGCGG GTCACGTATG ACTTGGAAAT CATCCGTGAG 900 

lZt??-l-r'n. SI^S'-''^'^'''^ AGAGAACTAT TCGCGCTACT TCGACGGCCG TGACGCGGGC 960 

GAACGTCCTT TCTGTCTGTT GGATI'ATTTC CCGGAGGATT TCCTGTTGGT CATAGACGAA 1020 

fic; ^gccatgtaa cgataccgca gatacgtgcc atgtacggag gl-gatcgttc gcg^^g^- loll 

65 aatctggtcg aatacggatt ccgoctgcct gccgctctcg acaatcggcc gcttcgcttc 114S 

^^Si^I'^ AAGCTCTCAC CCCCCGGACC CTTTATATCA GTGCCACGCC TGCCGACTAT 1200 

GAAGCGAAGG CGTGATCGTC GAGCAGCTGA TCCGTCCGAC CGGACTGCTG 126C 

TCGACGTCAA CCCGACGGCA AACCAAGTGG ACGATCTGAT GGAGGAGATA 13^0 

70 J^^r^S^S^ 1^2^"^^^ AGAGCGCGTA CTGGTAACGA CGCTGACCAA ACGTATGGCA 1380 

^^SS^Sf™ GCGAATACCT GCTACGCCAC GGTATCAGCA CCGGCTACAT ACACAGCGAT 1<40 

232?™ TGGAGCGTGT GCGTATCATG GAAGACCTGC GCAAGGGGGT CTAOSATGOA 1500 

CTCATCGCSG TGAATCTGCT CCGCGAAGGA TTCGACTTGC OSGAAGTTTC GCTTGTGGCT 1560 

^^JT^S2SI2 CGGATAAGGA AGGATTCCTG CGCTCGCATC GITCGCTCAC GCAGACTGCA 1620 

7^ in^nnJ^S^ CCCGGCACAT TCATCGGCGT GTCATCTTCT ACGCGGACAA GATCACCGAC 1680 

75 AGTATGCAGC TCACCATGGA CGAGACTGCA CGCCGACGCG CAAAGCAACT GGCCTACAAC 1740 
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2J^™;:iV ^^G'^TCCTTT laGCCGACTAT CTGAGCAAAC CCAAG^fGGA Af5CAr-rrATT 

ssss sss ssg =^ 



(2) IMFORlIATlon FOR SEQ ID HO: 531 

<i) SEQLrEHCE CHARACTERISTICS: 

(A) LENGTH: 67 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGV: linear 

MOLECULF. TYPR: protein 

(iiij HVPOTHETICAL: YES 



20 



25 



(vi) ORIGINAL SOURCF.: 

(AJ ORGAIirSM: Porphyromonae glnglvalls 
(Ix) FEATURE: 

(A) NAI1E/KF.Y: mlsc feature 
( E } LOCATrO» 1 . . . 67 9 

(:<i) SliQUENCE DESCRIPTION: SEQ ID tIO:531 
Val Met A.p Tyr Lys Leu Thr Ser Arq Phe Ly. Pro Thr Gly A^p Gin 
30 Pro Glu Ala lie Arg Gin Leu Val Gin Gl^y He A.n Glu GXy Ket Pro 
Ala Gin Thr Leu Leu Gly Val Thr c!y Ser Gly Ly. Thr Phe Thr Val 

35 so" Leu val Leu S«r Hi. 

A.n Ly. Thr Leu Ala Ala Gin Leu Tyr Gly Glu Phe Lys Ala Phe Phe 

Pro Glu Asn Ala Val Glu Tyr Phe Val Ser ^yr Tyr Asp Tyr Tyr Gin 

40 Pro Glu Ala Tyr Leu Pro Val Thr A.p |hr Tyr He Glu Ly. Asp I.et 



45 



75 



Ala II. Asn Ala Glu He Glu Ly. l;u Arg Leu Axg Ala T^r Ala Ser 



Leu Leu S.r Gly Arg Ly3 Asp vll Leu Val Val Ser Ser Val Ser Cy. 



135 



L.u Tvr Gly Het Ala A^n Vrl Glu Al. Pi.e Ssr gJu Ly,. Val He 3=r 

155 



L.U HI. Thr Gly Gin Ar, Ala A,p Arc, A=p mL Ph, Xle Ar, L,u Leu 
50 val Glu ser Tyr Tyr Thr A.n Asa Lys Glu Phe Glu Ser Gly A.n 

Phe Arg Va, Ly., Gly A.p s,r Val i^p xie Phe Pre Ala Glu Gly 

55 ''''' "=1" rrp A.P G?y Glu va, Glu 

Arg Leu Ser Thr Phe A.p Pr^ Arg Thr Gly Arg Glu Tyr Gly Leu Leu 
ser Glu Leu Lys lie Tyr Pro Ala A.n Leu Phe Val Thr Thr Lys «u 
60 Gin val A.P Arg Ala Val Gly L-.s lie Val A.p Leu Gly III Gin 

val Asp Phe Leu Lys Glu He Gly Ly.' Pro Tyr Glu Ala Lys Arg Leu 



65 tl^'o Arg G?u Leu Gly Tyr 

Cj-. ser Gly He Glu Asn T^r Ser Arg Tyr Phe Asp Gly Arg Asp Ala 
Gly Glu Arg Pro Phe Cys Leu Leu A.p Tyr P^e Pro Glu Asp Ph. f.eu 
70 Leu val lie Asp Glu Ser His Val Thr Pro Gin He Arg III „e. 

Tyr Gly Gly Asp Arg Ser Arg Lys Glu Asn Lc-u Val Glu Tyr Gly Phe 



Arg Leu Pro Ala Ala Leu Asp As"; Arg Pro Leu Arg Phe Asp G,u Phe 



1 800 
1860 
1920 
1990 
203J 



380 
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Glu Ala Leu Thr Pro A 
3B5 



rg Thr Leu Tyr lie Ser Ala Thr Pro Ai 



a Asp 

4 00 
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45 



50 



55 



395 

Tyr Glu Leu /v^n Arg Ser Glu Gly Val lie Val Glu Gin Leu He Arc. 

''OS 4is 
Pr:> Thr Gly Leu Leu Asp Pro He He A^p V^l Lys Pro Thr A) a A.-n 

425 

Gin Val A^p Asp Leu Met Glu Glu He Ala Arg Cys He Glu Lys Lys 

Glu Arg Val L^^u Vnl Thr Thr Leu Thr Lys Arg |.It:L Vll Glu Glu Leu 

ser Glu Tyr Leu Leu Arg His Gly He Ser Thr Gly Tyr He His Ser 

Asp Val Asp Thr Lou GJ u Arg Val Arg He Ilet Glu A*.'p Leu Arg Lys 

'^^^ 490 4o| 

i>ly V,-i-! Tyr Asp Ala Leu He Gly Val Am Leu Leu Arg Glu Gly Leu 

1>0S 52(1 
Asp Leu Pro Glu Val Ser L^u Val Ala He Leu Asp Ala Asp Lys Glu 

=1- £20 
Gly Phe Leu Arg Ser His Arg Ser Leu Thr Gin Thr Ala Gly Arg Ala 

Ala Arg His He Hlc Gly Arg Val He Phe Tyr Ala Asp Lys He Thr 

555 Rgn 
Asp Ser Met Gin Leu Thr Met Asp Glu Thr Ala Arg Arg Arg Ala E s 

--55 ' 
Gin Leu Ala Tyr Asn Glu Ala His Gly He Thr Pro Gin Gin He Val 

£85 500 
Lys A^n Ser Ala Ala He Trp Gly Glu Gly Asp Val Ser Ala Leu Gin 

S9S SCO 
Ser Asp Thr Glu S-r Gly Ala Tyr He Glu Glu Ser Scr Met Val Ala 

Ala Asp Pro Leu Ala A^-p Tyr Leu Ser Lys Pro Lys Leu Glu Ala L-^u 

°-^0 635 
He Ala Ser Thr Lys Lys Gin Met Leu Ala Ala Ala Lys Glu Leu A-p 

Phe Leu Giu Ala Ala Arg Leu Arg Asp Glu AI 

6*>0 665 
Lys Leu Glu Gin Leii Thr Ala 
67 5 



655 

Ala Arg Leu Glu Lys 
670 



iZ) IHFORJtATIOt^ FOR SEQ ID 1*0:532 

(i) SEQUEIICE CHARACTERISTICS: 

(A) LENGTH: 678 amino acids 
(D) TypK: amino acid 
(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE; protein 

(ill) HYPOTHETICAL: 1X0 

{vi] ORIGINAL SOURCE; 

lA> ORGAHISM: Porphyromonas gingival is 

(ix) FEATURE: 

(A) NAME/KEY: misc feature 

(B) LOCATION 1. . .678 



(Xl) SEQUEIICE DESCRIPTIorvf: SEQ ID MO:53;> 



60 



65 



70 



75 



Met 


Asp 


Tyr 


Lys 


Leu 


Thr 


Ser 


Arg 


Phe 


Lys 


Pro 


Glu 








5 










10 




Ala 


He 


Arg 


Gin 


Leu 


Val 


Gin 


Gly 


He 


Asn 


Gin 






20 










25 




Thr 


Leu 


Leu 


Gly Val 


Thr 


Gly 


Ser 


Gly 


Lys 






35 










40 






Asn 


Val 


Val 


Ala 


Ala 


Val 


Asn 


Arg 


Pro 


Thr 


Leu 




50 










55 








Lys 


Thr 


Leu 


Ala 


Ala 


Gin 


Leu 


Tyr 


Gly 


Glu 


Phe 


65 










7 0 








75 


Glu 


Asn 


Ala 


Val 


Glu 


Tyr 


Phe 


Val 


Ser 


Tyr 


Tyr 


Glu 








85 










90 


Ala 


r/r 


Lou 


Pro 


Val 


Thr 


Asp 


Thr 


Tyr 


He 


He 






100 










105 






Asp 


Ala 
115 


Glu 


He 


Glu 




Leu 


Arg 


Leu 


Arg 














120 







60 



15 



Ser His A-n 



ys Ala Phe Phe Pro 

80 

sp Tyr Tyr Gin Pro 
95 

lu Lys Affp Met Ala 
110 

la Thr Ala S^r Leu 
125 
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70 



Leu 


Ser 


Gly 


Arg 


Lys 


Asp Val 


Leu 


Val 


Val 


Ser 


Ser 


Val 


Ser 


Cys 


LeU 




130 










135 










14 0 










Tyr 


Gly 


Met 


Al a 


Asn 


Pre 


Glu 


Al a 


Phe 


Ser 


Giu 


Lys 


Val 


He 


Ser 


Leu 


1 <1 5 










150 










155 










160 


His 


Thr 


laly 


Gin 


Arg 


Ala 


Asp 


Arg 


Asp 


His 


Phc 


He 


Arg 


Leu 


Leu 


/al 










1 65 










I/O 










175 




Glu 


Ser 


Tyr 


Tyr 


Thr 


Asn 


TVsn 


Lys 


Val 


Glu 


Phe 


Glu 


Ser 


Giy Asn 


Phe 








180 










185 










190 






Arg 


Val 


Lys 


Gly 


Asp 


Ser 


Val 


Asp 


He 


Phe 


Pro 


Al a 


Val 


Glu 


Gly 


Tyr 
















200 










205 








Asp 


Gly 


Val 


Ala 


Tyr 


Arg 


Val 


Glu 


Phe 


Trp 


Asp 


t»ly 


Glu 


Val 


Glu 


Arg 




210 










215 










220 










r*eu 


Set 


Tnr 


Pne 


Asp 


Pro Arg 


Thr 


Gly Arq Glu 


Tyr 


Gly Leu 


Leu 


Scr 


225 










230 










235 












Glu 


Leu 


Lys 


He 


Tyr 


Pro 


Ala 


Asn 


Leu 


Phe 


Val 


Thr 


Thr 


Lys 


Glu 


Gin 










245 










250 










255 




Val 


Asp 


Arg 


Ala 


Val 


Gly 


Ly£ 


He 


Asp Val 


Asp 


Leu 


Gly Ala 


Gin 


Val 








260 










265 










270 






Arp 


Phc 


Leu 


Lys 


Glu 


He 


Gly 


Lys 


Pro 


Tyr 


Glu 


Ala 


Lys 


Arg 


Leu 


Tyr 






275 










2ao 










285 








Glu 


Arq 


Val 


Thr 


Tyr 


Asp 


Leu 


Glu 


>let 


He 


Arg 


Glu 


Leu 


G\y 


Tyr 


Cys 




290 










295 










300 










Ser 


Giy 


He 


Glu 


Asn 


Tyr 


Ser 


Axg 


Tyr 


Phe 


Asp 


Gly 


Arg 


Asp 


Ala 


Gly 


30i> 










310 










315 










320 


GLu 


Arg 


fro 


Phe 


Cys 


Leu 


Leu 


Asp 


Tyr 


Phe 


Pro 


Glu 


A?p 


Phe 


Leu 


Leu 










325 










330 










335 




Val 


He 


Asp 


Glu 


Ser 


His 


Val 


Thr 


He 


Pro 


Gin 


He 


Arg 


Ala 


Met 


Xyr 








340 










345 










350 






Gly 


Gly 


Asp 


Arg 


Ser Arg 


Lys 


Glu 


Asn 


Leu 


Val 


Glu 


Tyr 


Gly 


Phe 


Arg 






355 










360 










365 








Leu 


Pro 


Ala 


Ala 


Leu 


Asp 


Asn 


Arg 


Pro 


Leu Arg 


Phe 


Asp 


Glu 


Phe 


Glu 




370 










375 










380 










Ala 


Leu 


Thr 


Pro 


Arg Thr 


Leu 


Tyr 


He 


Ser 


Ala 


Thr 


Pro 


Ala 


Asp 


Tyr 


365 










390 










395 










4 00 


Glu 


Leu 


Asn 


Arq 


Ser 


Glu 


Gly Val 


He 


Val 


Glu 


Gin 


Lsu 


He 


Arg 


Pro 










405 










410 










415 




Thr 


Gly 


Leu 


Leu 


Asp 


Pro 


He 


He 


Asp 


Val 


Lys 


Pro 


Thr 


Ala 


Asn 


Gin 








420 










425 










430 






Val 


Asp 


Aap 


Leu 


Het 


Glu 


Glu 


He 


Ala 


Arg 


Cys 


He 


Glu 


Lys 


Lys 


Glu 






435 










440 










445 








Aro 


Val 


Leu 


Val 


Thr 


Thr 


Leu 


Thr 


Lys 


Arg 


Met 


Ala 


Glu 


Glu 


Leu 


Ser 




450 










455 










4 60 










Glu 


Tyr 


Leu 


Leu 


Arg 


His 


Gly 


lie 


Ser 


Thr 


Gly 


Tyr 


He 


His 


Ser 


Asp 


4 S5 










410 










475 












Val 


Asp 


Thr 


Leu 


Glu 


Arg 


Val 


Arg 


He 


Met 


Glu 


Acp 


Leu 


Arg 


Lys 


Gl y 










405 










490 










4 95 




Vol 


Tyr 


Asp 


Ala 


Leu 


He 


Gly Val 


Asn 


Leu 


Leu 


Arg 


Glu 


Gly 


Leu 


Asp 








500 










505 










510 






Leu 


fc*ro 


Glu 


Val 


Sar 


Leu 


Val 


Ala 


He 


Leu 


Asp 


Ala 


Asp 


Lys 


Glu 


Gly 






515 










520 










525 








Phs 


Lcii 


Arg 


Ser 


His 


Arg 


Ser 


Leu 


Thr 


Gin 


Thr 


Ala 


Gly Arq 


Ala 


Ala 




530 










535 










540 










Arg 


His 


He 


Hie 


Gly Arg 


Val 


He 


Phe 


Tyr 


Ala 


Asp 


Lys 


He 


Thr 


Asp 


545 










550 










555 










560 


Ser 


Met 


Gin 


Leu 


Thr 


l-let 


Asp Glu 


Thr 


Ala 


Arg 


Arg 


Arg 


Ala 


Lys 


Gin 










565 










570 










575 




Leu 


Ala 


Tyr 


Asn 


Glu 


Ala 


His 


Gly 


He 


Thr 


Pro 


Gin 


Gin 


He 


Val 


Lys 








580 










585 










590 






Asn 


Ser 


Ala 


Ala 


lie Trp Gly Glu 


Gly Aflp 


Val 


Ser 


Ala 


Leu 


Gin 


Ser 






595 










600 










605 








Ajfp 


Thr 


Glu 


Ser 


Gly Ala 


Tyr 


He 


Glu 


Glu 


Ser 


Ser 


Het 


Val 


Ala 


Ala 




€10 










615 










620 










Asp 


Pro 


Leu 


Ala 


Asp 


Tyr 


Leu 


Scr 


Lys 


Pro 


Lys 


L-u 


Glu 


Ala 


Leu 


He 


625 










630 










635 










64 0 


Ala 


Ser 


Thr 


Lys 


Lys 


Gin 


Het 


Leu 


ALk 


Ala 


Ala 


Lys 


Glu 


Leu 


Asp 


Phc 










645 










650 










€55 




Leu 


Glu 


Ala 


Ala 


Arg 


Leu 


Aig 


Asp 


Glu 


Ala 


Ala 


Arg 


Leu 


Glu 


Lys 


Lys 








660 










665 










67 0 






Leu 


Glu 


Gin 


Leu 


Thr 


Ala 
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